Defending Against Intent Redirection in Multi-Agent Communication Protocols
Paper • Feb 24, 2026 • arXiv • Sarah Chen, David Wagner, Dawn Song
As autonomous agents increasingly communicate via unstructured natural language, they become vulnerable to 'Intent Redirection'—a novel class of adversarial attacks where a malicious agent hijacks ...