Quick answer
AI Summary: Documents the famous 'Hide and Seek' experiment, where AI agents naturally discovered complex tool use, barricade building, and physics exploits through competitive multi-agent reinforcement learning.
AI Summary: Documents the famous 'Hide and Seek' experiment, where AI agents naturally discovered complex tool use, barricade building, and physics exploits through competitive multi-agent reinforcement learning.
We demonstrate that simple multi-agent competition can drive the emergence of highly complex, intelligent behaviors without explicit human design. We train agents using reinforcement learning to play a physics-based game of hide-and-seek in a simulated 3D environment. Through millions of episodes of competitive self-play, the agents naturally develop an 'autocurriculum' of increasingly sophisticated strategies. The hiding agents learn to use physical tools, such as moving boxes to build barricades and locking ramps in place, while the seeking agents learn to overcome these defenses by using ramps to jump over walls or 'surfing' on boxes. This work provides compelling evidence that multi-agent co-adaptation is a scalable path to general intelligence.
Share your opinion to help other learners triage faster.
Write a reviewInvite someone by email to share an invited review for Emergent Tool Use From Multi-Agent Autocurricula.