← Home

Quick answer

AI Summary: Documents the famous 'Hide and Seek' experiment, where AI agents naturally discovered complex tool use, barricade building, and physics exploits through competitive multi-agent reinforcement learning.

Claim

Emergent Tool Use From Multi-Agent Autocurricula

Bowen Baker·
Ingmar Kanitscheider·
Todor Markov·
Yi Wu·
Glenn Powell·
Bob McGrew·
Igor Mordatch

ABSTRACT

We demonstrate that simple multi-agent competition can drive the emergence of highly complex, intelligent behaviors without explicit human design. We train agents using reinforcement learning to play a physics-based game of hide-and-seek in a simulated 3D environment. Through millions of episodes of competitive self-play, the agents naturally develop an 'autocurriculum' of increasingly sophisticated strategies. The hiding agents learn to use physical tools, such as moving boxes to build barricades and locking ramps in place, while the seeking agents learn to overcome these defenses by using ramps to jump over walls or 'surfing' on boxes. This work provides compelling evidence that multi-agent co-adaptation is a scalable path to general intelligence.

Review Snapshot

Explore ratings

4.6
★★★★★
5 ratings
5 star
60%
4 star
40%
3 star
0%
2 star
0%
1 star
0%

Recommendation

100%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for Emergent Tool Use From Multi-Agent Autocurricula.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.
Post an inquiry
Sort by: Most helpful
Emergent Tool Use From Multi-Agent Autocurricula | Attendemia