Quick answer

AI Summary: Documents the famous 'Hide and Seek' experiment, where AI agents naturally discovered complex tool use, barricade building, and physics exploits through competitive multi-agent reinforcement learning.

Paper2019-09-17•Source ↗•21 attns378 checkouts

Claim

Emergent Tool Use From Multi-Agent Autocurricula

Authors

Discuss with Grok

Bowen Baker·

Ingmar Kanitscheider·

Todor Markov·

Yi Wu·

Glenn Powell·

Bob McGrew·

Igor Mordatch

ABSTRACT

We demonstrate that simple multi-agent competition can drive the emergence of highly complex, intelligent behaviors without explicit human design. We train agents using reinforcement learning to play a physics-based game of hide-and-seek in a simulated 3D environment. Through millions of episodes of competitive self-play, the agents naturally develop an 'autocurriculum' of increasingly sophisticated strategies. The hiding agents learn to use physical tools, such as moving boxes to build barricades and locking ramps in place, while the seeking agents learn to overcome these defenses by using ramps to jump over walls or 'surfing' on boxes. This work provides compelling evidence that multi-agent co-adaptation is a scalable path to general intelligence.

#multi-agent-systems #cs-lg #emergent-behavior company:openai-research #reinforcement-learning

Review Snapshot

Explore ratings

4.6

★★★★★

5 ratings

5 star

60%

4 star

40%

3 star

2 star

1 star

Recommendation

100%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for Emergent Tool Use From Multi-Agent Autocurricula.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.

Post an inquiry

Sort by: Most helpful