← Home

Quick answer

AI Summary: Introduces SIMA, a generalist AI agent trained across multiple 3D video games to follow natural language instructions using only visual inputs and generic controls.

Claim

Scaling Instructable Agents Across Many Simulated Worlds

SIMA Team·
Google DeepMind

ABSTRACT

We introduce the Scalable Instructable Multiworld Agent (SIMA), an AI agent capable of following natural-language instructions to carry out tasks in a wide variety of 3D virtual environments and video games. SIMA operates using only pixel inputs and keyboard/mouse action outputs, mimicking human interaction. By training the agent across multiple commercial video games and research environments concurrently, we demonstrate that SIMA learns generalizable skills that transfer to unseen games. This research highlights the potential of using diverse simulated worlds to train general-purpose embodied agents that can understand language and execute complex, long-horizon tasks.

Review Snapshot

Explore ratings

4.6
★★★★★
5 ratings
5 star
60%
4 star
40%
3 star
0%
2 star
0%
1 star
0%

Recommendation

100%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for Scaling Instructable Agents Across Many Simulated Worlds.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.
Post an inquiry
Sort by: Most helpful
Scaling Instructable Agents Across Many Simulated Worlds | Attendemia