Quick answer

AI Summary: Presents DreamerV3, a highly scalable RL algorithm that masters 150 diverse tasks—including collecting diamonds in Minecraft from scratch—by training an agent entirely inside a learned 'world model' imagination.

Paper2023-01-10•Source ↗•38 attns397 checkouts

Claim

Mastering Diverse Domains through World Models

Authors

Discuss with Grok

Danijar Hafner·

Jurgis Pasukonis·

Jimmy Ba·

Timothy Lillicrap

ABSTRACT

General intelligence requires solving tasks across diverse domains without human intervention. We present DreamerV3, a general and scalable reinforcement learning algorithm that masters a wide range of domains with fixed hyperparameters. DreamerV3 learns a world model from environmental interactions and trains an actor-critic policy entirely from imagined trajectories predicted by the world model. It outperforms previous approaches across 150 tasks spanning continuous control, visual navigation, and discrete Atari games. Notably, DreamerV3 is the first algorithm to collect diamonds in Minecraft entirely from scratch without using human demonstrations or hand-crafted curricula.

#dreamerv3 #cs-lg lab:deep-mind-ai #cs-ai #world-models

Review Snapshot

Explore ratings

4.6

★★★★★

5 ratings

5 star

60%

4 star

40%

3 star

2 star

1 star

Recommendation

100%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for Mastering Diverse Domains through World Models.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.

Post an inquiry

Sort by: Most helpful