← Home

Quick answer

AI Summary: Presents DreamerV3, a highly scalable RL algorithm that masters 150 diverse tasks—including collecting diamonds in Minecraft from scratch—by training an agent entirely inside a learned 'world model' imagination.

Claim

Mastering Diverse Domains through World Models

Danijar Hafner·
Jurgis Pasukonis·
Jimmy Ba·
Timothy Lillicrap

ABSTRACT

General intelligence requires solving tasks across diverse domains without human intervention. We present DreamerV3, a general and scalable reinforcement learning algorithm that masters a wide range of domains with fixed hyperparameters. DreamerV3 learns a world model from environmental interactions and trains an actor-critic policy entirely from imagined trajectories predicted by the world model. It outperforms previous approaches across 150 tasks spanning continuous control, visual navigation, and discrete Atari games. Notably, DreamerV3 is the first algorithm to collect diamonds in Minecraft entirely from scratch without using human demonstrations or hand-crafted curricula.

Review Snapshot

Explore ratings

4.6
★★★★★
5 ratings
5 star
60%
4 star
40%
3 star
0%
2 star
0%
1 star
0%

Recommendation

100%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for Mastering Diverse Domains through World Models.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.
Post an inquiry
Sort by: Most helpful