← Home

Quick answer

AI Summary: Introduces Hindsight Experience Replay (HER), an elegant RL technique that allows agents to learn efficiently from sparse rewards by retroactively treating failures as successful achievements of alternative goals.

Claim

Hindsight Experience Replay

Marcin Andrychowicz·
Filip Wolski·
Alex Ray·
Jonas Schneider·
Rachel Fong·
Peter Welinder·
Bob McGrew·
Josh Tobin·
Pieter Abbeel·
Wojciech Zaremba

ABSTRACT

Dealing with sparse rewards is one of the biggest challenges in Reinforcement Learning (RL). We present a novel technique called Hindsight Experience Replay (HER) which allows sample-efficient learning from rewards which are sparse and binary. HER can be combined with any off-policy RL algorithm and is applicable whenever there are multiple goals that can be achieved. The core idea is to replay past experiences with the goals that were actually achieved, rather than the original intended goals. This allows the agent to learn from failure, recognizing that even if it failed its intended task, it successfully learned how to achieve the state it ended up in.

Review Snapshot

Explore ratings

4.6
★★★★★
5 ratings
5 star
60%
4 star
40%
3 star
0%
2 star
0%
1 star
0%

Recommendation

100%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for Hindsight Experience Replay.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.
Post an inquiry
Sort by: Most helpful