Quick answer

AI Summary: Introduces Hindsight Experience Replay (HER), an elegant RL technique that allows agents to learn efficiently from sparse rewards by retroactively treating failures as successful achievements of alternative goals.

Paper2017-07-05•Source ↗•15 attns140 checkouts

Claim

Hindsight Experience Replay

Authors

Discuss with Grok

Marcin Andrychowicz·

Filip Wolski·

Alex Ray·

Jonas Schneider·

Rachel Fong·

Peter Welinder·

Bob McGrew·

Josh Tobin·

Pieter Abbeel·

Wojciech Zaremba

ABSTRACT

Dealing with sparse rewards is one of the biggest challenges in Reinforcement Learning (RL). We present a novel technique called Hindsight Experience Replay (HER) which allows sample-efficient learning from rewards which are sparse and binary. HER can be combined with any off-policy RL algorithm and is applicable whenever there are multiple goals that can be achieved. The core idea is to replay past experiences with the goals that were actually achieved, rather than the original intended goals. This allows the agent to learn from failure, recognizing that even if it failed its intended task, it successfully learned how to achieve the state it ended up in.

#cs-lg #robotics company:openai-research #sparse-rewards #reinforcement-learning

Review Snapshot

Explore ratings

4.6

★★★★★

5 ratings

5 star

60%

4 star

40%

3 star

2 star

1 star

Recommendation

100%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for Hindsight Experience Replay.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.

Post an inquiry

Sort by: Most helpful