Quick answer

AI Summary: GSR teaches robots to decompose manipulation tasks into logical sub-goals based on object affordances.

Paper · 2026-02-10

GSR: Learning Structured Reasoning for Embodied Manipulation

Authors

Kewei Hu · Michael Zhang · Hanwen Kang

ABSTRACT

We introduce Grounded Scene-graph Reasoning (GSR), a structured reasoning paradigm that explicitly models world-state evolution as transitions over semantically grounded scene graphs. By reasoning step-wise over object states and spatial relations, rather than mapping perception to actions, GSR enables explicit reasoning about preconditions and consequences. We construct Manip-Cognition-1.6M, a large-scale dataset to supervise world understanding and action planning. Evaluations across RLBench and real-world tasks show that GSR significantly improves zero-shot generalization and long-horizon task completion over prompting-based baselines by treating scene graphs as the primary state space for decision making.
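To make the idea of "transitions over scene graphs" concrete, here is a minimal sketch, not the authors' implementation. It assumes a scene graph can be approximated as a set of (subject, relation, object) facts, and that each action carries explicit preconditions and add/delete effects. All names (`Fact`, `Action`, `apply_action`, `rollout`) and the drawer/cup scenario are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass

# Assumption: a scene graph is a set of (subject, relation, object) facts.
# Object states are encoded the same way, e.g. ("drawer", "state", "closed").
Fact = tuple[str, str, str]


@dataclass(frozen=True)
class Action:
    """Hypothetical symbolic action with explicit preconditions and effects."""
    name: str
    preconditions: frozenset[Fact]
    add: frozenset[Fact]
    delete: frozenset[Fact]


def apply_action(graph: frozenset[Fact], action: Action) -> frozenset[Fact]:
    """Return the next scene graph, or raise if a precondition is unmet."""
    missing = action.preconditions - graph
    if missing:
        raise ValueError(f"{action.name}: unmet preconditions {sorted(missing)}")
    return (graph - action.delete) | action.add


def rollout(graph: frozenset[Fact], plan: list[Action]) -> frozenset[Fact]:
    """Apply a plan step by step, checking preconditions at every step."""
    for action in plan:
        graph = apply_action(graph, action)
    return graph


scene = frozenset({
    ("drawer", "state", "closed"),
    ("cup", "on", "table"),
    ("gripper", "state", "empty"),
})

open_drawer = Action(
    "open(drawer)",
    preconditions=frozenset({("drawer", "state", "closed"), ("gripper", "state", "empty")}),
    add=frozenset({("drawer", "state", "open")}),
    delete=frozenset({("drawer", "state", "closed")}),
)
pick_cup = Action(
    "pick(cup)",
    preconditions=frozenset({("cup", "on", "table"), ("gripper", "state", "empty")}),
    add=frozenset({("gripper", "holding", "cup")}),
    delete=frozenset({("cup", "on", "table"), ("gripper", "state", "empty")}),
)
place_cup = Action(
    "place(cup, drawer)",
    preconditions=frozenset({("gripper", "holding", "cup"), ("drawer", "state", "open")}),
    add=frozenset({("cup", "in", "drawer"), ("gripper", "state", "empty")}),
    delete=frozenset({("gripper", "holding", "cup")}),
)

final = rollout(scene, [open_drawer, pick_cup, place_cup])
```

In this toy setup, a plan that skips `open_drawer` fails at `place_cup` because the fact `("drawer", "state", "open")` is missing from the graph. That is the kind of precondition check an explicit state representation makes possible, in contrast to mapping perception to actions without an intermediate state.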

#robotics #cs-ro #cs-ai
