← Home

Quick answer

AI Summary: Provides a dense, differentiable scene representation that allows for better trajectory planning.

Claim

Towards Physically Executable 3D Gaussian for Embodied Navigation

Authors
Wancai Zheng·
Hao Chen·
Xinyi Yu

ABSTRACT

3D Gaussian Splatting (3DGS) is a photorealistic rendering method but lacks semantics and physical executability for Visual-Language Navigation (VLN). We propose SAGE-3D, a paradigm that upgrades 3DGS into an executable environment. It comprises Object-Centric Semantic Grounding, adding fine-grained annotations, and Physics-Aware Execution Jointing, which embeds collision objects into 3DGS. Experiments show that 3DGS scene data, while difficult to converge, exhibits strong generalizability, improving baseline performance by 31% on VLN-CE Unseen tasks. We provide a dense, differentiable scene representation that allows for significantly better trajectory planning in complex indoor environments.

Review Snapshot

Explore ratings

0.0
★★★★★
0 ratings
5 star
0%
4 star
0%
3 star
0%
2 star
0%
1 star
0%

Recommendation

0%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for Towards Physically Executable 3D Gaussian for Embodied Navigation.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.
Post an inquiry
Sort by: Most helpful