Computer Science > Robotics

arXiv:2403.13777 (cs)

[Submitted on 20 Mar 2024 (v1), last revised 13 Nov 2024 (this version, v2)]

Title:Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation

Authors:Hugues Thomas, Mouli Sivapurapu, Jian Zhang

Abstract:This paper presents the Embedding Pose Graph (EPG), an innovative method that combines the strengths of foundation models with a simple 3D representation suitable for robotics applications. Addressing the need for efficient spatial understanding in robotics, EPG provides a compact yet powerful approach by attaching foundation model features to the nodes of a pose graph. Unlike traditional methods that rely on bulky data formats like voxel grids or point clouds, EPG is lightweight and scalable. It facilitates a range of robotic tasks, including open-vocabulary querying, disambiguation, image-based querying, language-directed navigation, and re-localization in 3D environments. We showcase the effectiveness of EPG in handling these tasks, demonstrating its capacity to improve how robots interact with and navigate through complex spaces. Through both qualitative and quantitative assessments, we illustrate EPG's strong performance and its ability to outperform existing methods in re-localization. Our work introduces a crucial step forward in enabling robots to efficiently understand and operate within large-scale 3D spaces.

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2403.13777 [cs.RO]
	(or arXiv:2403.13777v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2403.13777

Submission history

From: Hugues Thomas [view email]
[v1] Wed, 20 Mar 2024 17:41:21 UTC (5,516 KB)
[v2] Wed, 13 Nov 2024 23:46:42 UTC (5,516 KB)

Computer Science > Robotics

Title:Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators