Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.14702 (cs)

[Submitted on 23 Oct 2023 (v1), last revised 7 Dec 2023 (this version, v2)]

Title:BM2CP: Efficient Collaborative Perception with LiDAR-Camera Modalities

Authors:Binyu Zhao, Wei Zhang, Zhaonian Zou

View PDF

Abstract:Collaborative perception enables agents to share complementary perceptual information with nearby agents. This would improve the perception performance and alleviate the issues of single-view perception, such as occlusion and sparsity. Most existing approaches mainly focus on single modality (especially LiDAR), and not fully exploit the superiority of multi-modal perception. We propose a collaborative perception paradigm, BM2CP, which employs LiDAR and camera to achieve efficient multi-modal perception. It utilizes LiDAR-guided modal fusion, cooperative depth generation and modality-guided intermediate fusion to acquire deep interactions among modalities of different agents, Moreover, it is capable to cope with the special case where one of the sensors, same or different type, of any agent is missing. Extensive experiments validate that our approach outperforms the state-of-the-art methods with 50X lower communication volumes in both simulated and real-world autonomous driving scenarios. Our code is available at this https URL.

Comments:	14 pages, 8 figures. Accepted by CoRL 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2310.14702 [cs.CV]
	(or arXiv:2310.14702v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.14702

Submission history

From: Binyu Zhao [view email]
[v1] Mon, 23 Oct 2023 08:45:12 UTC (40,615 KB)
[v2] Thu, 7 Dec 2023 04:42:07 UTC (40,615 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:BM2CP: Efficient Collaborative Perception with LiDAR-Camera Modalities

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:BM2CP: Efficient Collaborative Perception with LiDAR-Camera Modalities

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators