Computer Science > Computer Vision and Pattern Recognition

arXiv:2408.01548 (cs)

[Submitted on 2 Aug 2024]

Title:Trainable Pointwise Decoder Module for Point Cloud Segmentation

Authors:Bike Chen, Chen Gong, Antti Tikanmäki, Juha Röning

Abstract:Point cloud segmentation (PCS) aims to make per-point predictions and enables robots and autonomous driving cars to understand the environment. The range image is a dense representation of a large-scale outdoor point cloud, and segmentation models built upon the image commonly execute efficiently. However, the projection of the point cloud onto the range image inevitably leads to dropping points because, at each image coordinate, only one point is kept despite multiple points being projected onto the same location. More importantly, it is challenging to assign correct predictions to the dropped points that belong to the classes different from the kept point class. Besides, existing post-processing methods, such as K-nearest neighbor (KNN) search and kernel point convolution (KPConv), cannot be trained with the models in an end-to-end manner or cannot process varying-density outdoor point clouds well, thereby enabling the models to achieve sub-optimal performance. To alleviate this problem, we propose a trainable pointwise decoder module (PDM) as the post-processing approach, which gathers weighted features from the neighbors and then makes the final prediction for the query point. In addition, we introduce a virtual range image-guided copy-rotate-paste (VRCrop) strategy in data augmentation. VRCrop constrains the total number of points and eliminates undesirable artifacts in the augmented point cloud. With PDM and VRCrop, existing range image-based segmentation models consistently perform better than their counterparts on the SemanticKITTI, SemanticPOSS, and nuScenes datasets.

Comments:	No comments
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2408.01548 [cs.CV]
	(or arXiv:2408.01548v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.01548

Submission history

From: Bike Chen [view email]
[v1] Fri, 2 Aug 2024 19:29:35 UTC (4,858 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Trainable Pointwise Decoder Module for Point Cloud Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Trainable Pointwise Decoder Module for Point Cloud Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators