Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3343031.3351042acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

SRINet: Learning Strictly Rotation-Invariant Representations for Point Cloud Classification and Segmentation

Published: 15 October 2019 Publication History

Abstract

Point cloud analysis has drawn broader attentions due to its increasing demands in various fields. Despite the impressive performance has been achieved on several databases, researchers neglect the fact that the orientation of those point cloud data is aligned. Varying the orientation of point cloud may lead to the degradation of performance, restricting the capacity of generalizing to real applications where the prior of orientation is often unknown. In this paper, we propose the point projection feature, which is invariant to the rotation of the input point cloud. A novel architecture is designed to mine features of different levels. We adopt a PointNet-based backbone to extract global feature for point cloud, and the graph aggregation operation to perceive local shape structure. Besides, we introduce an efficient key point descriptor to assign each point with different response and help recognize the overall geometry. Mathematical analyses and experimental results demonstrate that the proposed method can extract strictly rotation-invariant representations for point cloud recognition and segmentation without data augmentation, and outperforms other state-of-the-art methods.

References

[1]
Tolga Birdal and Slobodan Ilic. 2015. Point pair features based object detection and pose estimation revisited. In 2015 International Conference on 3D Vision. IEEE, 527--535.
[2]
Tolga Birdal and Slobodan Ilic. 2017. Cad priors for accurate and flexible instance reconstruction. In Proceedings of the IEEE International Conference on Computer Vision. 133--142.
[3]
Michael M Bronstein, Joan Bruna, Yann LeCun, Arthur Szlam, and Pierre Vandergheynst. 2017. Geometric deep learning: going beyond euclidean data. IEEE Signal Processing Magazine 34, 4 (2017), 18--42.
[4]
Haowen Deng, Tolga Birdal, and Slobodan Ilic. 2018. Ppf-foldnet: Unsupervised learning of rotation invariant 3d local descriptors. In Proceedings of the European Conference on Computer Vision (ECCV). 602--618.
[5]
Bertram Drost, Markus Ulrich, Nassir Navab, and Slobodan Ilic. 2010. Model globally, match locally: Efficient and robust 3D object recognition. In 2010 IEEE computer society conference on computer vision and pattern recognition. Ieee, 998--1005.
[6]
Yifan Feng, Zizhao Zhang, Xibin Zhao, Rongrong Ji, and Yue Gao. 2018. GVCNN: Group-view convolutional neural networks for 3D shape recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 264--272.
[7]
Pedro Hermosilla, Tobias Ritschel, Pere-Pau Vázquez, Àlvar Vinacua, and Timo Ropinski. 2018. Monte Carlo convolution for learning on non-uniformly sampled point clouds. In SIGGRAPH Asia 2018 Technical Papers. ACM, 235.
[8]
Mingyang Jiang, Yiran Wu, and Cewu Lu. 2018. Pointsift: A sift-like network module for 3d point cloud semantic segmentation. arXiv preprint arXiv:1807.00652 (2018).
[9]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[10]
Thomas N Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).
[11]
Roman Klokov and Victor Lempitsky. 2017. Escape from cells: Deep kd-networks for the recognition of 3d point cloud models. In Proceedings of the IEEE International Conference on Computer Vision. 863--872.
[12]
Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, and Baoquan Chen. 2018. PointCNN: Convolution On X-Transformed Points. In Advances in Neural Information Processing Systems. 828--838.
[13]
Xinhai Liu, Zhizhong Han, Yu-Shen Liu, and Matthias Zwicker. 2018. Point2Sequence: Learning the shape representation of 3D point clouds with an attention-based sequence to sequence network. arXiv preprint arXiv:1811.02565 (2018).
[14]
Daniel Maturana and Sebastian Scherer. 2015. Voxnet: A 3d convolutional neural network for real-time object recognition. In 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 922--928.
[15]
Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. 2017. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 652--660.
[16]
Charles R Qi, Hao Su, Matthias Nießner, Angela Dai, Mengyuan Yan, and Leonidas J Guibas. 2016. Volumetric and multi-view cnns for object classification on 3d data. In Proceedings of the IEEE conference on computer vision and pattern recognition. 5648--5656.
[17]
Charles Ruizhongtai Qi, Li Yi, Hao Su, and Leonidas J Guibas. 2017. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In Advances in Neural Information Processing Systems. 5099--5108.
[18]
Riccardo Roveri, A Cengiz Öztireli, Ioana Pandele, and Markus Gross. 2018. Pointpronets: Consolidation of point clouds with convolutional neural networks. In Computer Graphics Forum, Vol. 37. Wiley Online Library, 87--99.
[19]
Yiru Shen, Chen Feng, Yaoqing Yang, and Dong Tian. 2018. Mining point cloud local structures by kernel correlation and graph pooling. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4548--4557.
[20]
Ivan Sipiran and Benjamin Bustos. 2011. Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes. The Visual Computer 27, 11 (2011), 963.
[21]
Hang Su, Subhransu Maji, Evangelos Kalogerakis, and Erik Learned-Miller. 2015. Multi-view convolutional neural networks for 3d shape recognition. In Proceedings of the IEEE international conference on computer vision. 945--953.
[22]
Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E Sarma, Michael M Bronstein, and Justin M Solomon. 2018. Dynamic graph cnn for learning on point clouds. arXiv preprint arXiv:1801.07829 (2018).
[23]
Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3d shapenets: A deep representation for volumetric shapes. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1912--1920.
[24]
Li Yi, Vladimir G Kim, Duygu Ceylan, I Shen, Mengyan Yan, Hao Su, Cewu Lu, Qixing Huang, Alla Sheffer, Leonidas Guibas, et al. 2016. A scalable active framework for region annotation in 3d shape collections. ACM Transactions on Graphics (TOG) 35, 6 (2016), 210.
[25]
Li Yi, Hao Su, Xingwen Guo, and Leonidas J Guibas. 2017. Syncspeccnn: Synchronized spectral cnn for 3d shape segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2282--2290.
[26]
Kangxue Yin, Hui Huang, Daniel Cohen-Or, and Hao Zhang. 2018. P2P-NET: bidirectional point displacement net for shape transform. ACM Transactions on Graphics (TOG) 37, 4 (2018), 152.
[27]
Yang You, Yujing Lou, Qi Liu, Lizhuang Ma, Weiming Wang, Yuwing Tai, and Cewu Lu. 2018. PRIN: Pointwise Rotation- Invariant Network. arXiv preprint arXiv:1811.09361 (2018).

Cited By

View all
  • (2025)3D Lidar Point Cloud Segmentation for Automated DrivingIEEE Intelligent Transportation Systems Magazine10.1109/MITS.2023.332585417:1(8-29)Online publication date: Jan-2025
  • (2025)Revisiting 3D point cloud analysis with Markov processPattern Recognition10.1016/j.patcog.2024.110997158(110997)Online publication date: Feb-2025
  • (2025)A cascaded graph convolutional network for point cloud completionThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-024-03354-x41:1(659-674)Online publication date: 1-Jan-2025
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '19: Proceedings of the 27th ACM International Conference on Multimedia
October 2019
2794 pages
ISBN:9781450368896
DOI:10.1145/3343031
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. 3d shape analysis
  2. point cloud
  3. rotation invariance

Qualifiers

  • Research-article

Funding Sources

  • National Natural Science Foundation of China

Conference

MM '19
Sponsor:

Acceptance Rates

MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)58
  • Downloads (Last 6 weeks)4
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)3D Lidar Point Cloud Segmentation for Automated DrivingIEEE Intelligent Transportation Systems Magazine10.1109/MITS.2023.332585417:1(8-29)Online publication date: Jan-2025
  • (2025)Revisiting 3D point cloud analysis with Markov processPattern Recognition10.1016/j.patcog.2024.110997158(110997)Online publication date: Feb-2025
  • (2025)A cascaded graph convolutional network for point cloud completionThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-024-03354-x41:1(659-674)Online publication date: 1-Jan-2025
  • (2024)RIMeshGNN: A Rotation-Invariant Graph Neural Network for Mesh Classification2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00312(3138-3148)Online publication date: 3-Jan-2024
  • (2024)SR-Adv: Salient Region Adversarial Attacks on 3D Point Clouds for Autonomous DrivingIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.340615325:10(14019-14030)Online publication date: Oct-2024
  • (2024)Bilevel Fusion With Local and Global Cues for Point Cloud UpsamplingIEEE Transactions on Industrial Informatics10.1109/TII.2024.344163520:12(14094-14103)Online publication date: Dec-2024
  • (2024)Unsupervised Pose Decoder: Learn to Disentangle the Pose Attribute for Point Cloud Shape AnalysisIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2024.339344362(1-14)Online publication date: 2024
  • (2024)RIA-Net: Rotation Invariant Aware 3D Point Cloud for Large-Scale Place RecognitionIEEE Robotics and Automation Letters10.1109/LRA.2024.33848879:6(5014-5021)Online publication date: Jun-2024
  • (2024)MSGFusion: Muti-scale Semantic Guided LiDAR-Camera Fusion for 3D Object Detection2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10651407(1-8)Online publication date: 30-Jun-2024
  • (2024)A General Framework for Rotation Invariant Point Cloud AnalysisICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10446048(3665-3669)Online publication date: 14-Apr-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media