research-article

SRINet: Learning Strictly Rotation-Invariant Representations for Point Cloud Classification and Segmentation

Authors:

Jianguo XiaoAuthors Info & Claims

MM '19: Proceedings of the 27th ACM International Conference on Multimedia

Pages 980 - 988

https://doi.org/10.1145/3343031.3351042

Published: 15 October 2019 Publication History

Abstract

Point cloud analysis has drawn broader attentions due to its increasing demands in various fields. Despite the impressive performance has been achieved on several databases, researchers neglect the fact that the orientation of those point cloud data is aligned. Varying the orientation of point cloud may lead to the degradation of performance, restricting the capacity of generalizing to real applications where the prior of orientation is often unknown. In this paper, we propose the point projection feature, which is invariant to the rotation of the input point cloud. A novel architecture is designed to mine features of different levels. We adopt a PointNet-based backbone to extract global feature for point cloud, and the graph aggregation operation to perceive local shape structure. Besides, we introduce an efficient key point descriptor to assign each point with different response and help recognize the overall geometry. Mathematical analyses and experimental results demonstrate that the proposed method can extract strictly rotation-invariant representations for point cloud recognition and segmentation without data augmentation, and outperforms other state-of-the-art methods.

References

[1]

Tolga Birdal and Slobodan Ilic. 2015. Point pair features based object detection and pose estimation revisited. In 2015 International Conference on 3D Vision. IEEE, 527--535.

Digital Library

[2]

Tolga Birdal and Slobodan Ilic. 2017. Cad priors for accurate and flexible instance reconstruction. In Proceedings of the IEEE International Conference on Computer Vision. 133--142.

[3]

Michael M Bronstein, Joan Bruna, Yann LeCun, Arthur Szlam, and Pierre Vandergheynst. 2017. Geometric deep learning: going beyond euclidean data. IEEE Signal Processing Magazine 34, 4 (2017), 18--42.

[4]

Haowen Deng, Tolga Birdal, and Slobodan Ilic. 2018. Ppf-foldnet: Unsupervised learning of rotation invariant 3d local descriptors. In Proceedings of the European Conference on Computer Vision (ECCV). 602--618.

[5]

Bertram Drost, Markus Ulrich, Nassir Navab, and Slobodan Ilic. 2010. Model globally, match locally: Efficient and robust 3D object recognition. In 2010 IEEE computer society conference on computer vision and pattern recognition. Ieee, 998--1005.

[6]

Yifan Feng, Zizhao Zhang, Xibin Zhao, Rongrong Ji, and Yue Gao. 2018. GVCNN: Group-view convolutional neural networks for 3D shape recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 264--272.

[7]

Pedro Hermosilla, Tobias Ritschel, Pere-Pau Vázquez, Àlvar Vinacua, and Timo Ropinski. 2018. Monte Carlo convolution for learning on non-uniformly sampled point clouds. In SIGGRAPH Asia 2018 Technical Papers. ACM, 235.

[8]

Mingyang Jiang, Yiran Wu, and Cewu Lu. 2018. Pointsift: A sift-like network module for 3d point cloud semantic segmentation. arXiv preprint arXiv:1807.00652 (2018).

[9]

Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

[10]

Thomas N Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).

[11]

Roman Klokov and Victor Lempitsky. 2017. Escape from cells: Deep kd-networks for the recognition of 3d point cloud models. In Proceedings of the IEEE International Conference on Computer Vision. 863--872.

[12]

Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, and Baoquan Chen. 2018. PointCNN: Convolution On X-Transformed Points. In Advances in Neural Information Processing Systems. 828--838.

[13]

Xinhai Liu, Zhizhong Han, Yu-Shen Liu, and Matthias Zwicker. 2018. Point2Sequence: Learning the shape representation of 3D point clouds with an attention-based sequence to sequence network. arXiv preprint arXiv:1811.02565 (2018).

[14]

Daniel Maturana and Sebastian Scherer. 2015. Voxnet: A 3d convolutional neural network for real-time object recognition. In 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 922--928.

Digital Library

[15]

Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. 2017. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 652--660.

[16]

Charles R Qi, Hao Su, Matthias Nießner, Angela Dai, Mengyuan Yan, and Leonidas J Guibas. 2016. Volumetric and multi-view cnns for object classification on 3d data. In Proceedings of the IEEE conference on computer vision and pattern recognition. 5648--5656.

[17]

Charles Ruizhongtai Qi, Li Yi, Hao Su, and Leonidas J Guibas. 2017. Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In Advances in Neural Information Processing Systems. 5099--5108.

[18]

Riccardo Roveri, A Cengiz Öztireli, Ioana Pandele, and Markus Gross. 2018. Pointpronets: Consolidation of point clouds with convolutional neural networks. In Computer Graphics Forum, Vol. 37. Wiley Online Library, 87--99.

[19]

Yiru Shen, Chen Feng, Yaoqing Yang, and Dong Tian. 2018. Mining point cloud local structures by kernel correlation and graph pooling. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4548--4557.

[20]

Ivan Sipiran and Benjamin Bustos. 2011. Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes. The Visual Computer 27, 11 (2011), 963.

Digital Library

[21]

Hang Su, Subhransu Maji, Evangelos Kalogerakis, and Erik Learned-Miller. 2015. Multi-view convolutional neural networks for 3d shape recognition. In Proceedings of the IEEE international conference on computer vision. 945--953.

Digital Library

[22]

Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E Sarma, Michael M Bronstein, and Justin M Solomon. 2018. Dynamic graph cnn for learning on point clouds. arXiv preprint arXiv:1801.07829 (2018).

[23]

Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3d shapenets: A deep representation for volumetric shapes. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1912--1920.

[24]

Li Yi, Vladimir G Kim, Duygu Ceylan, I Shen, Mengyan Yan, Hao Su, Cewu Lu, Qixing Huang, Alla Sheffer, Leonidas Guibas, et al. 2016. A scalable active framework for region annotation in 3d shape collections. ACM Transactions on Graphics (TOG) 35, 6 (2016), 210.

Digital Library

[25]

Li Yi, Hao Su, Xingwen Guo, and Leonidas J Guibas. 2017. Syncspeccnn: Synchronized spectral cnn for 3d shape segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2282--2290.

[26]

Kangxue Yin, Hui Huang, Daniel Cohen-Or, and Hao Zhang. 2018. P2P-NET: bidirectional point displacement net for shape transform. ACM Transactions on Graphics (TOG) 37, 4 (2018), 152.

Digital Library

[27]

Yang You, Yujing Lou, Qi Liu, Lizhuang Ma, Weiming Wang, Yuwing Tai, and Cewu Lu. 2018. PRIN: Pointwise Rotation- Invariant Network. arXiv preprint arXiv:1811.09361 (2018).

Cited By

Abbasi RBashir ARehman AGe Y(2025)3D Lidar Point Cloud Segmentation for Automated DrivingIEEE Intelligent Transportation Systems Magazine10.1109/MITS.2023.332585417:1(8-29)Online publication date: Jan-2025
https://doi.org/10.1109/MITS.2023.3325854
Jiang CMa WHuang KWang QYang XZhao WWu JWang XXiao JNiu Z(2025)Revisiting 3D point cloud analysis with Markov processPattern Recognition10.1016/j.patcog.2024.110997158(110997)Online publication date: Feb-2025
https://doi.org/10.1016/j.patcog.2024.110997
Wang LLi JGuo SHan S(2025)A cascaded graph convolutional network for point cloud completionThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-024-03354-x41:1(659-674)Online publication date: 1-Jan-2025
https://dl.acm.org/doi/10.1007/s00371-024-03354-x
Show More Cited By

Index Terms

SRINet: Learning Strictly Rotation-Invariant Representations for Point Cloud Classification and Segmentation
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Shape inference
      2. Computer vision representations
        Shape representations
  2. Computer graphics
    1. Shape modeling
      1. Point-based models

Recommendations

Image analysis by circularly semi-orthogonal moments

Various types of circularly orthogonal moments have been widely used for image reconstruction and rotation invariant classification. However, they suffer from two errors namely numerical integration error and geometric error, which affect their ...
Two-Dimensional Polar Harmonic Transforms for Invariant Image Representation

This paper introduces a set of 2D transforms, based on a set of orthogonal projection bases, to generate a set of features which are invariant to rotation. We call these transforms Polar Harmonic Transforms (PHTs). Unlike the well-known Zernike and ...
Error analysis and accurate calculation of rotational moments

The Orthogonal rotation invariant moments suffer from two major errors - the geometric error and the numerical integration error. As a consequence of this, their rotation and scale invariance properties are affected. In this paper, we study the behavior ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '19: Proceedings of the 27th ACM International Conference on Multimedia

October 2019

2794 pages

ISBN:9781450368896

DOI:10.1145/3343031

General Chairs:
Laurent Amsaleg
CNRS-IRISA, France
,
Benoit Huet
EURECOM, France
,
Martha Larson
Radboud University and TU Delft (Netherlands)
,
Program Chairs:
Guillaume Gravier
CNRS-IRISA, France
,
Hayley Hung
Delft University of Technology Netherlands
,
Chong-Wah Ngo
City University of Hong Kong Hong Kong
,
Wei Tsang Ooi
National University of Singapore Singapore

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China

Conference

MM '19

Sponsor:

SIGMM

MM '19: The 27th ACM International Conference on Multimedia

October 21 - 25, 2019

Nice, France

Acceptance Rates

MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

70
Total Citations
View Citations
649
Total Downloads

Downloads (Last 12 months)58
Downloads (Last 6 weeks)4

Reflects downloads up to 17 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Abbasi RBashir ARehman AGe Y(2025)3D Lidar Point Cloud Segmentation for Automated DrivingIEEE Intelligent Transportation Systems Magazine10.1109/MITS.2023.332585417:1(8-29)Online publication date: Jan-2025
https://doi.org/10.1109/MITS.2023.3325854
Jiang CMa WHuang KWang QYang XZhao WWu JWang XXiao JNiu Z(2025)Revisiting 3D point cloud analysis with Markov processPattern Recognition10.1016/j.patcog.2024.110997158(110997)Online publication date: Feb-2025
https://doi.org/10.1016/j.patcog.2024.110997
Wang LLi JGuo SHan S(2025)A cascaded graph convolutional network for point cloud completionThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-024-03354-x41:1(659-674)Online publication date: 1-Jan-2025
https://dl.acm.org/doi/10.1007/s00371-024-03354-x
Shakibajahromi BKim EBreen D(2024)RIMeshGNN: A Rotation-Invariant Graph Neural Network for Mesh Classification2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00312(3138-3148)Online publication date: 3-Jan-2024
https://doi.org/10.1109/WACV57701.2024.00312
Zheng SLiu WGuo YZang YShen SWen CCheng MZhong PWang C(2024)SR-Adv: Salient Region Adversarial Attacks on 3D Point Clouds for Autonomous DrivingIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2024.340615325:10(14019-14030)Online publication date: Oct-2024
https://doi.org/10.1109/TITS.2024.3406153
Zhu YZhang ZCheng XZhang J(2024)Bilevel Fusion With Local and Global Cues for Point Cloud UpsamplingIEEE Transactions on Industrial Informatics10.1109/TII.2024.344163520:12(14094-14103)Online publication date: Dec-2024
https://doi.org/10.1109/TII.2024.3441635
Zhang ZLi ZDu MShi J(2024)Unsupervised Pose Decoder: Learn to Disentangle the Pose Attribute for Point Cloud Shape AnalysisIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2024.339344362(1-14)Online publication date: 2024
https://doi.org/10.1109/TGRS.2024.3393443
Hao WZhang WSu H(2024)RIA-Net: Rotation Invariant Aware 3D Point Cloud for Large-Scale Place RecognitionIEEE Robotics and Automation Letters10.1109/LRA.2024.33848879:6(5014-5021)Online publication date: Jun-2024
https://doi.org/10.1109/LRA.2024.3384887
Zhu HXue YCheng XHou B(2024)MSGFusion: Muti-scale Semantic Guided LiDAR-Camera Fusion for 3D Object Detection2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10651407(1-8)Online publication date: 30-Jun-2024
https://doi.org/10.1109/IJCNN60899.2024.10651407
Luo SGao W(2024)A General Framework for Rotation Invariant Point Cloud AnalysisICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10446048(3665-3669)Online publication date: 14-Apr-2024
https://doi.org/10.1109/ICASSP48485.2024.10446048
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten