Abstract
Object recognition in 3D point clouds is a challenging task, especially when processing time is a critical constraint, as in industrial applications. Local descriptors are a suitable choice whenever the 6 DoF pose of the recognized objects must also be estimated. However, the pipeline for this kind of descriptor is highly time-consuming. In this work, we propose an update to the traditional pipeline that adds a preliminary filtering stage, referred to as saliency boost. We run tests on a standard object recognition benchmark with four keypoint detectors and four local descriptors to compare the time and recognition performance of the traditional pipeline and the boosted one. The timing results show that the boosted pipeline can be up to 5 times faster, with the recognition rate improving in most cases and decreasing only slightly in the others. These results suggest that the boosted pipeline can substantially reduce processing time with limited impact on, or even benefits to, recognition accuracy.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Marcon, M., Spezialetti, R., Salti, S., Silva, L., Di Stefano, L. (2019). Boosting Object Recognition in Point Clouds by Saliency Detection. In: Cristani, M., Prati, A., Lanz, O., Messelodi, S., Sebe, N. (eds) New Trends in Image Analysis and Processing – ICIAP 2019. ICIAP 2019. Lecture Notes in Computer Science(), vol 11808. Springer, Cham. https://doi.org/10.1007/978-3-030-30754-7_32
DOI: https://doi.org/10.1007/978-3-030-30754-7_32
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30753-0
Online ISBN: 978-3-030-30754-7
eBook Packages: Computer Science (R0)