Abstract
The performance of feature matching algorithms is well known to be one of the main Achilles heels of visual SLAM algorithms, and particularly for point-based visual SLAM. Which is prone to fail in low-textured ,enarios like man-made environments where points are insufficient. Yet, many environments in which, despite being low textured, can still reliably estimate line-based geometric primitives. The line-based structural features in the Manhattan world encode useful geometric information of parallelism, orthogonality, and coplanarity in the scene. By fully exploiting these structural features, we propose a novel monocular SLAM system merging feature points and structure lines, which can provide a more accurate estimation of camera poses. To integrate the structure lines into the framework of the system, we have made efforts in the detection, parameterization, feature fusion, and optimization modules of the structure lines. First, we used the consistency of the direction of the structure lines and the vanishing points to extract the structure lines. In the optimization module, we incorporated the error model of the structure lines into the nonlinear optimization framework, and proposed a new optimization strategy. Finally, a complete SLAM system based on points and structure lines is designed. With structure lines as a new observation, the robustness of the matching algorithm between consecutive frames in low-texture scenes is increased, ensuring continuous updating of the tracking thread when the feature points are lost. Secondly, the dominant directions of structure lines can provide global effective constraints to reduce the accumulated orientation errors and the position drift in consequence. Experiments in man-made environments have demonstrated that the proposed system outperforms existing state-of-the-art monocular SLAM systems in terms of accuracy and robustness.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Cadena, C., Carlone, L.: Past, present, and future of simultaneous localization and mapping: toward the robust-perception age. IEEE Trans. Robot. 32(6), 1309–1332 (2016)
David, G.L., Lowe, D.G.: Distinctive image features from scale-invariant key-points. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Forster, C., Zhang, Z., Gassner, M.: Semidirect visual odometry for monocular and multicamera systems. IEEE Trans. Robot. 33(2), 249–265 (2017)
Mur-Artal, R., Montiel, J.M.M., Tardos, J.D.: ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE Trans. Robot. 31(5), 1147–1163 (2015)
Mur-Artal, R., Tardós, J.D.: Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras[J]. IEEE Trans. Robot. 33(5), 1255–1262 (2017)
Bartoli, A., Sturm, P.: Structure-from-motion using lines: representation, triangulation, and bundle adjustment. Comput. Vis. Image Underst. 100(3), 416–441 (2005)
Li, K., Yao, J.: Line segment matching and reconstruction via exploiting coplanar cues. ISPRS J. Photogramm. Remote Sens. 125(3), 33–49 (2017)
Zhang, G., Lee, J.H., Lim, J., et al.: Building a 3-D line-based map using stereo SLAM. IEEE Trans. Robot. 31(6), 1–14 (2016)
Zhou, H., Zou, D., Pei, L., et al.: StructSLAM: visual SLAM with building structure lines. IEEE Trans. Veh. Technol. 64(4), 1364–1375 (2015)
Mei, S.C., Benhimane, E., Malis, P., Rives, et al.: Efficienthomo graphy-based tracking and 3-D reconstruction for single-viewpoint sensors. IEEE Trans. Robot. 24(6), 1352–1364 (2008)
Košecká́, J., Zhang, W.: Extraction, matching, and pose recovery based on dominant rectangular structures. Comput. Vis. Image Underst. 100(3), 274–293 (2005)
Wu, Y., Zou, D., Pei, L., et al.: StructVIO: visual-inertial odometry with structural regularity of man-made environments. IEEE Trans. Robot. 35(4), 999–1013 (2019)
Harris, C., Stephens, M.: A combined corner and edge detector. In: Alvey vision conference, pp. 147–151 (1988)
Rublee, E., Rabaud, V., Konolige, K., et al.: ORB: an efficient alternative to SIFT or SURF. In: International conference on computer vision. IEEE (2012)
Montemerlo, M., Thrun, S., Koller, D., Wegbreit, B., et al.: Fast-SLAM: a factored solution to the simultaneous localization and mapping problem. In: IAAI, pp. 593–598 (2002)
Klein, G., Murray, D., et al.: Parallel tracking and mapping for small AR workspaces. In: Mixed and augmented reality. In: ISMAR, pp. 225–234 (2007)
Klein, G., Murray, D.W.: Improving the agility of keyframe-based SLAM. In: European conference on computer vision, pp. 802–815 (2008)
Forster, C., Pizzoli, M., Scaramuzza, D.: SVO: fast semi-direct monocular visual odometry. In: IEEE international conference on robotics and automation, pp. 15–22 (2014)
Smith, P., Reid, I., Davison, A.J.: Real-time monocular SLAM with straight lines. In: Proceedings of the British machine vision conference, vol. 3, pp. 1–10 (2006)
Gee, A.P., Mayol-Cuevas, W.: Real-time model-based SLAM using line segments[C]. International Symposium on Visual Computing, pp. 354–363. Springer, Berlin, Heidelberg (2006)
Lemaire, T., Lacroix, S.: Monocular-vision based SLAM using line segments. In: International conference on robotics and automation, pp. 2791–2796 (2007)
Sola, J., Vidal-Calleja, T., Devy, M.: Undelayed initialization of line segments in monocular SLAM. In: International conference on intelligent robots and systems, pp. 1553–1558 (2009)
Pumarola, A., Vakhitov, A., Agudo, A., et al.: PL-SLAM: real-time monocular visual SLAM with points and lines. In: International conference on robotics and automation, pp. 4503–4508 (2017)
Lu, X., Yao, J., Li, H., Liu, Y.: 2-Line exhaustive searching for real-time vanishing point estimation in Manhattan world. In: WACV (2017)
Lee, Y.H., Nam, C., Lee, K.Y., et al.: VPass: algorithmic compass using vanishing points in indoor environments. In: IROS (2009)
Camposeco, F., Pollefeys, M.: Using vanishing points to improve visual-inertial odometry. In: ICRA (2015)
Kwon, J., Lee, K.M.: Monocular SLAM with locally planar landmarks via geometric Rao-Blackwellized particle filtering on Lie groups. In: CVPR (2010)
Coughlan, J., Yuille, A.L.: Manhattan world: compass direction from a single image by Bayesian inference. In: International conference on computer vision, pp. 941–947 (1999)
Von Gioi, R.G., Jakubowicz, J., Morel, J.M., et al.: LSD: a line segment detector[J]. Image Process Line 24(2), 35–55 (2012)
Lu, X., Yaoy, J., Li, H., et al.: 2-Line exhaustive searching for real-time vanishing point estimation in Manhattan world. In: Workshop on applications of computer vision, pp. 345–353 (2017)
Gomez-Ojeda, R., Zuiga-Nol, D., Moreno, F.A., et al.: PL-SLAM: a stereo SLAM system through the combination of points and line segments. arXiv (2017)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yang, F., Cao, Y. & Zhang, W. PSL-SLAM: a monocular SLAM system using points and structure lines in Manhattan World. Int J Intell Robot Appl 6, 52–68 (2022). https://doi.org/10.1007/s41315-021-00204-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41315-021-00204-0