Search-Based Planning and Reinforcement Learning for Autonomous Systems and Robotics

Than Le, Bui Thanh Hung & Pham Van Huy

Part of the book series: Studies in Computational Intelligence (SCI, volume 984)

Abstract

In this chapter, we argue that a competent autonomous vehicle should be able to analyze both structured and unstructured environments and then localize itself relative to its surroundings in places where GPS, RFID, or similar means cannot provide enough information about its location. Reliable simultaneous localization and mapping (SLAM) is the most basic prerequisite for any further artificial intelligence task of an autonomous mobile robot. The goal of this chapter is to simulate a SLAM process on an advanced software development platform. The model represents the system itself, whereas the simulation represents the operation of the system over time. The software architecture lets us focus on our goal with a minimum of trivial work: it is an open-source meta-operating system that provides a large set of tools for robotics-related problems. Specifically, we first address how advanced vehicles can analyze structured and unstructured environments by means of search-based planning, and we then discuss a reinforcement learning-based model for finding optimal trajectories, with a view to applying it to autonomous systems.

Acknowledgements

We would like to thank Thu Dau Mot University and Ton Duc Thang University for their grant support.

Author information

Authors and Affiliations

Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
Than Le
Data Analytics and Artificial Intelligence Laboratory, Faculty of Engineering and Technology, Thu Dau Mot University, Ho Chi Minh City, Vietnam
Bui Thanh Hung
Artificial Intelligence Laboratory, Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
Pham Van Huy
Faculty of Engineering and Technology, Thu Dau Mot University, Ho Chi Minh City, Vietnam
Than Le

Corresponding author

Correspondence to Bui Thanh Hung.

Editor information

Editors and Affiliations

College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia
Anis Koubaa
College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia
Ahmad Taher Azar

About this chapter

Cite this chapter

Le, T., Hung, B.T., Van Huy, P. (2021). Search-Based Planning and Reinforcement Learning for Autonomous Systems and Robotics. In: Koubaa, A., Azar, A.T. (eds) Deep Learning for Unmanned Systems. Studies in Computational Intelligence, vol 984. Springer, Cham. https://doi.org/10.1007/978-3-030-77939-9_14

DOI: https://doi.org/10.1007/978-3-030-77939-9_14
Published: 02 October 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77938-2
Online ISBN: 978-3-030-77939-9
eBook Packages: Intelligent Technologies and Robotics (R0)
