Abstract
In this paper, we propose Q-learning with adaptive state space construction, an efficient method for building a state space suitable for Q-learning to accomplish a task in a continuous sensor space. In the proposed algorithm, the robot starts with a single state covering the whole sensor space. New states are generated incrementally, either by segmenting a sub-region of the sensor space or by combining existing states; the criterion for incremental segmentation and combination is derived from the Q-learning algorithm itself. Simulation results show that the proposed algorithm constructs the state space effectively enough to accomplish the task, and that the resulting states partition the sensor space as a Voronoi tessellation.
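As a rough illustration of the mechanism the abstract describes, the sketch below implements Q-learning over a nearest-prototype state representation, where the prototypes induce a Voronoi tessellation of the sensor space and a cell is split when its running TD-error magnitude stays large. This is a minimal sketch under assumed details: the class name AdaptiveQLearner, the exponential TD-error statistic, and the split_threshold parameter are illustrative stand-ins, not the criterion given in the paper.

```python
import numpy as np


class AdaptiveQLearner:
    """Q-learning over an incrementally constructed state space.

    Each state is a prototype vector in the continuous sensor space;
    an observation is assigned to the nearest prototype, so the states
    induce a Voronoi tessellation of the sensor space. A cell whose
    TD errors remain large is split, on the assumption that it lumps
    together situations that call for different actions.
    """

    def __init__(self, sensor_dim, n_actions,
                 alpha=0.1, gamma=0.9, split_threshold=0.5):
        self.alpha = alpha
        self.gamma = gamma
        self.split_threshold = split_threshold
        # Start with a single state covering the whole sensor space.
        self.prototypes = [np.zeros(sensor_dim)]
        self.q = [np.zeros(n_actions)]
        self.td_err = [0.0]  # running mean of |TD error| per state

    def state(self, x):
        """Voronoi assignment: index of the nearest prototype."""
        dists = [np.linalg.norm(x - p) for p in self.prototypes]
        return int(np.argmin(dists))

    def update(self, x, a, r, x_next):
        """One Q-learning step, followed by the segmentation check."""
        s, s_next = self.state(x), self.state(x_next)
        td = r + self.gamma * np.max(self.q[s_next]) - self.q[s][a]
        self.q[s][a] += self.alpha * td
        # Track TD-error magnitude as the segmentation criterion
        # (an illustrative stand-in for the paper's criterion).
        self.td_err[s] = 0.9 * self.td_err[s] + 0.1 * abs(td)
        if self.td_err[s] > self.split_threshold:
            self._split(s, x)

    def _split(self, s, x):
        """Segment state s by adding a new prototype at observation x."""
        self.prototypes.append(np.array(x, dtype=float))
        self.q.append(self.q[s].copy())
        self.td_err[s] = 0.0
        self.td_err.append(0.0)
        # Combining near-duplicate states would be the inverse step;
        # it is omitted here to keep the sketch short.
```

Adding a prototype at the current observation splits the offending cell into two Voronoi regions while the copied Q-values preserve what has been learned, so refinement concentrates where the task demands finer distinctions.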
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Murao, H., Kitamura, S. (1998). Q-Learning with Adaptive State Space Construction. In: Birk, A., Demiris, J. (eds) Learning Robots. EWLR 1997. Lecture Notes in Computer Science, vol. 1545. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49240-2_2
Print ISBN: 978-3-540-65480-3
Online ISBN: 978-3-540-49240-5