Analysis for Adaptability of Policy-Improving System with a Mixture Model of Bayesian Networks to Dynamic Environments

Daisuke Kitakoshi²¹,
Hiroyuki Shioya²² &
Ryohei Nakano²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3684))

Included in the following conference series:

International Conference on Knowledge-Based and Intelligent Information and Engineering Systems

1433 Accesses
2 Citations

Abstract

We have proposed an online policy-improving system of reinforcement learning (RL) agents with a mixture model of Bayesian Networks (BNs), and discussed properties of the system. In this paper, two types of mixture models have been applied to the system. A structure of BN in the mixture model is selected based on data collected by agents in an environment, and is regarded as a stochastic knowledge of the environment. This research investigates the adaptability of our system to dynamic environments containing an unexperienced environment, in which an agent does not have the knowledge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Learning in the Presence of Multiple Agents

A Bayesian Posterior Updating Algorithm in Reinforcement Learning

A Bayesian reinforcement learning approach in markov games for computing near-optimal policies

Article 10 June 2023

References

Forbes, J., Huang, T., Kanazawa, K., Russell, S.: The MATmobile: Towards a Bayesian Automated Taxi. In: Proc. of the 14th Int. Joint Conf. on Artificial Intelligence, pp. 1878–1885 (1995)
Google Scholar
Kitakoshi, D., Shioya, H., Kurihara, M.: Analysis of a Method Improving Reinforcement Learning Agents’ Policies. Journal of ACIII 7(3), 276–282 (2003)
Google Scholar
Kitakoshi, D., Shioya, H., Kurihara, M.: A Reinforcement Learning System by using a Mixture Model of Bayesian Network. In: SICE Annual Conference 2003 Proc. TAII-14-1 (2003) (CD-ROM)
Google Scholar
Heckerman, D.: A Tutorial on Learning with Bayesian Networks. Technical Report MSR-TR-95-06, Microsoft Research (1995)
Google Scholar

Download references

Author information

Authors and Affiliations

Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya, 466-8555, Japan
Daisuke Kitakoshi & Ryohei Nakano
Muroran Institute of Technology, 27-1, Mizumoto-cho, Muroran, 050-0071, Japan
Hiroyuki Shioya

Authors

Daisuke Kitakoshi
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Shioya
View author publications
You can also search for this author in PubMed Google Scholar
Ryohei Nakano
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Business, La Trobe University, 3086, Melbourne, Victoria, Australia
Rajiv Khosla
Centre for SMART systems Engineering Research Centre, University of Brighton, Moulsecoomb, BN2 4GJ, Brighton, UK
Robert J. Howlett
School of Electrical and Information Engineering, Knowledge Based Intelligent Engineering Systems Centre, University of South Australia, 5095, Mawson Lakes, SA, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kitakoshi, D., Shioya, H., Nakano, R. (2005). Analysis for Adaptability of Policy-Improving System with a Mixture Model of Bayesian Networks to Dynamic Environments. In: Khosla, R., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2005. Lecture Notes in Computer Science(), vol 3684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11554028_102

Download citation

DOI: https://doi.org/10.1007/11554028_102
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28897-8
Online ISBN: 978-3-540-31997-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Analysis for Adaptability of Policy-Improving System with a Mixture Model of Bayesian Networks to Dynamic Environments

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Learning in the Presence of Multiple Agents

A Bayesian Posterior Updating Algorithm in Reinforcement Learning

A Bayesian reinforcement learning approach in markov games for computing near-optimal policies

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Analysis for Adaptability of Policy-Improving System with a Mixture Model of Bayesian Networks to Dynamic Environments

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Learning in the Presence of Multiple Agents

A Bayesian Posterior Updating Algorithm in Reinforcement Learning

A Bayesian reinforcement learning approach in markov games for computing near-optimal policies

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation