Article

Rapid on-line temporal sequence prediction by an adaptive agent

Authors:

Steven Jensen,

Daniel Boley,

Maria Gini,

Paul SchraterAuthors Info & Claims

AAMAS '05: Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems

Pages 67 - 73

https://doi.org/10.1145/1082473.1082484

Published: 25 July 2005 Publication History

Get Access

Abstract

Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning agent operating in a non-stationary environment. The challenge for a learning agent under these conditions is to exploit the relevant experience from a limited environmental event history while preserving flexibility.We propose a novel time/space efficient method for learning temporal sequences and making short-term predictions. Our method operates on-line, requires few exemplars, and adapts easily and quickly to changes in the underlying stochastic world model. Using a short-term memory of recent observations, the method maintains a dynamic space of candidate hypotheses in which the growth of the space is systematically and dynamically pruned using an entropy measure over the observed predictive quality of each candidate hypothesis.The method compares well against Markov-chain predictions, and adapts faster than learned Markov-chain models to changes in the underlying distribution of events. We demonstrate the method using both synthetic data and empirical experience from a game-playing scenario with human opponents.

References

[1]

R. Agrawal and R. Srikant. Mining sequential patterns. In P. S. Yu and A. S. P. Chen, editors, Eleventh International Conference on Data Engineering, pages 3--14, Taipei, Taiwan, 1995. IEEE computer Society Press.

Digital Library

Google Scholar

[2]

Y. Bengio, S. Bengio, J.-F. Isabelle, and Y. Singer. Shared context probabilistic transducers. NIPS'97, 10, 1998.

Digital Library

Google Scholar

[3]

M. Bowling and M. Veloso. Multiagent learning using a variable learning rate. Artificial Intelligence, 136:215--250, 2002.

Digital Library

Google Scholar

[4]

D. Fudenberg and D. K. Levine. The Theory of Learning in Games. MIT Press, Cambridge, Massachusetts, 1999.

Google Scholar

[5]

I. Guyon and F. Pereira. Design of a linguistic postprocessor using variable memory length Markov models. International Conference on Document Analysis and Recognition, pages 454--457, 1995.

Digital Library

Google Scholar

[6]

S. A. Huettel, P. B. Mack, and G. McCarthy. Perceiving patterns in random series: dynamic processing of sequence in prefrontal cortex. Nature Neuroscience, 5(5):485--490, May 2002.

Crossref

Google Scholar

[7]

G. A. Miller. The magical number 7 plus or minus two: Some limits on our capacity in processing information. Psychol. Rev., 63:81--97, 1956.

Crossref

Google Scholar

[8]

D. Ron, Y. Singer, and N. Tishby. The power of amnesia: Learning probabilistic automata with variable memory length. Machine Learning, 25, 1996.

Digital Library

Google Scholar

[9]

L. K. Saul and M. I. Jordan. Mixed memory Markov models: decomposing complex stochastic processes as mixtures of simpler ones. Machine Learning, pages 1--11, 1998.

Digital Library

Google Scholar

[10]

Y. Singer. Adaptive mixture of probabilistic transducers. Neural Computation, 9, 1997.

Digital Library

Google Scholar

Cited By

View all

Stetter MLang E(2021)Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing MapsComputational Intelligence and Neuroscience10.1155/2021/55904452021Online publication date: 1-Jan-2021
https://dl.acm.org/doi/10.1155/2021/5590445
Mohammadi Sepahvand NStöttinger EDanckert JAnderson B(2014)Sequential Decisions: A Computational Comparison of Observational and Reinforcement AccountsPLoS ONE10.1371/journal.pone.00943089:4(e94308)Online publication date: 18-Apr-2014
https://doi.org/10.1371/journal.pone.0094308
Hernandez-Leal PMunoz de Cote ESucar L(2014)A framework for learning and planning against switching strategies in repeated gamesConnection Science10.1080/09540091.2014.88529426:2(103-122)Online publication date: 14-Apr-2014
https://doi.org/10.1080/09540091.2014.885294
Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

AAMAS '05: Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems

July 2005

1407 pages

ISBN:1595930930

DOI:10.1145/1082473

Program Chairs:
Michal Pechoucek
Czech Republic
,
Donald Steiner
USA
,
Simon Thompson
UK

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2005

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

AAMAS05

Sponsor:

SIGAI

AAMAS05: AAMAS '05 - Fourth International Joint Conference on Autonomous Agents and Multiagent Systems 2005

July 25 - 29, 2005

The Netherlands

Acceptance Rates

Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
403
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Stetter MLang E(2021)Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing MapsComputational Intelligence and Neuroscience10.1155/2021/55904452021Online publication date: 1-Jan-2021
https://dl.acm.org/doi/10.1155/2021/5590445
Mohammadi Sepahvand NStöttinger EDanckert JAnderson B(2014)Sequential Decisions: A Computational Comparison of Observational and Reinforcement AccountsPLoS ONE10.1371/journal.pone.00943089:4(e94308)Online publication date: 18-Apr-2014
https://doi.org/10.1371/journal.pone.0094308
Hernandez-Leal PMunoz de Cote ESucar L(2014)A framework for learning and planning against switching strategies in repeated gamesConnection Science10.1080/09540091.2014.88529426:2(103-122)Online publication date: 14-Apr-2014
https://doi.org/10.1080/09540091.2014.885294
Compta MLópez B(2011)Integration of sequence learning and CBR for complex equipment failure predictionProceedings of the 19th international conference on Case-Based Reasoning Research and Development10.1007/978-3-642-23291-6_30(408-422)Online publication date: 12-Sep-2011
https://dl.acm.org/doi/10.1007/978-3-642-23291-6_30
Jensen SBoley DGini MSchrater P(2005)Non-stationary policy learning in 2-player zero sum gamesProceedings of the 20th national conference on Artificial intelligence - Volume 210.5555/1619410.1619459(789-794)Online publication date: 9-Jul-2005
https://dl.acm.org/doi/10.5555/1619410.1619459

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Recommendations

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations