Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1082473.1082484acmconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
Article

Rapid on-line temporal sequence prediction by an adaptive agent

Published: 25 July 2005 Publication History

Abstract

Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning agent operating in a non-stationary environment. The challenge for a learning agent under these conditions is to exploit the relevant experience from a limited environmental event history while preserving flexibility.We propose a novel time/space efficient method for learning temporal sequences and making short-term predictions. Our method operates on-line, requires few exemplars, and adapts easily and quickly to changes in the underlying stochastic world model. Using a short-term memory of recent observations, the method maintains a dynamic space of candidate hypotheses in which the growth of the space is systematically and dynamically pruned using an entropy measure over the observed predictive quality of each candidate hypothesis.The method compares well against Markov-chain predictions, and adapts faster than learned Markov-chain models to changes in the underlying distribution of events. We demonstrate the method using both synthetic data and empirical experience from a game-playing scenario with human opponents.

References

[1]
R. Agrawal and R. Srikant. Mining sequential patterns. In P. S. Yu and A. S. P. Chen, editors, Eleventh International Conference on Data Engineering, pages 3--14, Taipei, Taiwan, 1995. IEEE computer Society Press.
[2]
Y. Bengio, S. Bengio, J.-F. Isabelle, and Y. Singer. Shared context probabilistic transducers. NIPS'97, 10, 1998.
[3]
M. Bowling and M. Veloso. Multiagent learning using a variable learning rate. Artificial Intelligence, 136:215--250, 2002.
[4]
D. Fudenberg and D. K. Levine. The Theory of Learning in Games. MIT Press, Cambridge, Massachusetts, 1999.
[5]
I. Guyon and F. Pereira. Design of a linguistic postprocessor using variable memory length Markov models. International Conference on Document Analysis and Recognition, pages 454--457, 1995.
[6]
S. A. Huettel, P. B. Mack, and G. McCarthy. Perceiving patterns in random series: dynamic processing of sequence in prefrontal cortex. Nature Neuroscience, 5(5):485--490, May 2002.
[7]
G. A. Miller. The magical number 7 plus or minus two: Some limits on our capacity in processing information. Psychol. Rev., 63:81--97, 1956.
[8]
D. Ron, Y. Singer, and N. Tishby. The power of amnesia: Learning probabilistic automata with variable memory length. Machine Learning, 25, 1996.
[9]
L. K. Saul and M. I. Jordan. Mixed memory Markov models: decomposing complex stochastic processes as mixtures of simpler ones. Machine Learning, pages 1--11, 1998.
[10]
Y. Singer. Adaptive mixture of probabilistic transducers. Neural Computation, 9, 1997.

Cited By

View all
  • (2021)Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing MapsComputational Intelligence and Neuroscience10.1155/2021/55904452021Online publication date: 1-Jan-2021
  • (2014)Sequential Decisions: A Computational Comparison of Observational and Reinforcement AccountsPLoS ONE10.1371/journal.pone.00943089:4(e94308)Online publication date: 18-Apr-2014
  • (2014)A framework for learning and planning against switching strategies in repeated gamesConnection Science10.1080/09540091.2014.88529426:2(103-122)Online publication date: 14-Apr-2014
  • Show More Cited By

Index Terms

  1. Rapid on-line temporal sequence prediction by an adaptive agent

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    AAMAS '05: Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
    July 2005
    1407 pages
    ISBN:1595930930
    DOI:10.1145/1082473
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 July 2005

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. markov decision process
    2. n-gram
    3. rapid learning
    4. sequence prediction

    Qualifiers

    • Article

    Conference

    AAMAS05
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 21 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing MapsComputational Intelligence and Neuroscience10.1155/2021/55904452021Online publication date: 1-Jan-2021
    • (2014)Sequential Decisions: A Computational Comparison of Observational and Reinforcement AccountsPLoS ONE10.1371/journal.pone.00943089:4(e94308)Online publication date: 18-Apr-2014
    • (2014)A framework for learning and planning against switching strategies in repeated gamesConnection Science10.1080/09540091.2014.88529426:2(103-122)Online publication date: 14-Apr-2014
    • (2011)Integration of sequence learning and CBR for complex equipment failure predictionProceedings of the 19th international conference on Case-Based Reasoning Research and Development10.1007/978-3-642-23291-6_30(408-422)Online publication date: 12-Sep-2011
    • (2005)Non-stationary policy learning in 2-player zero sum gamesProceedings of the 20th national conference on Artificial intelligence - Volume 210.5555/1619410.1619459(789-794)Online publication date: 9-Jul-2005

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media