DOI: 10.1145/307400.307437
Article
Free access

Reinforcement learning and mistake bounded algorithms

Published: 06 July 1999

References

[1] Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. Gambling in a rigged casino: The adversarial multi-armed bandit problem. In 36th Annual Symposium on Foundations of Computer Science, pages 322-331, 1995.
[2] N. Alon and J. Spencer. The Probabilistic Method. Wiley, 1992.
[3] Richard Bellman. Dynamic Programming. Princeton University Press, Princeton, N.J., 1957.
[4] Dimitri P. Bertsekas. Dynamic Programming: Deterministic and Stochastic Models. Prentice-Hall, 1987.
[5] Dimitri P. Bertsekas. Dynamic Programming and Optimal Control. Athena Scientific, 1995.
[6] Dimitri P. Bertsekas and John N. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1996. Deals with various techniques related to neural networks.
[7] Thomas H. Cormen, Charles E. Leiserson, and Ronald L. Rivest. Introduction to Algorithms. MIT Press, 1990.
[8] R. Howard. Dynamic Programming and Markov Processes. MIT Press, 1960.
[9] M. Kearns, Y. Mansour, and A. Ng. Approximate planning in large POMDPs via reusable trajectories.
[10] Michael J. Kearns and Umesh V. Vazirani. Computational Learning Theory. MIT Press, 1994.
[11] Nick Littlestone. Learning when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning, 2:285-318, 1988.
[12] Michael L. Littman. Algorithms for sequential decision making. PhD thesis, Brown University, 1996.
[13] Christos H. Papadimitriou and John N. Tsitsiklis. The complexity of Markov decision processes. Mathematics of Operations Research, 12(3):441-450, 1987.
[14] Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
[15] Gerald Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6:215-219, 1994.
[16] V. N. Vapnik. Estimation of Dependences Based on Empirical Data. Springer-Verlag, New York, 1982.

Cited By

  • (2016) PAC reinforcement learning with rich observations. Proceedings of the 30th International Conference on Neural Information Processing Systems, pages 1848-1856. DOI: 10.5555/3157096.3157303. Online publication date: 5-Dec-2016.



    Published In

    COLT '99: Proceedings of the twelfth annual conference on Computational learning theory
    July 1999
    333 pages
    ISBN:1581131674
    DOI:10.1145/307400
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]


    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 06 July 1999


    Qualifiers

    • Article

    Conference

    COLT99

    Acceptance Rates

    COLT '99 Paper Acceptance Rate: 35 of 71 submissions, 49%
    Overall Acceptance Rate: 35 of 71 submissions, 49%

    Article Metrics

    • Downloads (last 12 months): 35
    • Downloads (last 6 weeks): 3
    Reflects downloads up to 12 Nov 2024

