Markov Decision Processes: Discrete Stochastic Dynamic Programming

January 1994

Author: Martin L. Puterman

Publisher: John Wiley & Sons, Inc., 605 Third Ave., New York, NY, United States

ISBN: 978-0-471-61977-2

Published: 01 January 1994

Pages: 672

Abstract

From the Publisher:

The past decade has seen considerable theoretical and applied research on Markov decision processes, as well as the growing use of these models in ecology, economics, communications engineering, and other fields where outcomes are uncertain and sequential decision-making processes are needed. A timely response to this increased activity, Martin L. Puterman's new work provides a uniquely up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models. It discusses all major research directions in the field, highlights many significant applications of Markov decision process models, and explores numerous important topics that have previously been neglected or given cursory coverage in the literature. Markov Decision Processes focuses primarily on infinite horizon discrete time models and models with discrete state spaces while also examining models with arbitrary state spaces, finite horizon models, and continuous-time discrete state models. The book is organized around optimality criteria, using a common framework centered on the optimality (Bellman) equation for presenting results. The results are presented in a "theorem-proof" format and elaborated on through both discussion and examples, including results that are not available in any other book. A two-state Markov decision process model, presented in Chapter 3, is analyzed repeatedly throughout the book and demonstrates many results and algorithms. Markov Decision Processes covers recent research advances in such areas as countable state space models with average reward criterion, constrained models, and models with risk sensitive optimality criteria. It also explores several topics that have received little or no attention in other books, including modified policy iteration, multichain models with average reward criterion, and sensitive optimality. In addition, a Bibliographic Remarks section in each chapter comments on relevant historic

Contributors

Martin L Puterman
UBC Sauder School of Business

Recommendations

Variability Sensitive Markov Decision Processes

Considered are time-average Markov Decision Processes (MDPs) with finite state and action spaces. Two definitions of variability are introduced, namely, the expected time-average variability and time-average expected variability. The two criteria are in ...

Continuous Time Discounted Jump Markov Decision Processes: A Discrete-Event Approach

This paper introduces and develops a new approach to the theory of continuous time jump Markov decision processes (CTJMDP). This approach reduces discounted CTJMDPs to discounted semi-Markov decision processes (SMDPs) and eventually to discrete-time ...
Continuous-Time Markov Decision Processes with Discounted Rewards: The Case of Polish Spaces

This paper deals with continuous-time Markov decision processes in Polish spaces, under an expected discounted reward criterion. The transition rates of underlying continuous-time jump Markov processes are allowed to be unbounded, and the reward rates ...
