Robust Control of Markov Decision Processes with Uncertain Transition Matrices
Optimal solutions to Markov decision problems may be very sensitive to the state transition probabilities. In many practical problems, the estimation of these probabilities is far from accurate, and estimation errors are a limiting factor in applying Markov decision processes to real-world problems.
We consider a robust control problem for a finite-state, finite-action Markov decision process, where uncertainty on the transition matrices is described in terms of possibly non-convex sets. We show that perfect duality holds for this problem, and that as a consequence, it can be solved with a variant of the classical dynamic programming algorithm, the “robust dynamic programming” algorithm. We show that a particular choice of the uncertainty sets, involving likelihood regions or entropy bounds, leads to both a statistically accurate representation of uncertainty, and a complexity of the robust recursion that is almost the same as that of the classical recursion. Hence, robustness can be added at practically no extra computing cost. We derive similar results for other uncertainty sets, including one with a finite number of possible values for the transition matrices.
We describe, in a practical path-planning example, the benefits of using a robust strategy instead of the classical optimal strategy: even if the uncertainty level is only crudely guessed, the robust strategy yields a much better worst-case expected travel time.
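As an illustration of the robust recursion described above, here is a minimal sketch, not taken from the paper, of robust value iteration under the simplest uncertainty model the abstract mentions: a finite set of possible transition matrices for each state-action pair. Each Bellman backup takes the worst case over the uncertainty set, then the best case over actions. The function name, the toy MDP, and the data layout (`P_sets[s][a]` as a list of candidate transition vectors) are all illustrative assumptions.

```python
import numpy as np

def robust_value_iteration(P_sets, R, gamma=0.95, tol=1e-8):
    """Robust dynamic programming for a finite-state, finite-action MDP.

    P_sets[s][a] is a list of candidate transition vectors (the finite
    uncertainty set for state s, action a); R[s, a] is the immediate reward.
    Returns the robust value function and a greedy robust policy.
    """
    n_states, n_actions = R.shape
    V = np.zeros(n_states)
    while True:
        Q = np.empty((n_states, n_actions))
        for s in range(n_states):
            for a in range(n_actions):
                # Adversarial "nature" picks the transition vector in the
                # uncertainty set that minimizes the expected future value.
                worst = min(p @ V for p in P_sets[s][a])
                Q[s, a] = R[s, a] + gamma * worst
        V_new = Q.max(axis=1)  # controller then maximizes over actions
        if np.max(np.abs(V_new - V)) < tol:
            return V_new, Q.argmax(axis=1)
        V = V_new
```

With singleton uncertainty sets this reduces to classical value iteration; enlarging the sets can only lower the robust value, which mirrors the worst-case guarantee discussed above.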
Cited By
- Yu P, Zhu S, De Giacomo G, Kwiatkowska M and Vardi M. The trembling-hand problem for LTLf planning. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 3631-3641.
- Bernini N, Bessa M, Delmas R, Gold A, Goubault E, Pennec R, Putot S and Sillion F. A few lessons learned in reinforcement learning for quadcopter attitude control. Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control, 1-11.
- Gohari P, Hale M and Topcu U. Privacy-Preserving Policy Synthesis in Markov Decision Processes. 2020 59th IEEE Conference on Decision and Control (CDC), 6266-6271.
- Pal R, Datta A and Dougherty E (2009). Bayesian robustness in the control of gene regulatory networks. IEEE Transactions on Signal Processing, 57:9, 3667-3678. Online publication date: 1-Sep-2009.
- Doshi F and Roy N. Efficient model learning for dialog management. Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 65-72.