research-article

Partially Observable Markov Decision Processes and Performance Sensitivity Analysis

Authors:

Yanjie Li,

Baoqun Yin,

Hongsheng XiAuthors Info & Claims

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, Volume 38, Issue 6

Pages 1645 - 1651

https://doi.org/10.1109/TSMCB.2008.927711

Published: 01 December 2008 Publication History

Abstract

The sensitivity-based optimization of Markov systems has become an increasingly important area. From the perspective of performance sensitivity analysis, policy-iteration algorithms and gradient estimation methods can be directly obtained for Markov decision processes (MDPs). In this correspondence, the sensitivity-based optimization is extended to average reward partially observable MDPs (POMDPs). We derive the performance-difference and performance-derivative formulas of POMDPs. On the basis of the performance-derivative formula, we present a new method to estimate the performance gradients. From the performance-difference formula, we obtain a sufficient optimality condition without the discounted reward formulation. We also propose a policy-iteration algorithm to obtain a nearly optimal finite-state-controller policy.

Cited By

View all

Yin BCao JKang YLu XJiang X(2018)A novel POMDP-based server RAM caching algorithm for VoD systemsMultimedia Tools and Applications10.1007/s11042-017-4930-477:10(13023-13045)Online publication date: 1-May-2018
https://dl.acm.org/doi/10.1007/s11042-017-4930-4
Yao JYin BTan XJiang X(2017)A POMDP framework for forwarding mechanism in named data networkingComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2016.11.005112:C(167-175)Online publication date: 15-Jan-2017
https://dl.acm.org/doi/10.1016/j.comnet.2016.11.005
Li HLi BTran TSicker D(2016)Transmission Schemes for Multicasting Hard Deadline Constrained Prioritized Data in Wireless Multimedia StreamingIEEE Transactions on Wireless Communications10.1109/TWC.2015.249354615:3(1631-1641)Online publication date: 8-Mar-2016
https://dl.acm.org/doi/10.1109/TWC.2015.2493546
Show More Cited By

Partially Observable Markov Decision Processes and Performance Sensitivity Analysis
1. Mathematics of computing
  1. Probability and statistics
    1. Probabilistic representations
    2. Stochastic processes
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Machine learning theory

Recommendations

Partially Observable Risk-Sensitive Markov Decision Processes

We consider the problem of minimizing a certainty equivalent of the total or discounted cost over a finite and an infinite time horizon that is generated by a partially observable Markov decision process POMDP. In contrast to a risk-neutral decision ...
What is decidable about partially observable Markov decision processes with ω-regular objectives

Decidability of qualitative analysis of parity POMDPs under finite-memory strategies.Optimal memory bounds and complexity (EXPTIME-completeness) for the above problem.Implementation of our algorithm with several heuristics and experimental results. We ...
Optimally solving Markov decision processes with total expected discounted reward function

Compared computational performance of linear programming and the policy iteration.Considered only discrete-time infinite-horizon MDPs with discounted reward.Used randomly generated test problems and a real-life health-care problem.Showed that, unlike ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics Volume 38, Issue 6

December 2008

230 pages

ISSN:1083-4419

Issue’s Table of Contents

Publisher

IEEE Press

Publication History

Published: 01 December 2008

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Yin BCao JKang YLu XJiang X(2018)A novel POMDP-based server RAM caching algorithm for VoD systemsMultimedia Tools and Applications10.1007/s11042-017-4930-477:10(13023-13045)Online publication date: 1-May-2018
https://dl.acm.org/doi/10.1007/s11042-017-4930-4
Yao JYin BTan XJiang X(2017)A POMDP framework for forwarding mechanism in named data networkingComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.comnet.2016.11.005112:C(167-175)Online publication date: 15-Jan-2017
https://dl.acm.org/doi/10.1016/j.comnet.2016.11.005
Li HLi BTran TSicker D(2016)Transmission Schemes for Multicasting Hard Deadline Constrained Prioritized Data in Wireless Multimedia StreamingIEEE Transactions on Wireless Communications10.1109/TWC.2015.249354615:3(1631-1641)Online publication date: 8-Mar-2016
https://dl.acm.org/doi/10.1109/TWC.2015.2493546
Lin FYin BHuang JWu X(2012)Admission control with elastic QoS for video on demand systemsInternational Journal of Automation and Computing10.1007/s11633-012-0668-79:5(467-473)Online publication date: 1-Oct-2012
https://dl.acm.org/doi/10.1007/s11633-012-0668-7
Yu LBrooks R(2011)Observable subspace solution for irreducible POMDPs with infinite horizonProceedings of the Seventh Annual Workshop on Cyber Security and Information Intelligence Research10.1145/2179298.2179392(1-1)Online publication date: 12-Oct-2011
https://dl.acm.org/doi/10.1145/2179298.2179392

Abstract

Cited By

Recommendations

Partially Observable Risk-Sensitive Markov Decision Processes

What is decidable about partially observable Markov decision processes with ω-regular objectives

Optimally solving Markov decision processes with total expected discounted reward function

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Share

Share this Publication link

Share on social media

Affiliations