Partially Observable Markov Decision Processes and Performance Sensitivity Analysis

Published: 01 December 2008

Abstract

The sensitivity-based optimization of Markov systems has become an increasingly important area. From the perspective of performance sensitivity analysis, policy-iteration algorithms and gradient estimation methods can be directly obtained for Markov decision processes (MDPs). In this correspondence, the sensitivity-based optimization is extended to average reward partially observable MDPs (POMDPs). We derive the performance-difference and performance-derivative formulas of POMDPs. On the basis of the performance-derivative formula, we present a new method to estimate the performance gradients. From the performance-difference formula, we obtain a sufficient optimality condition without the discounted reward formulation. We also propose a policy-iteration algorithm to obtain a nearly optimal finite-state-controller policy.
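The abstract states the performance-difference and performance-derivative formulas only in words. As a rough illustration, the sketch below checks the well-known analogues for a fully observable ergodic Markov chain: the difference of average rewards equals π'[(P' − P)g + (f' − f)], and the derivative along the line from (P, f) to (P', f') equals π[(P' − P)g + (f' − f)], where g is the vector of performance potentials. This is not the POMDP formulation derived in the paper. All function names and the two-state example are assumptions made for illustration, and the iterative solvers assume an ergodic, aperiodic chain.

```python
def stationary(P, iters=2000):
    """Stationary distribution by power iteration (assumes ergodic, aperiodic P)."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return pi


def average_reward(P, f):
    """Long-run average reward eta = pi . f."""
    return sum(p * r for p, r in zip(stationary(P), f))


def potentials(P, f, iters=2000):
    """Performance potentials g = sum_k P^k (f - eta e), truncated after iters terms."""
    n = len(P)
    eta = average_reward(P, f)
    term = [r - eta for r in f]
    g = [0.0] * n
    for _ in range(iters):
        g = [gi + ti for gi, ti in zip(g, term)]
        term = [sum(P[i][j] * term[j] for j in range(n)) for i in range(n)]
    return g


def sensitivity_term(P, f, P2, f2):
    """Vector (P2 - P) g + (f2 - f), with g the potentials of (P, f)."""
    n = len(P)
    g = potentials(P, f)
    return [
        sum((P2[i][j] - P[i][j]) * g[j] for j in range(n)) + f2[i] - f[i]
        for i in range(n)
    ]


def performance_difference(P, f, P2, f2):
    """eta' - eta, weighted by the stationary distribution of the NEW chain."""
    h = sensitivity_term(P, f, P2, f2)
    return sum(p * x for p, x in zip(stationary(P2), h))


def performance_derivative(P, f, P2, f2):
    """d eta / d delta at delta = 0, weighted by the ORIGINAL stationary distribution."""
    h = sensitivity_term(P, f, P2, f2)
    return sum(p * x for p, x in zip(stationary(P), h))


# Hypothetical two-state example: (P, f) is the current policy, (P2, f2) a candidate.
P = [[0.9, 0.1], [0.4, 0.6]]
f = [1.0, 0.0]
P2 = [[0.5, 0.5], [0.4, 0.6]]
f2 = [0.8, 0.0]

direct = average_reward(P2, f2) - average_reward(P, f)
print("direct difference: ", direct)
print("formula difference:", performance_difference(P, f, P2, f2))
print("derivative at 0:   ", performance_derivative(P, f, P2, f2))
```

The two formulas differ only in which stationary distribution weights the sensitivity term. In the fully observable case, that asymmetry is what lets the difference formula drive policy iteration while the derivative formula drives gradient estimation.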

Published In

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, Volume 38, Issue 6
December 2008
230 pages

Publisher

IEEE Press

Author Tags

  1. Finite-state controller (FSC)
  2. gradient estimation
  3. partially observable Markov decision processes (POMDPs)
  4. policy iteration
  5. sensitivity analysis

Qualifiers

  • Research-article

Cited By

  • (2018) A novel POMDP-based server RAM caching algorithm for VoD systems. Multimedia Tools and Applications 77:10 (13023-13045). DOI: 10.1007/s11042-017-4930-4. Online publication date: 1-May-2018
  • (2017) A POMDP framework for forwarding mechanism in named data networking. Computer Networks: The International Journal of Computer and Telecommunications Networking 112:C (167-175). DOI: 10.1016/j.comnet.2016.11.005. Online publication date: 15-Jan-2017
  • (2016) Transmission Schemes for Multicasting Hard Deadline Constrained Prioritized Data in Wireless Multimedia Streaming. IEEE Transactions on Wireless Communications 15:3 (1631-1641). DOI: 10.1109/TWC.2015.2493546. Online publication date: 8-Mar-2016
  • (2012) Admission control with elastic QoS for video on demand systems. International Journal of Automation and Computing 9:5 (467-473). DOI: 10.1007/s11633-012-0668-7. Online publication date: 1-Oct-2012
  • (2011) Observable subspace solution for irreducible POMDPs with infinite horizon. Proceedings of the Seventh Annual Workshop on Cyber Security and Information Intelligence Research (1-1). DOI: 10.1145/2179298.2179392. Online publication date: 12-Oct-2011
