Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Optimizing Stochastic Control through State TransitionSeparability and Resource-Utility Exchange

Published: 06 September 2024 Publication History

Abstract

In the realm of stochastic control, particularly in the fields of economics and engineering, Markov Decision Processes (MDP's) are employed to represent various processes ranging from asset management to transportation logistics. Upon closer examination these constrained MDP's often exhibit specific causal structures concerning the dynamics of transitions and rewards. Thus, leveraging this structure can facilitate computational simplifications for determining the optimal policy. This study introduces a framework, which we denote as SD-MDP, in which we disentangle the causal structure of state transition and reward function dynamics. Through this method, we are able to establish theoretical guarantees on improvements in computational efficiency compared to standard MDP solver (such as linear programming). We further derive error bounds on the optimal value approximation via Monte Carlo simulation for this family of stochastic control problems.

References

[1]
John C Gittins. 1979. Bandit processes and dynamic allocation indices. Journal of the Royal Statistical Society Series B: Statistical Methodology, 41, 2, 148--164.
[2]
G.H. Hardy, J.E. Littlewood, and G. P´olya. 1952. Inequalities. Cambridge Mathematical Library. Cambridge University Press. isbn: 9780521358804.
[3]
Yangyi Lu, Amirhossein Meisami, and Ambuj Tewari. 2022. Efficient reinforcement learning with prior causal knowledge. In Conference on Causal Learning and Reasoning. PMLR, 526--541.
[4]
Jean-Paul Watson and David L Woodruff. 2011. Progressive hedging innovations for a class of stochastic mixed-integer resource allocation problems. Computational Management Science, 8, 355--370.

Index Terms

  1. Optimizing Stochastic Control through State TransitionSeparability and Resource-Utility Exchange
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM SIGMETRICS Performance Evaluation Review
        ACM SIGMETRICS Performance Evaluation Review  Volume 52, Issue 2
        September 2024
        61 pages
        DOI:10.1145/3695411
        • Editor:
        • Bo Ji
        Issue’s Table of Contents
        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 06 September 2024
        Published in SIGMETRICS Volume 52, Issue 2

        Check for updates

        Qualifiers

        • Article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 7
          Total Downloads
        • Downloads (Last 12 months)7
        • Downloads (Last 6 weeks)2
        Reflects downloads up to 10 Nov 2024

        Other Metrics

        Citations

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media