Nothing Special   »   [go: up one dir, main page]

skip to main content
article

A mean field approach for optimization in discrete time

Published: 01 March 2011 Publication History

Abstract

This paper investigates the limit behavior of Markov decision processes made of independent objects evolving in a common environment, when the number of objects ( N ) goes to infinity. In the finite horizon case, we show that when the number of objects becomes large, the optimal cost of the system converges to the optimal cost of a discrete time system that is deterministic. Convergence also holds for optimal policies. We further provide bounds on the speed of convergence by proving second order results that resemble central limits theorems for the cost and the state of the Markov decision process, with explicit formulas for the limit. These bounds (of order $1/\sqrt{N}$ ) are proven to be tight in a numerical example. One can even go further and get convergence of order $\sqrt{\log N}/N$ to a stochastic system made of the mean field limit and a Gaussian term. Our framework is applied to a brokering problem in grid computing. Several simulations with growing numbers of processors are reported. They compare the performance of the optimal policy of the limit system used in the finite case with classical policies by measuring its asymptotic gain. Several extensions are also discussed. In particular, for infinite horizon cases with discounted costs, we show that first order limits hold and that second order results also hold as long as the discount factor is small enough. As for infinite horizon cases with non-discounted costs, examples show that even the first order limits may not hold.

References

[1]
Benaïm M, Le Boudec J-Y (2008) A class of mean field interaction models for computer and communication systems. Perform Eval 65(11-12):823-838.
[2]
Berten V, Gaujal B (2007a) Brokering strategies in computational grids using stochastic prediction models. Parallel Comput (Special Issue on Large Scale Grids).
[3]
Berten V, Gaujal B (2007b) Grid brokering for batch allocation using indexes. In: Euro-FGI NETCOOP. LNCS, Avignon, France.
[4]
Bobbio A, Gribaudo M, Telek M (2008) Analysis of large scale interacting systems by mean field method. In: 5th international conference on quantitative evaluation of systems (QEST), St Malo, pp 215-224.
[5]
Bordenave C, Anantharam V (2007) Optimal control of interacting particle systems. Tech Rep 00397327, CNRS Open-Archive HAL.
[6]
Bordenave C, McDonald D, Proutière A (2010) A particle system in interaction with a rapidly varying environment: mean field limits and applications. Networks and Heterogeneous Media (NHM) 5(1):31-62.
[7]
Borkar V (2008) Stochastic approximation: a dynamical systems viewpoint. Cambridge University Press.
[8]
Durrett R (1991) Probability: theory and examples. Wadsworth & Brooks/Cole.
[9]
EGEE (2010) Enabling grids for E-science.
[10]
Gast N, Gaujal B (2009) A mean field approach for optimization in particle systems and applications. In: Fourth international conference on performance evaluation methodologies and tools, ValueTools.
[11]
Gast N, Gaujal B, Le Boudec J-Y (2010) Mean field for Markov decision processes: from discrete to continuous optimization. Tech rep, INRIA.
[12]
Graham C (2000) Chaoticity on path space for a queueing network with selection of the shortest queue among several. J Appl Probab 37:198-211.
[13]
Hochbaum DS (1996) Approximation algorithms for NP-hard problems. PWS Publishing Co. Boston, MA, USA.
[14]
Hoeffding W (1963) Probability inequalities for sums of bounded random variables. J Am Stat Assoc 58(301):13-30.
[15]
Kurtz T (1978) Strong approximation theorems for density dependent Markov chains. Stoch Process Their Appl. Elsevier.
[16]
Le Boudec J-Y, McDonald D, Mundinger J (2007) A generic mean field convergence result for systems of interacting objects. QEST 2007:3-18.
[17]
Palmer J, Mitrani I (2005) Optimal and heuristic policies for dynamic server allocation. J Parallel Distrib Comput 65(10):1204-1211. Special issue: design and performance of networks for super-, cluster-, and grid-computing (part I).
[18]
Papadimitriou CH, Tsitsiklis JN (1999) The complexity of optimal queueing network control. Math Oper Res 24:293-305.
[19]
Puterman ML (2005) Markov decision processes: discrete stochastic dynamic programming. Wiley-Interscience.
[20]
Rolski T (1983) Comparison theorems for queues with dependent interarrival times. In: Lecture notes in control and information sciences, vol 60. Springer, pp 42-71.
[21]
Schwartz JT (1969) Nonlinear functional analysis. Gordon and Breach Science Publishers, New York.
[22]
Weber RR, Weiss G (1990) On an index policy for restless bandits. J Appl Probab 27:637-648.
[23]
Whittle P (1988) A celebration of applied probability. J Appl Probab Spec 25 A (chap. restless bandits: activity allocation in a changing world): 287-298.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Discrete Event Dynamic Systems
Discrete Event Dynamic Systems  Volume 21, Issue 1
March 2011
135 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 01 March 2011

Author Tags

  1. Brokering
  2. Markov decision process
  3. Mean field

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Mean Field Markov Decision ProcessesApplied Mathematics and Optimization10.1007/s00245-023-09985-188:1Online publication date: 10-Apr-2023
  • (2022)On the Whittle index of Markov modulated restless banditsQueueing Systems: Theory and Applications10.1007/s11134-022-09737-y102:3-4(373-430)Online publication date: 1-Dec-2022
  • (2021)A mean field absorbing control model for interacting objects systemsDiscrete Event Dynamic Systems10.1007/s10626-021-00339-z31:3(349-372)Online publication date: 1-Sep-2021
  • (2021)Discrete‐time mean‐field stochastic linear‐quadratic optimal control problem with finite horizonAsian Journal of Control10.1002/asjc.230623:2(979-989)Online publication date: 1-Mar-2021
  • (2020)Refined Mean Field Analysis: The Gossip Shuffle Protocol RevisitedCoordination Models and Languages10.1007/978-3-030-50029-0_15(230-239)Online publication date: 15-Jun-2020
  • (2019)A Refined Mean Field Approximation for Synchronous Population ProcessesACM SIGMETRICS Performance Evaluation Review10.1145/3305218.330523046:2(30-32)Online publication date: 17-Jan-2019
  • (2018)Asymptotic Optimal Control of Markov-Modulated Restless BanditsProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/31794102:1(1-25)Online publication date: 3-Apr-2018
  • (2017)Stein's Method for Mean Field Approximations in Light and Heavy Traffic RegimesProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/30844491:1(1-27)Online publication date: 13-Jun-2017
  • (2016)Discrete-Time Control for Systems of Interacting Objects with Unknown Random Disturbance DistributionsApplied Mathematics and Optimization10.1007/s00245-015-9312-674:1(197-227)Online publication date: 1-Aug-2016
  • (2014)Impact of demand-response on the efficiency and prices in real-time electricity marketsProceedings of the 5th international conference on Future energy systems10.1145/2602044.2602052(171-182)Online publication date: 11-Jun-2014
  • Show More Cited By

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media