Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/3306127.3331747acmconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
research-article

An Evolutionary Approach to Find Optimal Policies with an Agent-Based Simulation

Published: 08 May 2019 Publication History

Abstract

In this paper, we introduce a new agent-based method to build a decision-aid tool aimed to improve policy design. In our approach, a policy is defined as a set of levers, modelling the set of actions, the means to impact a complex system. Our method is generic, as it could be applied to any domain, and be coupled with any agent-based simulator. We could deal not only with simple levers (a single variable whose value is modified) but also complex ones (multiple variable modifications, qualitative effects, ...), unlike most optimization methods. It is based on the evolutionary algorithm CMA-ES, coupled with a normalized and aggregated fitness function. The fitness is normalized using estimated Ideal (best policy) and Nadir (worst policy) values, these values being dynamically computed during the execution of CMA-ES through a Pareto Front estimated with the ABM simulation. Moreover, to deal with complex levers, we introduce the FSM-branching algorithm, where a Finite State Machine (FSM) determines whether a complex policy can potentially be improved or has to be aborted. We tested our method with Economic Policies on the French Labor Market (FLM), allowing the modification of multiple elements of the FLM, and we compared the results to the reference, the FLM without any policy applied. The policies studied here comprise simple and complex levers. This experience shows the viability of our approach, the efficiency of our algorithms and illustrates how this combination of evolutionary optimization, multi-criteria aggregation and agent-based simulation could help any policy-maker to design better policies.

References

[1]
Charles Audet. 2014. A Survey on Direct Search Methods for Blackbox Optimization and Their Applications .Springer New York, New York, NY, 31--56.
[2]
Jurgen Branke, Kalyanmoy Deb, Henning Dierolf, and Matthias Osswald. 2004. Finding knees in multi-objective optimization. In In the Eighth Conference on Parallel Problem Solving from Nature (PPSN VIII). Lecture Notes in Computer Science . Springer-Verlag, 722--731.
[3]
Lucian Bucs oniu, Robert Babuvs ka, and Bart De Schutter. 2010. Multi-agent Reinforcement Learning: An Overview .Springer Berlin Heidelberg, Berlin, Heidelberg, 183--221.
[4]
Adiel Teixeira de Almeida, Jonatas Araujo de Almeida, Ana Paula Cabral Seixas Costa, and Adiel Teixeira de Almeida-Filho. 2016. A new method for elicitation of criteria weights in additive models: Flexible and interactive tradeoff. European Journal of Operational Research, Vol. 250, 1 (2016), 179 -- 191.
[5]
K. Deb and H. Jain. 2014. An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints. IEEE Transactions on Evolutionary Computation, Vol. 18, 4 (Aug 2014), 577--601.
[6]
Lucie Galand and Patrice Perny. 2006. Search for Compromise Solutions in Multiobjective State Space Graphs. In Proceedings of the 2006 Conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva Del Garda, Italy . IOS Press, Amsterdam, The Netherlands, The Netherlands, 93--97. http://dl.acm.org/citation.cfm?id=1567016.1567042
[7]
Oliver Goudet, Jean-Daniel Kant, and Gérard Ballot. 2014. Forbidding Fixed Duration Contracts: Unfolding the Opposing Consequences with a Multi-Agent Model of the French Labor Market. Advances in Artificial Economics . Springer, 151--167.
[8]
Olivier Goudet, Jean-Daniel Kant, and Gérard Ballot. 2017. WorkSim: A Calibrated Agent-Based Model of the Labor Market Accounting for Workers' Stocks and Gross Flows. Comput. Econ., Vol. 50, 1 (June 2017), 21--68.
[9]
Nikolaus Hansen. 2011. A CMA-ES for Mixed-Integer Nonlinear Optimization. Research Report RR-7751. INRIA . https://hal.inria.fr/inria-00629689
[10]
Nikolaus Hansen, Anne Auger, Raymond Ros, Steffen Finck, and Petr Povs'ik. 2010. Comparing Results of 31 Algorithms from the Black-box Optimization Benchmarking BBOB-2009. In Proceedings of the 12th Annual Conference Companion on Genetic and Evolutionary Computation (GECCO '10). ACM, New York, NY, USA, 1689--1696.
[11]
Nikolaus Hansen and Andreas Ostermeier. 2001. Completely derandomized self-adaptation in evolution strategies. Evolutionary computation, Vol. 9, 2 (2001), 159textendash195.
[12]
Stephan Hutterer, Michael Affenzeller, and Franz Auinger. 2012. Evolutionary Optimization of Multi-agent Controlstrategies for Electric Vehicle Charging. In Proceedings of the 14th Annual Conference Companion on Genetic and Evolutionary Computation (GECCO '12). ACM, New York, NY, USA, 3--10.
[13]
Leslie Pack Kaelbling, Michael L. Littman, and Andrew W. Moore. 1996. Reinforcement Learning: A Survey. CoRR, Vol. cs.AI/9605103 (1996). http://arxiv.org/abs/cs.AI/9605103
[14]
J. Knowles and D. Corne. 1999. The Pareto archived evolution strategy: a new baseline algorithm for Pareto multiobjective optimisation. In Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), Vol. 1. 98--105 Vol. 1.
[15]
R.T. Marler and J.S. Arora. 2004. Survey of multi-objective optimization methods for engineering. Structural and Multidisciplinary Optimization, Vol. 26, 6 (01 Apr 2004), 369--395.
[16]
M.Ehrgott and D.Tenfelde-Podehl. 2003. Computation of ideal and Nadir values and implications for their use in MCDM methods. European Journal of Operational Research, Vol. 151 (2003), 119--139.
[17]
Olaf Mersmann, Mike Preuss, and Heike Trautmann. 2010. Benchmarking Evolutionary Algorithms: Towards Exploratory Landscape Analysis. In Proceedings of the 11th International Conference on Parallel Problem Solving from Nature: Part I (PPSN'10). Springer-Verlag, Berlin, Heidelberg, 73--82. http://dl.acm.org/citation.cfm?id=1885031.1885040
[18]
Francois Michon. 2008. France: Temporary agency work and collective bargaining in the EU. (18 Dec. 2008). https://www.eurofound.europa.eu/publications/report/2008/france-temporary-agency-work-and-collective-bargaining-in-the-eu
[19]
Thierry Pénard, Michel Sollogoub, and Valérie Ulrich. 1999. Hiring contracts, Wage, and Job Satisfaction : Theory and evidence on French low qualified youths. (June 1999). https://perso.univ-rennes1.fr/thierry.penard/biblio/jole.pdf
[20]
Shani Rostami and Alex Shenfield. 2012. CMA-PAES: Pareto archived evolution strategy using covariance matrix adaptation for Multi-Objective Optimisation. In 2012 12th UK Workshop on Computational Intelligence (UKCI). 1--8.
[21]
Ralph E. Steuer. 1989. Multiple Criteria Optimization: Theory, Computation, and Application .Krieger. 89032917 https://books.google.fr/books?id=tSA_PgAACAAJ
[22]
Ralph E. Steuer and Eng-Ung Choo. 1983. An Interactive Weighted Tchebycheff Procedure for Multiple Objective Programming. Math. Program., Vol. 26, 3 (Oct. 1983), 326--344.
[23]
Freek Stulp and Olivier Sigaud. 2013. Policy Improvement: Between Black-Box Optimization and Episodic Reinforcement Learning. In Proceedings JFPDA. 1--15.
[24]
W.D. Wallis. 2014. The Mathematics of Elections and Voting .Springer International Publishing.
[25]
Handing Wang, Shan He, and Xin Yao. 2015. Nadir Point Estimation for Many-Objective Optimization Problems Based on Emphasized Critical Regions. Soft Computing (Nov 2015).
[26]
Andrzej P. Wierzbicki. 1986. On the completeness and constructiveness of parametric characterizations to vector optimization problems. Operations-Research-Spektrum, Vol. 8, 2 (01 Jun 1986), 73--87.
[27]
Andrzej P. Wierzbicki. 1999. Reference Point Approaches .Springer US, Boston, MA, 237--275.

Cited By

View all
  • (2020)An Information Distribution Method for Avoiding Hunting Phenomenon in Theme ParksProceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3398761.3399071(2050-2052)Online publication date: 5-May-2020

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
AAMAS '19: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems
May 2019
2518 pages
ISBN:9781450363099

Sponsors

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Richland, SC

Publication History

Published: 08 May 2019

Check for updates

Author Tags

  1. agent-based simulation
  2. evolutionary optimization
  3. labor economics
  4. multi-criteria aggregation
  5. policy design

Qualifiers

  • Research-article

Conference

AAMAS '19
Sponsor:

Acceptance Rates

AAMAS '19 Paper Acceptance Rate 193 of 793 submissions, 24%;
Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2020)An Information Distribution Method for Avoiding Hunting Phenomenon in Theme ParksProceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3398761.3399071(2050-2052)Online publication date: 5-May-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media