An Evolutionary Approach to Find Optimal Policies with an Agent-Based Simulation

Authors:

Nicolas De Bufala,

Jean-Daniel Kant

AAMAS '19: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems

Pages 610 - 618

Published: 08 May 2019

Abstract

In this paper, we introduce a new agent-based method for building a decision-aid tool aimed at improving policy design. In our approach, a policy is defined as a set of levers, which model the actions available to influence a complex system. Our method is generic: it can be applied to any domain and coupled with any agent-based simulator. Unlike most optimization methods, it handles not only simple levers (a single variable whose value is modified) but also complex ones (multiple variable modifications, qualitative effects, ...). It is based on the evolutionary algorithm CMA-ES, coupled with a normalized and aggregated fitness function. The fitness is normalized using estimated Ideal (best policy) and Nadir (worst policy) values, which are computed dynamically during the execution of CMA-ES from a Pareto front estimated with the agent-based model (ABM) simulation. Moreover, to deal with complex levers, we introduce the FSM-branching algorithm, in which a Finite State Machine (FSM) determines whether a complex policy can potentially be improved or has to be aborted. We tested our method on economic policies for the French Labor Market (FLM), allowing the modification of multiple elements of the FLM, and compared the results to the reference case, the FLM without any policy applied. The policies studied here comprise both simple and complex levers. This experiment shows the viability of our approach and the efficiency of our algorithms, and illustrates how this combination of evolutionary optimization, multi-criteria aggregation and agent-based simulation could help any policy-maker design better policies.
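To make the normalization step concrete, the following is a minimal Python sketch of a fitness that is normalized with Ideal and Nadir values taken from an estimated Pareto front. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the aggregation operator, so a weighted sum is assumed here, all criteria are assumed to be minimized, and every function name (`dominates`, `pareto_front`, `ideal_and_nadir`, `normalized_fitness`) is hypothetical.

```python
from typing import List, Sequence, Tuple

Objectives = Sequence[float]  # one value per criterion; minimization is assumed


def dominates(a: Objectives, b: Objectives) -> bool:
    """True if a is no worse than b on every criterion and strictly better on one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(points: List[Objectives]) -> List[Objectives]:
    """Keep only the points that no other evaluated point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def ideal_and_nadir(front: List[Objectives]) -> Tuple[List[float], List[float]]:
    """Ideal = best value per criterion, Nadir = worst value per criterion on the front."""
    columns = list(zip(*front))
    return [min(c) for c in columns], [max(c) for c in columns]


def normalized_fitness(obj: Objectives, ideal: Sequence[float],
                       nadir: Sequence[float], weights: Sequence[float]) -> float:
    """Scale each criterion to the Ideal-Nadir range, then aggregate (weighted sum assumed)."""
    total = 0.0
    for value, lo, hi, w in zip(obj, ideal, nadir, weights):
        span = hi - lo
        scaled = 0.0 if span == 0 else (value - lo) / span
        total += w * scaled
    return total


# Hypothetical simulation outputs for four candidate policies, two criteria each.
evaluated = [(10.0, 30.0), (8.0, 45.0), (12.0, 50.0), (9.0, 35.0)]
front = pareto_front(evaluated)
ideal, nadir = ideal_and_nadir(front)
score = normalized_fitness((9.0, 35.0), ideal, nadir, weights=(0.5, 0.5))
```

In an optimization loop of the kind the abstract describes, the front, and therefore the Ideal and Nadir estimates, would be refreshed as new policies are simulated, so the same policy could receive a different normalized score in later generations.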

Cited By

Yamada H., Kamiyama N., El Fallah Seghrouchni A., Sukthankar G., An B., Yorke-Smith N. (2020). An Information Distribution Method for Avoiding Hunting Phenomenon in Theme Parks. Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems, 2050-2052. Online publication date: 5-May-2020. DOI: 10.5555/3398761.3399071
https://dl.acm.org/doi/10.5555/3398761.3399071

Index Terms

An Evolutionary Approach to Find Optimal Policies with an Agent-Based Simulation
1. Applied computing
  1. Law, social and behavioral sciences
    1. Economics
2. Computing methodologies
  1. Artificial intelligence
    1. Distributed artificial intelligence
      1. Multi-agent systems
  2. Modeling and simulation
    1. Simulation types and techniques
      1. Agent / discrete models

Information & Contributors

Information

Published In

AAMAS '19: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems

May 2019

2518 pages

ISBN: 9781450363099

General Chairs:
Edith Elkind, University of Oxford, UK
Manuela Veloso, CMU (on leave), JPMorgan, USA

Program Chairs:
Noa Agmon, Bar-Ilan University, Israel
Matthew E. Taylor, Borealis AI, Canada

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Richland, SC

Qualifiers

Research-article

Conference

AAMAS '19

Sponsor:

SIGAI

AAMAS '19: International Conference on Autonomous Agents and Multiagent Systems

May 13 - 17, 2019

Montreal QC, Canada

Acceptance Rates

AAMAS '19 Paper Acceptance Rate 193 of 793 submissions, 24%;

Overall Acceptance Rate 1,155 of 5,036 submissions, 23%
