Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3520304.3534031acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article
Open access

Bingo: a customizable framework for symbolic regression with genetic programming

Published: 19 July 2022 Publication History

Abstract

In this paper, we introduce Bingo, a flexible and customizable yet performant Python framework for symbolic regression with genetic programming. Bingo maintains a modular code structure for simple abstraction and easily swappable components. Fitness functions, selection methods, and constant optimization methods allow for easy problem-specific customization. Bingo also maintains several features for increased efficiency such as parallelism, equation simplification, and a C++ backend. We compare Bingo's performance to other genetic programming for symbolic regression (GPSR) methods to show that it is both competitive and flexible.

References

[1]
Geoffrey F. Bomarito, Patrick E. Leser, Nolan M. Strauss, Karl M. Garbrecht, and Jacob D. Hochhalter. 2022. Bayesian Model Selection for Reducing Bloat and Overfitting in Genetic Programming for Symbolic Regression. In Proceedings of the 24th annual conference on Genetic and evolutionary computation (GECCO '22). Association for Computing Machinery, Boston, MA, USA.
[2]
Geoffrey F. Bomarito, Tyler S. Townsend, Kristen M. Stewart, Kathryn V. Esham, John M. Emery, and Jacob D. Hochhalter. 2021. Development of interpretable, data-driven plasticity models with symbolic regression. 252 (2021), 106557.
[3]
Bogdan Burlacu, Gabriel Kronberger, and Michael Kommenda. 2020. Operon C++: an efficient genetic programming framework for symbolic regression. In Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion. Association for Computing Machinery, New York, NY, USA, 1562--1570.
[4]
Chetan Chudasama, S. M. Shah, and Mahesh Panchal. 2011. Comparison of parents selection methods of genetic algorithm for TSP. In International Conference on Computer Communication and Networks CSI-COMNET-2011, Proceedings. Citeseer, 85--87.
[5]
Joel S. Cohen. 2003. Computer algebra and symbolic computation: Mathematical methods. AK Peters/CRC Press.
[6]
Lisandro Dalcín, Rodrigo Paz, and Mario Storti. 2005. MPI for Python. J. Parallel and Distrib. Comput. 65, 9 (Sept. 2005), 1108--1115.
[7]
Vinicius Veloso De Melo, Benjamin Fowler, and Wolfgang Banzhaf. 2015. Evaluating methods for constant optimization of symbolic regression benchmark problems. In 2015 Brazilian conference on intelligent systems (BRACIS). IEEE, 25--30.
[8]
François-Michel De Rainville, Félix-Antoine Fortin, Marc-André Gardner, Marc Parizeau, and Christian Gagné. 2014. DEAP: enabling nimbler evolutions. ACM SIGEVOlution 6, 2 (2014), 17--26.
[9]
Z-Flores Emigdio, Leonardo Trujillo, Oliver Schütze, Pierrick Legrand, et al. 2014. Evaluating the effects of local search in genetic programming. In EVOLVE-A Bridge between Probability, Set Oriented Numerics, and Evolutionary Computation V. Springer, 213--228.
[10]
Daniel JA Hills, Adrian M Grütter, and Jonathan J Hudson. 2015. An algorithm for discovering Lagrangians automatically from data. PeerJ Computer Science 1 (2015), e31.
[11]
Wenzel Jakob, Jason Rhinelander, and Dean Moldovan. 2016. pybind11 --- Seamless operability between C++11 and Python. https://github.com/pybind/pybind11
[12]
Khalid Jebari and Mohammed Madiafi. 2013. Selection methods for genetic algorithms. International Journal of Emerging Sciences 3, 4 (2013), 333--344.
[13]
George Em Karniadakis, Ioannis G Kevrekidis, Lu Lu, Paris Perdikaris, Sifan Wang, and Liu Yang. 2021. Physics-informed machine learning. Nature Reviews Physics 3, 6 (2021), 422--440.
[14]
Michael Kommenda, Gabriel Kronberger, Stephan Winkler, Michael Affenzeller, and Stefan Wagner. 2013. Effects of constant optimization by nonlinear least squares minimization in symbolic regression. In Proceedings of the 15th annual conference companion on Genetic and evolutionary computation. 1121--1128.
[15]
John R. Koza. 1992. Genetic programming: on the programming of computers by means of natural selection. MIT Press, Cambridge, Mass.
[16]
William La Cava, Patryk Orzechowski, Bogdan Burlacu, Fabrício Olivetti de França, Marco Virgolin, Ying Jin, Michael Kommenda, and Jason H. Moore. 2021. Contemporary Symbolic Regression Methods and their Relative Performance. arXiv:2107.14351 [cs] (July 2021). http://arxiv.org/abs/2107.14351 arXiv: 2107.14351.
[17]
William La Cava, Tilak Raj Singh, James Taggart, Srinivas Suri, and Jason H. Moore. 2018. Learning concise representations for regression by evolving networks of trees. (2018).
[18]
Samir W. Mahfoud. 1995. Niching Methods for Genetic Algorithms. Ph.D. Dissertation. University of Illinois at Urbana-Champaign.
[19]
W. N. Martin, Jens Lienig, and James P. Cohoon. 1997. Island (migration) models: evolutionary algorithms based on punctuated equilibria. Handbook of Evolutionary Computation 6 (1997), 101--124.
[20]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024--8035. http://papers.neurips.cc/paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf
[21]
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 12 (2011), 2825--2830.
[22]
Brenden K. Petersen, Mikel Landajuela Larma, T. Nathan Mundhenk, Claudio P. Santiago, Soo K. Kim, and Joanne T. Kim. 2021. Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients. arXiv:1912.04871 [cs, stat] (April 2021). http://arxiv.org/abs/1912.04871 arXiv:1912.04871.
[23]
Michael Schmidt and Hod Lipson. 2007. Comparison of tree and graph encodings as function of problem complexity. In Proceedings of the 9th annual conference on Genetic and evolutionary computation. 1674--1679.
[24]
Michael Schmidt and Hod Lipson. 2009. Distilling free-form natural laws from experimental data. science 324, 5923 (2009), 81--85.
[25]
Michael Schmidt and Hod Lipson. 2010. Symbolic regression of implicit equations. In Genetic Programming Theory and Practice VII. Springer, 73--85.
[26]
Michael D. Schmidt and Hod Lipson. 2008. Coevolution of Fitness Predictors. IEEE Transactions on Evolutionary Computation 12, 6 (Dec. 2008), 736--749.
[27]
Michael D. Schmidt and Hod Lipson. 2010. Age-fitness pareto optimization. In Proceedings of the 12th annual conference on Genetic and evolutionary computation (GECCO '10). Association for Computing Machinery, New York, NY, USA, 543--544.
[28]
Marco Virgolin, Tanja Alderliesten, Cees Witteveen, and Peter A. N. Bosman. 2017. Scalable genetic programming by gene-pool optimal mixing and input-space entropy-based building-block learning. In Proceedings of the Genetic and Evolutionary Computation Conference. ACM, Berlin Germany, 1041--1048.
[29]
Pauli Virtanen, Ralf Gommers, Travis E. Oliphant, et al. 2020. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature Methods 17, 3 (March 2020), 261--272.

Cited By

View all
  • (2024)Modeling plasticity-mediated void growth at the single crystal scale: A physics-informed machine learning approachMechanics of Materials10.1016/j.mechmat.2024.105151199(105151)Online publication date: Dec-2024
  • (2024)Interpretable machine learning for microstructure-dependent models of fatigue indicator parametersInternational Journal of Fatigue10.1016/j.ijfatigue.2023.108019178(108019)Online publication date: Jan-2024
  • (2024)Inherently interpretable machine learning solutions to differential equationsEngineering with Computers10.1007/s00366-023-01915-740:4(2349-2361)Online publication date: 1-Aug-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '22: Proceedings of the Genetic and Evolutionary Computation Conference Companion
July 2022
2395 pages
ISBN:9781450392686
DOI:10.1145/3520304
Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. genetic programming
  2. genetic programming for symbolic regression
  3. symbolic regression

Qualifiers

  • Research-article

Conference

GECCO '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)306
  • Downloads (Last 6 weeks)39
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Modeling plasticity-mediated void growth at the single crystal scale: A physics-informed machine learning approachMechanics of Materials10.1016/j.mechmat.2024.105151199(105151)Online publication date: Dec-2024
  • (2024)Interpretable machine learning for microstructure-dependent models of fatigue indicator parametersInternational Journal of Fatigue10.1016/j.ijfatigue.2023.108019178(108019)Online publication date: Jan-2024
  • (2024)Inherently interpretable machine learning solutions to differential equationsEngineering with Computers10.1007/s00366-023-01915-740:4(2349-2361)Online publication date: 1-Aug-2024
  • (2024)The Inefficiency of Genetic Programming for Symbolic RegressionParallel Problem Solving from Nature – PPSN XVIII10.1007/978-3-031-70055-2_17(273-289)Online publication date: 7-Sep-2024
  • (2023)Reducing Overparameterization of Symbolic Regression Models with Equality SaturationProceedings of the Genetic and Evolutionary Computation Conference10.1145/3583131.3590346(1064-1072)Online publication date: 15-Jul-2023
  • (2023)Complementing a continuum thermodynamic approach to constitutive modeling with symbolic regressionJournal of the Mechanics and Physics of Solids10.1016/j.jmps.2023.105472181(105472)Online publication date: Dec-2023

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media