Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3071178.3071287acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Scalable genetic programming by gene-pool optimal mixing and input-space entropy-based building-block learning

Published: 01 July 2017 Publication History

Abstract

The Gene-pool Optimal Mixing Evolutionary Algorithm (GOMEA) is a recently introduced model-based EA that has been shown to be capable of outperforming state-of-the-art alternative EAs in terms of scalability when solving discrete optimization problems. One of the key aspects of GOMEA's success is a variation operator that is designed to extensively exploit linkage models by effectively combining partial solutions. Here, we bring the strengths of GOMEA to Genetic Programming (GP), introducing GP-GOMEA. Under the hypothesis of having little problem-specific knowledge, and in an effort to design easy-to-use EAs, GP-GOMEA requires no parameter specification. On a set of well-known benchmark problems we find that GP-GOMEA outperforms standard GP while being on par with more recently introduced, state-of-the-art EAs. We furthermore introduce Input-space Entropy-based Building-block Learning (IEBL), a novel approach to identifying and encapsulating relevant building blocks (subroutines) into new terminals and functions. On problems with an inherent degree of modularity, IEBL can contribute to compact solution representations, providing a large potential for knock-on effects in performance. On the difficult, but highly modular Even Parity problem, GP-GOMEA+IEBL obtains excellent scalability, solving the 14-bit instance in less than 1 hour.

References

[1]
P. J. Angeline and Jordan B. Pollack. 1994. Coevolving High-Level Representations. In Artificial Life III, C. Langton (Ed.). Addison-Wesley, 55--71.
[2]
L. Breiman, J. Friedman, C. J. Stone, and R. A. Olshen. 1984. Classification and regression trees. CRC press.
[3]
Y. P. Chen, T.-L. Yu, K. Sastry, and D. E. Goldberg. 2007. A survey of linkage learning techniques in genetic and evolutionary algorithms. IlliGAL report 2007014 (2007).
[4]
A. Dessì, A. Giani, and A. Starita. 1999. An Analysis of Automatic Subroutine Discovery in Genetic Programming. In Proceedings of the 1st Annual Conference on Genetic and Evolutionary Computation - Volume 2. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 996--1001.
[5]
R. M. Downing. 2005. Evolving binary decision diagrams using implicit neutrality. In 2005 IEEE Congress on Evolutionary Computation, Vol. 3. IEEE, 2107--2113.
[6]
R. Ffrancon and M. Schoenauer. 2015. Memetic semantic genetic programming. In Proc. of the 2015 Annual Conf. on Genetic and Evol. Comp. ACM, 1023--1030.
[7]
I. Gronau and S. Moran. 2007. Optimal implementations of UPGMA and other common clustering algorithms. Inform. Process. Lett. 104, 6 (2007), 205--210.
[8]
D. Jackson and A. P. Gibbons. 2007. Layered Learning in Boolean GP Problems. Springer Berlin Heidelberg, Berlin, Heidelberg, 148--159.
[9]
M. Keijzer, C. Ryan, and M. Cattolico. 2004. Run Transferable Libraries --- Learning Functional Bias in Problem Domains. Springer, Berlin, 531--542.
[10]
E. E. Korkmaz and G. Uçoluk. 2003. Design and usage of a new benchmark problem for genetic programming. In International Symposium on Computer and Information Sciences. Springer, 561--567.
[11]
J. R. Koza. 1992. Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge, MA, USA.
[12]
L. O. V. B. Oliveira, F. E. B. Otero, G. L. Pappa, and J. Albinati. 2015. Sequential Symbolic Regression with Genetic Programming. Springer, 73--90.
[13]
T. P. Pawlak, B. Wieloch, and K. Krawiec. 2015. Semantic backpropagation for designing search operators in genetic programming. IEEE Transactions on Evolutionary Computation 19, 3 (2015), 326--340.
[14]
J. C. Pereira and F. G. Lobo. 2015. Java Implementation of a Parameter-less Evolutionary Portfolio. CoRR abs/1506.08867 (2015).
[15]
S. C. Roberts, D. Howard, and J. R. Koza. 2001. Evolving Modules in Genetic Programming by Subtree Encapsulation. In Proceedings of the 4th European Conference on Genetic Programming. Springer-Verlag, London, UK, 160--175.
[16]
K. Sastry and D. E. Goldberg. 2003. Probabilistic model building and competent genetic programming. In Genetic Prog. Theory and Practice. Springer, 205--220.
[17]
D. Thierens and P. A. N. Bosman. 2013. Hierarchical Problem Solving with the Linkage Tree Genetic Algorithm. In Proceedings of the 15th Annual Conference on Genetic and Evolutionary Computation. ACM, New York, NY, USA, 877--884.
[18]
R. A. Watson and T. Jansen. 2007. A Building-block Royal Road Where Crossover is Provably Essential. In Proceedings of the 9th Annual Conference on Genetic and Evolutionary Computation. ACM, New York, NY, USA, 1452--1459.

Cited By

View all
  • (2024)M5GP: Parallel Multidimensional Genetic Programming with Multidimensional Populations for Symbolic RegressionMathematical and Computational Applications10.3390/mca2902002529:2(25)Online publication date: 18-Mar-2024
  • (2024)Parameterless Gene-Pool Optimal Mixing Evolutionary AlgorithmsEvolutionary Computation10.1162/evco_a_00338(1-27)Online publication date: 27-Jun-2024
  • (2024)Improving the efficiency of GP-GOMEA for higher-arity operatorsProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654118(971-979)Online publication date: 14-Jul-2024
  • Show More Cited By

Index Terms

  1. Scalable genetic programming by gene-pool optimal mixing and input-space entropy-based building-block learning

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference
      July 2017
      1427 pages
      ISBN:9781450349208
      DOI:10.1145/3071178
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 July 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. building blocks
      2. genetic programming
      3. linkage learning
      4. optimal mixing
      5. program synthesis

      Qualifiers

      • Research-article

      Funding Sources

      • Kinderen Kankervrij foundation

      Conference

      GECCO '17
      Sponsor:

      Acceptance Rates

      GECCO '17 Paper Acceptance Rate 178 of 462 submissions, 39%;
      Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)61
      • Downloads (Last 6 weeks)3
      Reflects downloads up to 14 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)M5GP: Parallel Multidimensional Genetic Programming with Multidimensional Populations for Symbolic RegressionMathematical and Computational Applications10.3390/mca2902002529:2(25)Online publication date: 18-Mar-2024
      • (2024)Parameterless Gene-Pool Optimal Mixing Evolutionary AlgorithmsEvolutionary Computation10.1162/evco_a_00338(1-27)Online publication date: 27-Jun-2024
      • (2024)Improving the efficiency of GP-GOMEA for higher-arity operatorsProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654118(971-979)Online publication date: 14-Jul-2024
      • (2024)The Inefficiency of Genetic Programming for Symbolic RegressionParallel Problem Solving from Nature – PPSN XVIII10.1007/978-3-031-70055-2_17(273-289)Online publication date: 7-Sep-2024
      • (2023)Predicting ordinary differential equations with transformersProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3618491(1978-2002)Online publication date: 23-Jul-2023
      • (2023)Mini-Batching, Gradient-Clipping, First- versus Second-Order: What Works in Gradient-Based Coefficient Optimisation for Symbolic Regression?Proceedings of the Genetic and Evolutionary Computation Conference10.1145/3583131.3590368(1127-1136)Online publication date: 15-Jul-2023
      • (2023)Reducing Overparameterization of Symbolic Regression Models with Equality SaturationProceedings of the Genetic and Evolutionary Computation Conference10.1145/3583131.3590346(1064-1072)Online publication date: 15-Jul-2023
      • (2023)Explainable Artificial Intelligence by Genetic Programming: A SurveyIEEE Transactions on Evolutionary Computation10.1109/TEVC.2022.322550927:3(621-641)Online publication date: Jun-2023
      • (2023)Problem of Domain/Building Block Preservation in the Evolution of Biological Macromolecules and Evolutionary ComputationIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2022.317590820:2(1345-1362)Online publication date: 1-Mar-2023
      • (2023)Information fusion via symbolic regressionInformation Fusion10.1016/j.inffus.2022.11.03092:C(326-335)Online publication date: 1-Apr-2023
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media