Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1276958.1277277acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
Article

Generalisation of the limiting distribution of program sizes in tree-based genetic programming and analysis of its effects on bloat

Published: 07 July 2007 Publication History

Abstract

Recent research [1] has found that standard sub-tree crossover with uniform selection of crossover points, in the absence of fitness pressure, pushes a population of GP trees towards a Lagrange distribution of tree sizes. However, the result applied to the case of single arity function plus leaf node combinations, e.g., unary, binary, ternary, etc trees only. In this paper we extend those findings and show that the same distribution is also applicable to the more general case where the function set includes functions of mixed arities. We also provide empirical evidence that strongly corroborates this generalisation. Both predicted and observed results show a distinct bias towards the sampling of shorter programs irrespective of the mix of function arities used. Practical applications and implications of this knowledge are investigated with regard to search efficiency and program bloat. Work is also presented regarding the applicability of the theory to the traditional 90%-function 10%-terminal crossover node selection policy.

References

[1]
Riccardo Poli, William B. Langdon, and Stephen Dignum. On the limiting distribution of program sizes in tree-based genetic programming. In Proceedings of the 10th European Conference on Genetic Programming, Lecture Notes in Computer Science, Valencia, Spain, 11--13 April 2007. Forthcoming.
[2]
Riccardo Poli and Nicholas Freitag McPhee. Exact schema theorems for GP with one-point and standard crossover operating on linear structures and their application to the study of the evolution of size. In Julian F. Miller, Marco Tomassini, Pier Luca Lanzi, Conor Ryan, Andrea G. B. Tettamanzi, and William B. Langdon, editors, Genetic Programming, Proceedings of EuroGP'2001, volume 2038 of LNCS, pages 126--142, Lake Como, Italy, 18--20 April 2001. Springer-Verlag.
[3]
Riccardo Poli and Nicholas Freitag McPhee. General schema theory for genetic programming with subtree-swapping crossover: Part II. Evolutionary Computation, 11(2):169--206, June 2003.
[4]
Riccardo Poli and Nicholas Freitag McPhee. General schema theory for genetic programming with subtree-swapping crossover: Part I. Evolutionary Computation, 11(1):53--66, March 2003.
[5]
Sean Luke. Two fast tree-creation algorithms for genetic programming. IEEE Transactions on Evolutionary Computation, 4(3):274--283, September 2000.
[6]
Saeed Ghahramani. Fundamentals of Probability. Prentice-Hall Inc, Upper Saddle River, NJ 07458, 1996.
[7]
John R. Koza. Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge, MA, USA, 1992.
[8]
W. B. Langdon and Riccardo Poli. Foundations of Genetic Programming. Springer-Verlag, 2002.
[9]
Sean Luke. ECJ 13: A Java evolutionary computation library. http://cs.gmu.edu/~eclab/projects/ecj/, 2005.
[10]
Riccardo Poli. A simple but theoretically-motivated method to control bloat in genetic programming. In Conor Ryan, Terence Soule, Maarten Keijzer, Edward Tsang, Riccardo Poli, and Ernesto Costa, editors, Genetic Programming, Proceedings of EuroGP'2003, volume 2610 of LNCS, pages 204--217, Essex, 14--16 April 2003. Springer-Verlag.

Cited By

View all
  • (2024)On the Nature of the Phenotype in Tree Genetic ProgrammingProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654129(868-877)Online publication date: 14-Jul-2024
  • (2024)A Semantic-Based Hoist Mutation Operator for Evolutionary Feature Construction in RegressionIEEE Transactions on Evolutionary Computation10.1109/TEVC.2023.333123428:6(1689-1703)Online publication date: Dec-2024
  • (2024)Univariate Skeleton Prediction in Multivariate Systems Using TransformersMachine Learning and Knowledge Discovery in Databases. Research Track and Demo Track10.1007/978-3-031-70371-3_7(107-125)Online publication date: 22-Aug-2024
  • Show More Cited By
  1. Generalisation of the limiting distribution of program sizes in tree-based genetic programming and analysis of its effects on bloat

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GECCO '07: Proceedings of the 9th annual conference on Genetic and evolutionary computation
    July 2007
    2313 pages
    ISBN:9781595936974
    DOI:10.1145/1276958
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 July 2007

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. bloat
    2. crossover bias
    3. genetic programming
    4. initialisation
    5. program size distribution

    Qualifiers

    • Article

    Conference

    GECCO07
    Sponsor:

    Acceptance Rates

    GECCO '07 Paper Acceptance Rate 266 of 577 submissions, 46%;
    Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 14 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)On the Nature of the Phenotype in Tree Genetic ProgrammingProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654129(868-877)Online publication date: 14-Jul-2024
    • (2024)A Semantic-Based Hoist Mutation Operator for Evolutionary Feature Construction in RegressionIEEE Transactions on Evolutionary Computation10.1109/TEVC.2023.333123428:6(1689-1703)Online publication date: Dec-2024
    • (2024)Univariate Skeleton Prediction in Multivariate Systems Using TransformersMachine Learning and Knowledge Discovery in Databases. Research Track and Demo Track10.1007/978-3-031-70371-3_7(107-125)Online publication date: 22-Aug-2024
    • (2023)A Double Lexicase Selection Operator for Bloat Control in Evolutionary Feature Construction for RegressionProceedings of the Genetic and Evolutionary Computation Conference10.1145/3583131.3590365(1194-1202)Online publication date: 15-Jul-2023
    • (2023)Explainable Artificial Intelligence by Genetic Programming: A SurveyIEEE Transactions on Evolutionary Computation10.1109/TEVC.2022.322550927:3(621-641)Online publication date: Jun-2023
    • (2021)An Exploration of Asocial and Social Learning in the Evolution of Variable-length Structures2021 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC45853.2021.9504900(2307-2314)Online publication date: 28-Jun-2021
    • (2020)Time and Individual Duration in Genetic ProgrammingIEEE Access10.1109/ACCESS.2020.29757538(38692-38713)Online publication date: 2020
    • (2020)Genetic Programming Symbolic Regression: What Is the Prior on the Prediction?Genetic Programming Theory and Practice XVII10.1007/978-3-030-39958-0_11(201-225)Online publication date: 8-May-2020
    • (2019)Mutational Robustness and Structural Complexity in Grammatical Evolution2019 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC.2019.8790010(1338-1344)Online publication date: Jun-2019
    • (2016)Studying bloat control and maintenance of effective code in linear genetic programming for symbolic regressionNeurocomputing10.1016/j.neucom.2015.10.109180:C(79-93)Online publication date: 5-Mar-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media