Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3638530.3654282acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
poster

Multi-task Genetic Programming with Semantic based Crossover for Multi-output Regression

Published: 01 August 2024 Publication History

Abstract

Multi-output regression involves predicting two or more target variables simultaneously. In contrast to its single-output counterpart, multi-output regression poses additional challenges primarily because the target variables are frequently interdependent. Achieving accurate predictions for one variable may necessitate a thorough consideration of its relationships with other variables. In this paper, multi-output regression problems are regarded as multi-task optimization problems where predicting one output variable is considered as one task. A new multi-task multi-population genetic programming method is proposed to solve the problem. The method utilizes the semantic based crossover operator to transfer positive knowledge and accelerate convergence. Additionally, it adopts an offspring reservation strategy to keep the quality of the individuals for the corresponding tasks. The empirical results demonstrate that our proposed method significantly enhances the training and the test performances of multi-task multi-population GP and also outperforms standard GP on five real-world multi-output regression datasets.

References

[1]
Baligh Al-Helali, Qi Chen, Bing Xue, and Mengjie Zhang. 2024. Genetic programming for feature selection based on feature removal impact in high-dimensional symbolic regression. IEEE Transactions on Emerging Topics in Computational Intelligence (2024), 1--14.
[2]
Hanen Borchani, Gherardo Varando, Concha Bielza, and Pedro Larranaga. 2015. A survey on multi-output regression. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 5, 5 (2015), 216--233.
[3]
Qi Chen, Bing Xue, Wolfgang Banzhaf, and Mengjie Zhang. 2023. Relieving genetic programming from coefficient learning for symbolic regression via correlation and linear scaling. In Proceedings of the Genetic and Evolutionary Computation Conference. 420--428.
[4]
Nicholas C Coops, Piotr Tompalski, Tristan RH Goodbody, Martin Queinnec, Joan E Luther, Douglas K Bolton, Joanne C White, Michael A Wulder, Oliver R van Lier, and Txomin Hermosilla. 2021. Modelling lidar-derived estimates of forest attributes over space and time: A review of approaches and future trends. Remote Sensing of Environment 260 (2021), 112477.
[5]
Weiwei Liu, Donna Xu, Ivor W Tsang, and Wenjie Zhang. 2018. Metric learning for multi-output tasks. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 2 (2018), 408--422.
[6]
Quang Uy Nguyen, Xuan Hoai Nguyen, and Michael O'Neill. 2009. Semantic aware crossover for genetic programming: The case for real-valued function regression. In Genetic Programming: 12th European Conference, EuroGP. 292--302.
[7]
Quang Uy Nguyen, Michael O'Neill, Nguyen Xuan Hoai, Bob Mckay, and Edgar Galván-López. 2010. Semantic similarity based crossover in GP: The case for real-valued function regression. In Artifical Evolution: 9th International Conference, Evolution Artificielle. 170--181.
[8]
Eleftherios Spyromitros-Xioufis, Grigorios Tsoumakas, William Groves, and Ioannis Vlahavas. 2016. Multi-target regression via input space expansion: Treating targets as inputs. Machine Learning 104 (2016), 55--98.
[9]
Athanasios Tsanas and Angeliki Xifara. 2012. Accurate quantitative estimation of energy performance of residential buildings using statistical machine learning tools. Energy and buildings 49 (2012), 560--567.
[10]
Devis Tuia, Jochem Verrelst, Luis Alonso, Fernando Pérez-Cruz, and Gustavo Camps-Valls. 2011. Multioutput support vector regression for remote sensing biophysical parameter estimation. IEEE Geoscience and Remote Sensing Letters 8, 4 (2011), 804--808.
[11]
Chunyu Wang, Qi Chen, Bing Xue, and Mengjie Zhang. 2023. Shapley value based feature selection to improve generalization of genetic programming for high-dimensional symbolic regression. In Australasian Conference on Data Science and Machine Learning. 163--176.
[12]
Fangfang Zhang, Yi Mei, Su Nguyen, Kay Chen Tan, and Mengjie Zhang. 2021. Multitask genetic programming-based generative hyperheuristics: A case study in dynamic scheduling. IEEE Transactions on Cybernetics 52, 10 (2021), 10515--10528.
[13]
Hengzhe Zhang, Qi Chen, Bing Xue, Wolfgang Banzhaf, and Mengjie Zhang. 2023. MAP-Elites for genetic programming-based ensemble learning: An interactive approach. IEEE Computational Intelligence Magazine 18, 4 (2023), 62--63.
[14]
Hengzhe Zhang, Qi Chen, Bing Xue, Wolfgang Banzhaf, and Mengjie Zhang. 2023. A semantic-based hoist mutation operator for evolutionary feature construction in regression. IEEE Transactions on Evolutionary Computation (2023).
[15]
Yu Zhang and Qiang Yang. 2021. A survey on multi-task learning. IEEE Transactions on Knowledge and Data Engineering 34, 12 (2021), 5586--5609.
[16]
Liming Zheng, Ke Jia, Tianshu Bi, Yu Fang, and Zhe Yang. 2020. Cosine similarity based line protection for large-scale wind farms. IEEE Transactions on Industrial Electronics 68, 7 (2020), 5990--5999.

Cited By

View all
  • (2024)Semantics-guided multi-task genetic programming for multi-output regressionPattern Recognition10.1016/j.patcog.2024.111289(111289)Online publication date: Dec-2024

Index Terms

  1. Multi-task Genetic Programming with Semantic based Crossover for Multi-output Regression

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GECCO '24 Companion: Proceedings of the Genetic and Evolutionary Computation Conference Companion
    July 2024
    2187 pages
    ISBN:9798400704956
    DOI:10.1145/3638530
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 August 2024

    Check for updates

    Author Tags

    1. multi-output regression
    2. genetic programming
    3. evolutionary multitask optimization

    Qualifiers

    • Poster

    Conference

    GECCO '24 Companion
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)61
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 23 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Semantics-guided multi-task genetic programming for multi-output regressionPattern Recognition10.1016/j.patcog.2024.111289(111289)Online publication date: Dec-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media