Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3605098.3636191acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
poster

Evolutionary Feature Selection for Time-Series Forecasting

Published: 21 May 2024 Publication History

Abstract

In machine learning, feature selection is crucial for pinpointing the key subset of features that enhances interpretability and preserves or boosts the model's original performance. Filter methods, which assess features using statistical metrics, are particularly notable. Recently, a novel metric called Conditional Dependence Coefficient has been proposed to measure the dependence between subsets of variables.
This paper introduces a novel filter feature selection method that integrates the Conditional Dependence Coefficient metric with an evolutionary algorithm to find the optimal feature subset. This approach combines the adaptability of genetic algorithms with the strength of an intuitive metric. Unlike many filter-based methods, our technique does not rely on parameters directly linked to the number of features (like thresholds). Moreover, it evaluates the collective merit of feature subsets rather than individual significance.
We conducted tests on six different multivariate time-series datasets to address the forecasting challenge, determining the relevant lags. Considering no selection as baseline, experimental results indicate that our approach is competitive in terms of efficacy while demonstrating a reduction in the number of features selected.

References

[1]
M. Azadkia and S. Chatterjee. 2021. A simple measure of conditional dependence. The Annals of Statistics 49, 6 (2021), 3070--3102.
[2]
V. Bolón-Canedo, A. Alonso-Betanzos, L. Morán-Fernández, and B. Cancela. 2022. Feature selection: From the past to the future. In Proceedings of Advances in Selected Artificial Intelligence Areas: World Outstanding Women in Artificial Intelligence. Springer, 11--34.
[3]
CDT. 2015. California department of transportation.
[4]
J. Dunn, L. Mingardi, and Y.D. Zhuo. 2021. Comparing interpretability and explainability for feature selection.
[5]
F. J. Galán-Sales, P. Reina-Jiménez, M. Carranza-García, and J. M. Luna-Romera. 2023. An Approach to Enhance Time Series Forecasting by Fast Fourier Transform. In Proceedings of the International Conference on Soft Computing Models in Industrial and Environmental Applications. Springer, 259--268.
[6]
R. Godahewa, C. Bergmeir, G. Webb, R. Hyndman, and P. Montero-Manso. 2020. Electricity Hourly Dataset.
[7]
A. Gómez-Losada, G. Asencio-Cortés, F. Martínez-Álvarez, and J.C. Riquelme. 2018. A novel approach to forecast urban surface-level ozone considering heterogeneous locations and limited information. Environmental Modelling & Software 110 (2018), 52--61.
[8]
M.J. Jiménez-Navarro, C. Restrepo-Estrada, L. Melgar-García, and D. Gutierrez-Aviles. 2023. Feature Selection Guided by CVOA Metaheuristic for Deep Neural Networks: Application to Multivariate Time Series Forecasting. In Proceedings of 18th International Conference on Soft Computing Models in Industrial and Environmental Applications. Springer, 209--218.
[9]
M. J. Jiménez-Navarro, M. Martínez-Ballesteros, I. Brito, F. Martínez-Álvarez, and G. Cortés. 2022. Feature-Aware Drop Layer (FADL): A Nonparametric Neural Network Layer for Feature Selection. In Proceedings of the International Conference on Soft Computing Models in Industrial and Environmental Applications. Springer, 557--566.
[10]
A. Khemphila and V. Boonjing. 2011. Heart Disease Classification Using Neural Network and Feature Selection. Proceedings of the International Conference on Systems Engineering 21 (2011), 406--409.
[11]
M. L. Linares-Barrera, M. Martínez-Ballesteros, J. M. García-Heredia, and J. C. Riquelme. 2023. A Feature Selection and Association Rule Approach to Identify Genes Associated with Metastasis and Low Survival in Sarcoma. In Proceedings of the International Conference on Hybrid Artificial Intelligence Systems. Springer, 731--742.
[12]
J. Miao and L. Niu. 2016. A survey on feature selection. Procedia computer science 91 (2016), 919--926.
[13]
T. Niu, J. Li, W. Wei, and H. Yue. 2022. A hybrid deep learning framework integrating feature selection and transfer learning for multi-step global horizontal irradiation forecasting. Applied Energy 326 (2022), 119964.
[14]
M. Rong, D. Gong, and X. Gao. 2019. Feature selection and its use in big data: challenges, methods, and trends. IEEE Access 7 (2019), 19709--19725.
[15]
S. Velliangiri and S. Alagumuthukrishnan. 2019. A review of dimensionality reduction techniques for efficient computation. Procedia Computer Science 165 (2019), 104--111.
[16]
J. Wu, X. Chen, H. Zhang, L. Xiong, H. Lei, and S. Deng. 2019. Hyperparameter optimization for machine learning models based on Bayesian optimization. Journal of Electronic Science and Technology 17, 1 (2019), 26--40.

Cited By

View all
  • (2024)Explainable deep learning on multi-target time series forecasting: an air pollution use caseResults in Engineering10.1016/j.rineng.2024.103290(103290)Online publication date: Nov-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '24: Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing
April 2024
1898 pages
ISBN:9798400702433
DOI:10.1145/3605098
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 May 2024

Check for updates

Author Tags

  1. machine learning
  2. feature selection
  3. genetic algorithm
  4. regression
  5. time-series forecasting

Qualifiers

  • Poster

Funding Sources

  • Ministerio de Ciencia e Innovación

Conference

SAC '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)49
  • Downloads (Last 6 weeks)13
Reflects downloads up to 19 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Explainable deep learning on multi-target time series forecasting: an air pollution use caseResults in Engineering10.1016/j.rineng.2024.103290(103290)Online publication date: Nov-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media