Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2908812.2908882acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Classification of Cardiac Arrhythmia by Random Forests with Features Constructed by Kaizen Programming with Linear Genetic Programming

Published: 20 July 2016 Publication History

Abstract

Cardiac rhythm disorders may cause severe heart diseases, stroke, and even sudden cardiac death. Some arrhythmias are so serious that can cause injury to other organs, for instance, brain, kidneys, lungs or liver. Therefore, early and correct diagnosis of cardiac arrhythmia is essential to the prevention of serious problems. There are expert systems to classify arrhythmias from electrocardiograms signals. However, it has been shown that not only selecting the correct features from the dataset but also generating combined features could be the key to having real progress in classification. Therefore, this paper investigates a novel hybrid evolutionary technique to perform both tasks at the same time, finding complementary features that cover different characteristics of the data. The new features were tested with a widely-used classifier called Random Forests. The method reduced a dataset with 279 attributes to 26 attributes and achieved accuracies of 86.39% for binary classification and 77.69% for multiclass. Our approach outperformed several popular feature selection, feature generation, and state-of-the-art related work from the literature.

References

[1]
Ignacio Arnaldo, Una-May O'Reilly, and Kalyan Veeramachaneni. Building predictive models via feature synthesis. In Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation, GECCO '15, pages 983--990, New York, NY, USA, 2015. ACM.
[2]
Markus F. Brameier and Wolfgang Banzhaf. Linear Genetic Programming. Springer, 2007.
[3]
Leo Breiman. Random forests. Mach. Learn., 45(1):5--32, October 2001.
[4]
Léo Françoso dal Piccol Sotto and Vinıcius Veloso de Melo. Studying bloat control and maintenance of effective code in linear genetic programming for symbolic regression. Neurocomputing, 2015.
[5]
Vinicius V. De Melo. Kaizen programming. In Proceedings of the 2014 Conference on Genetic and Evolutionary Computation, GECCO '14, pages 895--902, New York, NY, USA, 2014. ACM.
[6]
Vinicius Veloso de Melo and Wolfgang Banzhaf. Kaizen programming for feature generation. In Rick Riolo, William P. Worzel, and Kala Groscurth, editors, Genetic Programming Theory and Practice XIII, Genetic and Evolutionary Computation, Ann Arbor, USA, May 2015. Springer. Forthcomming.
[7]
Vinicius Veloso de Melo and Wolfgang Banzhaf. Predicting high-performance concrete compressive strength using features constructed by kaizen programming. In 2015 Brazilian Conference on Intelligent Systems, BRACIS 2015, Natal, Brazil, November 4--7, 2015, pages 80--85, 2015.
[8]
Sigve Dreyer. Evolutionary feature selection, 2013.
[9]
Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H. Witten. The weka data mining software: An update. SIGKDD Explorations, 11, 2009.
[10]
Mark A. Hall. Correlation-based Feature Subset Selection for Machine Learning. PhD thesis, University of Waikato, Hamilton, New Zealand, 1998.
[11]
Winston Haynes. Wilcoxon rank sum test. In Werner Dubitzky, Olaf Wolkenhauer, Kwang-Hyun Cho, and Hiroki Yokota, editors, Encyclopedia of Systems Biology, pages 2354--2355. Springer New York, 2013.
[12]
Gunawan Herman, Bang Zhang, Yang Wang, Getian Ye, and Fang Chen. Mutual information-based method for selecting informative feature sets. Pattern Recognition, 46(12):3315 -- 3327, 2013.
[13]
Chris J. Hinde, Anoud I. Bani-Hani, Thomas W. Jackson, and Yen P. Cheung. Evolving polynomials of the inputs for decision tree building. Journal of Emerging Technologies in Web Intelligence, 4(2):198--203, 2012.
[14]
Sotiris B. Kotsiantis. Supervised machine learning: A review of classification techniques. In Proceedings of the 2007 Conference on Emerging Artificial Intelligence Applications in Computer Engineering: Real Word AI Systems with Applications in eHealth, HCI, Information Retrieval and Pervasive Technologies, pages 3--24, Amsterdam, The Netherlands, The Netherlands, 2007. IOS Press.
[15]
Krzysztof Krawiec. Encyclopedia of Machine Learning, chapter Evolutionary Feature Selection and Construction, pages 353--357. Springer US, Boston, MA, 2010.
[16]
Stuart E. Lacy, Michael A. Lones, and Stephen L Smith. Forming classifier ensembles with multimodal evolutionary algorithms. In 2015 IEEE Congress on Evolutionary Computation (CEC), pages 723--729, May 2015.
[17]
Moshe Lichman. UCI machine learning repository, 2013.
[18]
Hiroshi Motoda and Huan Liu. Feature selection, extraction and construction. In Communication of IICM, volume 5, pages 67--72, MAY 2002.
[19]
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. Scikit-learn: Machine learning in python. Journal of Machine Learning Research, 12:2825--2830, 2011.
[20]
Saleha Samad, Shoab A. Khan, Anam Haq, and Amna Riaz. Classification of arrhythmia. International Journal of Electrical Energy, 2(1):57--61, 2014.
[21]
Bernhard Scholkopf, Alexander Smola, and Klaus-Robert Muller. Kernel principal component analysis. In Wulfram Gerstner, Alain Germond, Martin Hasler, and Jean-Daniel Nicoud, editors, Artificial Neural Networks - ICANN'97, volume 1327 of Lecture Notes in Computer Science, pages 583--588. Springer Berlin Heidelberg, 1997.
[22]
Swati Shilaskar and Ashok Ghatol. Feature selection for medical diagnosis : Evaluation for cardiovascular diseases. Expert Systems with Applications, 40(10):4146 -- 4153, 2013.
[23]
Wenhao Shu and Wenbin Qian. A fast approach to attribute reduction from perspective of attribute measures in incomplete decision systems. Knowledge-Based Systems, 72(0):60 -- 71, 2014.
[24]
Lindsay I. Smith. A tutorial on principal components analysis. Technical report, Cornell University, USA, February 26 2002.
[25]
Lee Spector and Sean Luke. Cultural transmission of information in genetic programming. In Proceedings of the 1st Annual Conference on Genetic Programming, pages 209--214, Cambridge, MA, USA, 1996. MIT Press.
[26]
Xin Sun, Yanheng Liu, Jin Li, Jianqi Zhu, Xuejie Liu, and Huiling Chen. Using cooperative game theory to optimize the feature selection problem. Neurocomputing, 97(0):86 -- 93, 2012.
[27]
Xin Sun, Yanheng Liu, Mantao Xu, Huiling Chen, Jiawei Han, and Kunhao Wang. Feature selection using dynamic weights for classification. Knowledge-Based Systems, 37(0):541 -- 549, 2013.
[28]
Augustus D. Waller. A demonstration on man of electromotive changes accompanying the heart beat. Journal of Physiology, 8(5):229--234, 1887.
[29]
Yunyun Wang, Songcan Chen, and Hui Xue. Support vector machine incorporated with feature discrimination. Expert Systems with Applications, 38(10):12506 -- 12513, 2011.
[30]
Changming Zhu and Daqi Gao. Improved multi-kernel classification machine with nyström approximation technique. Pattern Recognition, 48(4):1490 -- 1509, 2015.

Cited By

View all
  • (2021)Fake News and Imbalanced Data PerspectiveData Preprocessing, Active Learning, and Cost Perceptive Approaches for Resolving Data Imbalance10.4018/978-1-7998-7371-6.ch011(195-210)Online publication date: 2021
  • (2018)GP-RVM: Genetic Programing-Based Symbolic Regression Using Relevance Vector Machine2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC.2018.00054(255-262)Online publication date: Oct-2018
  • (2018)Increasing the Prediction Quality of Software Defective Modules with Automatic Feature EngineeringInformation Technology – New Generations10.1007/978-3-319-77028-4_68(527-535)Online publication date: 2018

Index Terms

  1. Classification of Cardiac Arrhythmia by Random Forests with Features Constructed by Kaizen Programming with Linear Genetic Programming

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GECCO '16: Proceedings of the Genetic and Evolutionary Computation Conference 2016
    July 2016
    1196 pages
    ISBN:9781450342063
    DOI:10.1145/2908812
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 July 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. ECG
    2. Kaizen programming
    3. arrhythmia
    4. linear genetic programming
    5. random forests

    Qualifiers

    • Research-article

    Funding Sources

    • Brazilian Government CNPq (Universal)
    • São Paulo Research Foundation (FAPESP)
    • CAPES (Science without Borders program)

    Conference

    GECCO '16
    Sponsor:
    GECCO '16: Genetic and Evolutionary Computation Conference
    July 20 - 24, 2016
    Colorado, Denver, USA

    Acceptance Rates

    GECCO '16 Paper Acceptance Rate 137 of 381 submissions, 36%;
    Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Fake News and Imbalanced Data PerspectiveData Preprocessing, Active Learning, and Cost Perceptive Approaches for Resolving Data Imbalance10.4018/978-1-7998-7371-6.ch011(195-210)Online publication date: 2021
    • (2018)GP-RVM: Genetic Programing-Based Symbolic Regression Using Relevance Vector Machine2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC.2018.00054(255-262)Online publication date: Oct-2018
    • (2018)Increasing the Prediction Quality of Software Defective Modules with Automatic Feature EngineeringInformation Technology – New Generations10.1007/978-3-319-77028-4_68(527-535)Online publication date: 2018

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media