Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3671151.3671308acmotherconferencesArticle/Chapter ViewAbstractPublication PagescibdaConference Proceedingsconference-collections
research-article

SC-AE: An Improved Spectral Clustering Unsupervised Feature Selection Algorithm Guided by Autoencoders Based on Pan-cancer Multi-view Omics Data

Published: 23 July 2024 Publication History

Abstract

Omics data is characterized by high-dimensional small samples and contains a great number of gene features unrelated to diseases. Pan-cancer exhibits heterogeneity and can be subdivided into different subtypes. Identifying disease subtypes is pivotal for advancing precision medicine, and disease subtype analysis through omics data has become a popular method. Without knowing the sample labels, how to select the most relevant subset of features for pan-cancer subtypes becomes an immediate challenge. This paper proposes an improved spectral clustering unsupervised feature selection algorithm guided by autoencoders (SC-AE) based on pan-cancer multi-view omics data to solve it. In the feature selection stage of spectral clustering, this paper proposes the evaluation metrics of feature importance to measure whether the feature is important. It takes into account both feature discernibility and feature independence. Through the autoencoder, the low-dimensional and denoised latent space of the original features is learned as the target representation to further guide the selected feature subset. We propose new evaluation metrics. It evaluates whether a feature subset is an important feature based on both clustering quality and clinical significance. The algorithm has been used on multi-view data from five pan-cancer datasets. Compared with the current six mainstream unsupervised feature selection algorithms, the optimal feature subset obtained by SC-AE exhibits better clustering effects and has more realistic clinical significance.

References

[1]
Park J Y, Lee J W, Park M. Comparison of cancer subtype identification methods combined with feature selection methods in omics data analysis[J]. BioData Mining, 2023, 16(1): 18.
[2]
Lin Y, Zhang W, Cao H, Classifying breast cancer subtypes using deep neural networks based on multi-omics data[J]. Genes, 2020, 11(8): 888.
[3]
Si C, Zhao L, Liu J. Deep Feature Selection Algorithm for Classification of Gastric Cancer Subtypes[C]//2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, 2023: 698-705.
[4]
Wang X W, Wang T, Schaub D P, Benchmarking omics-based prediction of asthma development in children[J]. Respiratory Research, 2023, 24(1): 63.
[5]
Solorio-Fernández S, Carrasco-Ochoa J A, Martínez-Trinidad J F. A review of unsupervised feature selection methods[J]. Artificial Intelligence Review, 2020, 53(2): 907-948.
[6]
Akiba T, Sano S, Yanase T, Optuna: A next-generation hyperparameter optimization framework[C]//Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. 2019: 2623-2631.
[7]
Poggiali A, Bernasconi A, Berti A, Quantum feature selection with variance estimation[C]//Proceedings of ESANN. 2023.
[8]
Liu Y, Mu Y, Chen K, Daily activity feature selection in smart homes based on pearson correlation coefficient[J]. Neural Processing Letters, 2020, 51: 1771-1787.
[9]
Cardall A C, Hales R C, Tanner K B, LASSO (L1) Regularization for Development of Sparse Remote-Sensing Models with Applications in Optically Complex Waters Using GEE Tools[J]. Remote Sensing, 2023, 15(6): 1670.
[10]
Ban Y, Lao H, Li B, Diagnosis of Alzheimer's disease using hypergraph p-Laplacian regularized multi-task feature learning[J]. Journal of Biomedical Informatics, 2023, 140: 104326.
[11]
Wang M, Han H, Huang Z, Unsupervised spectral feature selection algorithms for high dimensional data[J]. Frontiers of Computer Science, 2023, 17(5): 175330.
[12]
Xie J, Ding L, Wang M. Spectral clustering based unsupervised feature selection algorithms[J]. Journal of Software, 2020, 31(4): 1009-1024.

Index Terms

  1. SC-AE: An Improved Spectral Clustering Unsupervised Feature Selection Algorithm Guided by Autoencoders Based on Pan-cancer Multi-view Omics Data

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CIBDA '24: Proceedings of the 5th International Conference on Computer Information and Big Data Applications
    April 2024
    1285 pages
    ISBN:9798400718106
    DOI:10.1145/3671151
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 July 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    CIBDA 2024

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 13
      Total Downloads
    • Downloads (Last 12 months)13
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 16 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media