Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3651781.3651816acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicscaConference Proceedingsconference-collections
research-article
Open access

Adaptive feature selection using Label Uncertainty Reduction for multi-label classification

Published: 30 May 2024 Publication History

Abstract

Multi-label learning deals with datasets in which each instance is associated with a set of labels. The objective is to enhance the learning model by eliminating redundant features while assessing the uncertainty of each label. We present an approach to tackle intricate classification tasks. Our method starts by evaluating the nature of the feature space, whether binary or continuous, to ensure its suitability within the given context. Subsequently, we employ information entropy to quantify the importance of labels, which is then multiplied by the similarity between features and labels (FL) similarity. This approach effectively balances the relevance of features with the uncertainty in label prediction using Grey Relational Analysis (GRA) based optimization. Proposed method computes feature rankings by optimizing Feature Relevance and Label Certainty (FRLCO). The top-ranked features are then selected for classification tasks, and we utilize Particle Swarm Optimization (PSO) to optimize the Multi-Label k-Nearest Neighbors (MLKNN) algorithm. As a result, our proposed approach, FRLCO, excels beyond state-of-the-art multi-label feature selection techniques.

References

[1]
Afan Ali and Fan Yangyu. 2017. Dominant eigen vector based feature selection using singular value decomposition in automatic modulation classification. International Journal of Computer Theory and Engineering 9, 5 (2017).
[2]
P Anuradha and VK David. 2022. Feature selection by ModifiedBoostARoota and classification by CatBoost model on high dimensional heart disease datasets. Int. J. Computer Theory and Engineering 14, 4 (2022), 141–148.
[3]
Lejla Batina, Benedikt Gierlichs, Emmanuel Prouff, Matthieu Rivain, François-Xavier Standaert, and Nicolas Veyrat-Charvillon. 2011. Mutual information analysis: a comprehensive study. Journal of Cryptology 24, 2 (2011), 269–291.
[4]
Forrest Briggs, Yonghong Huang, Raviv Raich, Konstantinos Eftaxias, Zhong Lei, William Cukierski, Sarah Frey Hadley, Adam Hadley, Matthew Betts, Xiaoli Z Fern, 2013. The 9th annual MLSP competition: New methods for acoustic classification of multiple simultaneous bird species in a noisy environment. In 2013 IEEE international workshop on machine learning for signal processing (MLSP). IEEE, 1–8.
[5]
Sotiris Diplaris, Grigorios Tsoumakas, Pericles A Mitkas, and Ioannis Vlahavas. 2005. Protein classification with multiple algorithms. In Advances in Informatics: 10th Panhellenic Conference on Informatics, PCI 2005, Volas, Greece, November 11-13, 2005. Proceedings 10. Springer, 448–456.
[6]
Jorge Gonzalez-Lopez, Sebastián Ventura, and Alberto Cano. 2020. Distributed multi-label feature selection using individual mutual information measures. Knowledge-Based Systems 188 (2020), 105052.
[7]
Amin Hashemi, Mohammad Bagher Dowlatshahi, and Hossein Nezamabadi-Pour. 2020. MFS-MCDM: Multi-label feature selection using multi-criteria decision making. Knowledge-Based Systems 206 (2020), 106365.
[8]
Amin Hashemi, Mohammad Bagher Dowlatshahi, and Hossein Nezamabadi-Pour. 2020. MGFS: A multi-label graph-based feature selection algorithm via PageRank centrality. Expert Systems with Applications 142 (2020), 113024.
[9]
Mark J Huiskes and Michael S Lew. 2008. The mir flickr retrieval evaluation. In Proceedings of the 1st ACM international conference on Multimedia information retrieval. 39–43.
[10]
Deng Ju-Long. 1982. Control problems of grey systems. Systems & control letters 1, 5 (1982), 288–294.
[11]
Jaesung Lee and Dae-Won Kim. 2013. Feature selection for multi-label classification using multivariate mutual information. Pattern Recognition Letters 34, 3 (2013), 349–357.
[12]
Jaesung Lee and Dae-Won Kim. 2015. Memetic feature selection algorithm for multi-label classification. Information Sciences 293 (2015), 80–96.
[13]
Jaesung Lee and Dae-Won Kim. 2015. Mutual information-based multi-label feature selection using interaction information. Expert Systems with Applications 42, 4 (2015), 2013–2025.
[14]
Jinghua Liu, Songwei Yang, Yaojin Lin, Chenxi Wang, Cheng Wang, and Jixiang Du. 2023. Multi-label feature selection via joint label enhancement and pairwise label correlations. International Journal of Machine Learning and Cybernetics (2023), 1–22.
[15]
Mohsen Paniri, Mohammad Bagher Dowlatshahi, and Hossein Nezamabadi-Pour. 2020. MLACO: A multi-label feature selection algorithm based on ant colony optimization. Knowledge-Based Systems 192 (2020), 105285.
[16]
Mohsen Paniri, Mohammad Bagher Dowlatshahi, and Hossein Nezamabadi-pour. 2021. Ant-TD: Ant colony optimization plus temporal difference reinforcement learning for multi-label feature selection. Swarm and Evolutionary Computation 64 (2021), 100892.
[17]
Dipanjyoti Paul, Anushree Jain, Sriparna Saha, and Jimson Mathew. 2021. Multi-objective PSO based online feature selection for multi-label classification. Knowledge-Based Systems 222 (2021), 106966.
[18]
Oscar Reyes, Carlos Morell, and Sebastián Ventura. 2015. Scalable extensions of the ReliefF algorithm for weighting and selecting features on the multi-label learning context. Neurocomputing 161 (2015), 168–182.
[19]
Claude E Shannon. 1997. The mathematical theory of communication. 1963.MD Comput 14 (1997), 306.
[20]
DM Shawky and AF Ali. 2011. A feature selection method using misclassified patterns. International Journal of Computer Theory and Engineering 3, 5 (2011), 643.
[21]
Ashok N Srivastava and Brett Zane-Ulman. 2005. Discovering recurring anomalies in text reports regarding complex space systems. In 2005 IEEE aerospace conference. IEEE, 3853–3862.
[22]
Amos Tversky. 1977. Features of similarity.Psychological review 84, 4 (1977), 327.
[23]
Li Yang and Abdallah Shami. 2020. On hyperparameter optimization of machine learning algorithms: Theory and practice. Neurocomputing 415 (2020), 295–316.
[24]
Min-Ling Zhang and Zhi-Hua Zhou. 2007. ML-KNN: A lazy learning approach to multi-label learning. Pattern recognition 40, 7 (2007), 2038–2048.
[25]
Ping Zhang and Wanfu Gao. 2021. Feature relevance term variation for multi-label feature selection. Applied Intelligence 51 (2021), 5095–5110.
[26]
Ping Zhang, Guixia Liu, and Jiazhi Song. 2023. MFSJMI: Multi-label feature selection considering join mutual information and interaction weight. Pattern Recognition 138 (2023), 109378.
[27]
Haibo Zhong, Ping Zhang, and Guixia Liu. 2023. Multi-label feature selection via redundancy of the selected feature set. Applied Intelligence 53, 9 (2023), 11073–11091.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICSCA '24: Proceedings of the 2024 13th International Conference on Software and Computer Applications
February 2024
395 pages
ISBN:9798400708329
DOI:10.1145/3651781
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 May 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Grey Relational Analysis
  2. Meta-heuristic
  3. PSO
  4. feature selection
  5. multi-label learning

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICSCA 2024

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 79
    Total Downloads
  • Downloads (Last 12 months)79
  • Downloads (Last 6 weeks)24
Reflects downloads up to 30 Nov 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media