Nothing Special   »   [go: up one dir, main page]

Skip to main content
Log in

Fast fixed granular-ball for attribute reduction in label noise environments and its application in medical diagnosis

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

Although neighborhood rough set(NRS) based attribute reduction methods have achieved excellent performance in many scenarios, the efficiency and robustness of these methods have not attracted much attention. In this study, we propose a fast fixed granular-ball model (FFGB) for attribute reduction in label noise environments. In FFGB, we propose a fast neighborhood search mechanism to improve the efficiency of NRS. This fast mechanism reduces the neighborhood search range from the universe to a neighborhood and reduces the time complexity of the neighborhood calculation to much less than \(O(n^2)\). Based on the fast mechanism, we propose FFGB model whose definitions are relaxed to be robust to against label noise. In addition, a FFGB attribute reduction algorithm is designed. Finally, we apply the FFGB attribute reduction to medical diagnosis. The experimental results indicate that FFGB is more efficient and robust than the comparison methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Dong L, Chen D (2020) Incremental attribute reduction with rough set for dynamic datasets with simultaneously increasing samples and attributes. Int J Mach Learn Cybernet 11:1339–1355

    Google Scholar 

  2. Fan B, Wu W, Xu W, Li W (2019) Attribute-oriented cognitive concept learning strategy: a multi-level method. Int J Mach Learn Cybernet 10:2421–2437

    CAS  Google Scholar 

  3. Zhu X, Pedrycz W, Li Z (2021) A development of granular input space in system modeling. IEEE Trans Cybernet 51(3):1639–1650

    Google Scholar 

  4. Qian Y, Cheng H, Wang J, Liang J, Pedrycz W, Dang C (2017) Grouping granular structures in human granulation intelligence. Inform Sci 382:150–169

    Google Scholar 

  5. Chu X, Sun B, Chu X, Wu J, Han K, Zhang Y, Huang Q (2022) Multi-granularity dominance rough concept attribute reduction over hybrid information systems and its application in clinical decision-making. Inform Sci 597:274–299

    Google Scholar 

  6. Ng WW, Jiang X, Tian X, Pelillo M, Wang H, Kwong S (2020) Incremental hashing with sample selection using dominant sets. Int J Mach Learn Cybernet 11:2689–2702

    Google Scholar 

  7. Yang X, Yang J, Wu C, Yu D (2008) Dominance-based rough set approach and knowledge reductions in incomplete ordered information system. Inform Sci 178(4):1219–1234

    MathSciNet  Google Scholar 

  8. Li D, Wu W (2018) On the characterization of fuzzy rough sets based on a pair of implications. Int J Mach Learn Cybernet 9:2081–2092

    Google Scholar 

  9. Yang Y, Song S, Chen D, Xiao Z (2020) Discernible neighborhood counting based incremental feature selection for heterogeneous data. Int J Mach Learn Cybernet 11:1115–1127

    Google Scholar 

  10. Zhang K, Zhan J, Wang X (2020) TOPSIS-WAA method based on a covering-based fuzzy rough set: An application to rating problem. Inform Sci 539:397–421

    MathSciNet  Google Scholar 

  11. Shao M, Wu W, Wang X, Wang C (2020) Knowledge reduction methods of covering approximate spaces based on concept lattice. Knowl-Based Syst 191:105269

    Google Scholar 

  12. Jiang H, Zhan J, Chen D (2019) Covering-based variable precision (I, T)-fuzzy rough sets with applications to multiattribute decision-making. IEEE Trans Fuzzy Syst 27(8):1558–1572

    Google Scholar 

  13. Chen X, Chen D, Mi J (2021) Feature distribution-based label correlation in multi-label classification. Int J Mach Learn Cybernet 12:1705–1719

    Google Scholar 

  14. Tsang EC, Hu Q, Chen D (2016) Feature and instance reduction for pnn classifiers based on fuzzy rough sets. Int J Mach Learn Cybernet 7:1–11

    Google Scholar 

  15. Sun L, Yin T, Ding W, Qian Y, Xu J (2020) Multilabel feature selection using ML-relieff and neighborhood mutual information for multilabel neighborhood decision systems. Inform Sci 537:401–424

    MathSciNet  Google Scholar 

  16. Xu Z, He Y, Wang X (2019) An overview of probabilistic-based expressions for qualitative decision-making: techniques, comparisons and developments. Int J Mach Learn Cybernet 10:1513–1528

    Google Scholar 

  17. Zuo H, Lu J, Zhang G, Pedrycz W (2019) Fuzzy rule-based domain adaptation in homogeneous and heterogeneous spaces. IEEE Trans Fuzzy Syst 27(2):348–361

    Google Scholar 

  18. Zhan J, Jiang H, Yao Y (2021) Three-way multiattribute decision-making based on outranking relations. IEEE Trans Fuzzy Syst 29(10):2844–2858

    Google Scholar 

  19. Shao Y, Qi X, Gong Z (2020) A general framework for multi-granulation rough decision-making method under q-rung dual hesitant fuzzy environment. Artificial Intell Rev 53:4903–4933

    Google Scholar 

  20. Gao C, Zhou J, Miao D et al (2021) Granular-conditional-entropy-based attribute reduction for partially labeled data with proxy labels. Inform Sci 580:111–128

    MathSciNet  Google Scholar 

  21. Zhang X, Mei C, Chen D, Yang Y, Li J (2020) Active incremental feature selection using a fuzzy-rough-set-based information entropy. IEEE Trans Fuzzy Syst 28(5):901–915

    Google Scholar 

  22. Gao C, Lai Z, Zhou J et al (2019) Granular maximum decision entropy-based monotonic uncertainty measure for attribute reduction. Int J Approx Reason 104:9–24

    MathSciNet  CAS  Google Scholar 

  23. M. A. Hall, Correlation-based feature selection for discrete and numeric class machine learning, in: International Conference on Machine Learning, Vol. 7, 2000, pp. 359-366

  24. Frénay B, Verleysen M (2014) Classification in the presence of label noise: A survey. IEEE Trans Neural Netw Learn Syst 25(5):845–869

  25. Xu R, Wen Z, Gui L, Lu Q, Li B, Wang X (2020) Ensemble with estimation: seeking for optimization in class noisy data. Int J Mach Learn Cybernet 11:231–248

    CAS  Google Scholar 

  26. S. Xia, G. Wang, Z. Chen, Y. Duan, Q. liu, Complete random forest based class noise filtering learning for improving the generalizability of classifiers, IEEE Trans Knowl Data Eng 31 (11) (2019) 2063-2078

  27. Luengo J, Snchez-Tarrag D, Prati RC, Herrera F (2021) Multiple instance classification: bag noise filtering for negative instance noise cleaning. Inform Sci 579:388–400

    MathSciNet  Google Scholar 

  28. S. Xia, S. Zheng, G. Wang, X. Gao, B. Wang, Granular ball sampling for noisy label classification or imbalanced classification, IEEE Transactions on Neural Networks and Learning Systems DOI:https://doi.org/10.1109/TNNLS.2021.3105984.

  29. G. Roffo, S. Melzi, U. Castellani, A. Vinciarelli, Infinite latent feature selection: A probabilistic latent graph-based ranking approach. In: IEEE International Conference on Computer Vision, IEEE Computer Society, 2017, pp. 1407-1415

  30. Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182

    Google Scholar 

  31. Hu Q, Yu D, Xie Z (2008) Numerical attribute reduction based on neighborhood granulation and rough approximation. J Softw 19(3):640–649

    Google Scholar 

  32. Hu Q, Liu J, Yu D (2008) Mixed feature selection based on granulation and approximation. Knowl-Based Syst 21(4):294–304

    Google Scholar 

  33. Hu Q, Zhao H, Yu D (2008) Efficient symbolic and numerical attribute reduction with neighborhood rough sets. Pattern Recognit Artificial Intell 21(6):732–738

    Google Scholar 

  34. Hu Q, Pedrycz W, Yu D, Lang J (2010) Selecting discrete and continuous features based on neighborhood decision error minimization. IEEE Trans Syst Man Cybernet Part B (Cybernetics) 40(1):137–150

    Google Scholar 

  35. Wang C, Hu Q, Wang X, Chen D, Qian Y, Dong Z (2018) Feature selection based on neighborhood discrimination index. IEEE Trans Neural Netw Learn Syst 29(7):2986–2999

    MathSciNet  PubMed  Google Scholar 

  36. Sang B, Chen H, Yang L, Li T, Xu W, Luo C (2021) Feature selection for dynamic interval valued ordered data based on fuzzy dominance neighborhood rough set. Knowl-Based Syst 227:107223

    Google Scholar 

  37. S. Xu, X. Yang, E. Tsang, E. A. Mantey, Neighborhood collaborative classifiers, in: International Conference on Machine Learning and Cybernetics, Vol. 1, 2016, pp. 470-476

  38. Wang C, Shao M, He Q, Qian Y, Qi Y (2016) Feature subset selection based on fuzzy neighborhood rough sets. Knowl-Based Syst 111:173–179

    Google Scholar 

  39. Li J, Yang X, Song X, Li J, Wang P, Yu D (2019) Neighborhood attribute reduction: a multicriterion approach. Int J Mach Learn Cybernet 10(4):731–742

    CAS  Google Scholar 

  40. Liao S, Zhu Q, Qian Y, Lin G (2018) Multi-granularity feature selection on cost-sensitive data with measurement errors and variable costs. Knowl-Based Syst 158:25–42

    Google Scholar 

  41. Sun L, Wang L, Ding W et al (2020) Neighborhood multi-granulation rough sets-based attribute reduction using lebesgue and entropy measures in incomplete neighborhood decision systems. Knowl-Based Syst 192:105373

    Google Scholar 

  42. L. Sun, X. Zhang, J Xu, et al. An Attribute Reduction Method Using Neighborhood Entropy Measures in Neighborhood Rough Sets. Entropy 21(2)(2019) 155

  43. Li W, Huang Z, Jia X, Cai X (2016) Neighborhood based decision-theoretic rough set models. Int J Approximate Reason 69:1–17

    MathSciNet  CAS  Google Scholar 

  44. J. Zhang, T. Li, Y. Yang, L. Wang, Neighborhood rough sets based matrix approach for calculation of the approximations, in: Rough Sets and Knowledge Technology, Vol. 6, 2011, pp. 166-171

  45. Peng X, Wang P, Xia S, Wang C, Pu C, Qian J (2022) FNC: a fast neighborhood calculation framework. Knowl-Based Syst 252:109394

    Google Scholar 

  46. Liu Y, Huang W, Jiang Y, Zeng Z (2014) Quick attribute reduct algorithm for neighborhood rough set model. Inform Sci 271:65–81

    MathSciNet  Google Scholar 

  47. Hu M, Tsang E, Guo Y, Chen D, Xu W (2021) A novel approach to attribute reduction based on weighted neighborhood rough sets. Knowl-Based Syst 220:106908

    Google Scholar 

  48. Xia S, Zhang H, Li W, Wang G, Giem E, Chen Z (2022) GBNRS: A novel rough set algorithm for fast adaptive attribute reduction in classification. IEEE Trans Knowl Data Eng 34(3):1231–1242

    Google Scholar 

  49. Xia S, Liu Y, Ding X, Wang G, Luo Y (2019) Granular ball computing classifiers for efficient, scalable and robust learning. Inform Sci 483:136–152

    MathSciNet  Google Scholar 

  50. S. Xia, X. Dai, G. Wang, X. Gao, E. Giem. An Efficient and Adaptive Granular-Ball Generation Method in Classification Problem. IEEE Transactions on Neural Networks and Learning Systems DIO: 10.1109/TNNLS.2022.3203381

Download references

Acknowledgements

This work is supported in part by the National Key Research and Development Program of China under Grant 2021YFB3301000, and in part by the Chongqing Talent Plan Project under Grant cstc2021ycjh-bgzxm0206. Doctoral Talent Training Program of Chongqing University of Posts and Telecommunications under Grant No. BYJS202010, the Intelligent Manufacturing Industry Technology Research Institute Open Fund under Grant No. ZNZZ2108, the Key Research and Development Program of Dazhou Science and Technology Bureau under Grant Nos. 20ZDYF0003 and 20ZDYF0001, the Ministry of Education’s Industry School Cooperation Collaborative Education Project under Grant 22097042270822, and by the Multi dimensional data perception and intelligent information processing Dazhou key laboratory project under Grant Nos. DWSJ2202 and DWSJ2207.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ping Wang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Peng, X., Wang, P., Shao, Y. et al. Fast fixed granular-ball for attribute reduction in label noise environments and its application in medical diagnosis. Int. J. Mach. Learn. & Cyber. 15, 1039–1054 (2024). https://doi.org/10.1007/s13042-023-01954-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-023-01954-y

Keywords

Navigation