Nothing Special   »   [go: up one dir, main page]

Skip to main content

A Search for the Best Data Mining Method to Predict Melanoma

  • Conference paper
  • First Online:
Rough Sets and Current Trends in Computing (RSCTC 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2475))

Included in the following conference series:

  • 890 Accesses

Abstract

Our main objective was to decrease the error rate of diagnosis of melanoma, a very dangerous skin cancer. Since diagnosticians routinely use the so-called ABCD formula for melanoma prediction, our main concern was to improve the ABCD formula. In our search for the best coefficients of the ABCD formula we used two different discretization methods, agglomerative and divisive, both based on cluster analysis. In our experiments we used the data mining system LERS (Learning from Examples based on Rough Sets). As a result of more than 30,000 experiments, two optimal ABCD formulas were found, one with the use of the agglomerative method, the other one with divisive. These formulas were evaluated using statistical methods. Our final conclusion is that it is more important to use an appropriate discretization method than to modify the ABCD formula. Also, the divisive method of discretization is better than agglomerative. Finally, diagnosis of melanoma without taking into account results of the ABCD formula is much worse, i.e., the error rate is significantly greater, comparing with any form of the ABCD formula.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Bajcar, S., Grzymala-Busse, J. W., and Hippe. Z. S.: A comparison of six discretization algorithms used for prediction of melanoma. Accepted for the Eleventh International Symposium on Intelligent Information Systems, Poland, June 3–6, 2002.

    Google Scholar 

  2. Booker, L. B., Goldberg, D. E., and Holland J. F.: Classifier systems and genetic algorithms. In Machine Learning. Paradigms and Methods. Carbonell, J. G. (Ed.), The MIT Press, Boston, MA, 1990, 235–282.

    Google Scholar 

  3. Chmielewski, M. R. and Grzymala-Busse, J. W.: Global discretization of continuous attributes as preprocessing for machine learning. Int. Journal of Approximate Reasoning 15, 1996, 319–331.

    Article  MATH  Google Scholar 

  4. Friedman, R. J., Rigel, D. S., and Kopf, A. W.: Early detection of malignant melanoma: the role of physician examination and self-examination of the skin. CA Cancer J. Clin. 35, 1985, 130–151.

    Article  Google Scholar 

  5. Grzymala-Busse, J. P., Grzymala-Busse, J. W., and Hippe Z. S.: Melanoma prediction using data mining system LERS. Proceeding of the 25th Anniversary Annual International Computer Software and Applications Conference COMPSAC 2001, October 8–12, 2001, Chicago, IL, 615–620.

    Google Scholar 

  6. Grzymala-Busse, J. W.: LERS—A system for learning from examples based on rough sets. In Intelligent Decision Support. Handbook of Applications and Advances of the Rough Sets Theory. Slowinski, R. (ed.), Kluwer Academic Publishers, Dordrecht, Boston, London, 1992, 3–18.

    Google Scholar 

  7. Grzymala-Busse J. W.: A new version of the rule induction system LERS. Fundamenta Informaticae 31 (1997), 27–39.

    MATH  Google Scholar 

  8. Grzymala-Busse J. W. and Hippe Z. S.: Postprocessing of rule sets induced from a melanoma data set. Accepted for the COMPSAC 2002, 26th Annual International Conference on Computer Software and Applications, Oxford, England, August 26–29, 2002.

    Google Scholar 

  9. Hippe, Z. S.: Computer database NEVI on endargement by melanoma. Task Quarterly 4, 1999, 483–488.

    Google Scholar 

  10. Holland, J. H., Holyoak, K. J., and Nisbett, R. E.: Induction. Processes of Inference, Learning, and Discovery. The MIT Press, Boston, MA, 1986.

    Google Scholar 

  11. Lorentzen, H. Weismann, K. Secher, L. Peterson, C. S. Larsen, F. G.: The dermatoscopic ABCD rule does not improve diagnostic accuracy of malignant melanoma. Acta Derm. Venereol. 79, 1999, 469–472.

    Article  Google Scholar 

  12. Pawlak, Z.: Rough Sets. International Journal of Computer and Information Sciences, 11, 1982, 341–356.

    Article  MATH  MathSciNet  Google Scholar 

  13. Pawlak, Z.: Rough Sets. Theoretical Aspects of Reasoning about Data. Kluwer Academic Publishers, Dordrecht, Boston, London, 1991.

    Google Scholar 

  14. Peterson, N.: Discretization using divisive cluster analysis and selected postprocessing techniques. Department of Computer Science, University of Kansas, internal report, 1993.

    Google Scholar 

  15. Stolz, W., Braun-Falco, O., Bilek, P., Landthaler, A. B., Cogneta, A. B.: Color Atlas of Dermatology, Blackwell Science Inc., Cambridge, MA, 1993.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Grzymała-Busse, J.W., Hippe, Z.S. (2002). A Search for the Best Data Mining Method to Predict Melanoma. In: Alpigini, J.J., Peters, J.F., Skowron, A., Zhong, N. (eds) Rough Sets and Current Trends in Computing. RSCTC 2002. Lecture Notes in Computer Science(), vol 2475. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45813-1_71

Download citation

  • DOI: https://doi.org/10.1007/3-540-45813-1_71

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44274-5

  • Online ISBN: 978-3-540-45813-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics