Abstract
The paper describes an application of a new, non-linear dimensionality reduction method, named Isomap, for mining the structural knowledge from high-dimensional medical data. The algorithm was evaluated on two publicly available medical datasets: the pathological dataset of breast cancer (241 malignant samples) and the gene expression dataset from the lung (186 tumours). It was found by Isomap that the approximate intrinsic dimensionalities of these two datasets were as low as three. The spatial structures of both datasets were presented in low-dimensional space. Isomap, as a general tool for dimensionality reduction analysis, is helpful in revealing the nonlinear structural knowledge of high-dimensional medical data.
Similar content being viewed by others
References
Bhattacharjee, A., Richards, W. G., Staunton, J., Li, C., Monti, S., Vasa, P., Ladd, C., Beheshti, J., Bueno, R., Gillette, M., Loda, M., Weber, G., Mark, E. J., Lander, E. S., Wong, W., Johnson, B. E., Golub, T. R., Sugarbaker, D. J., andMeyerson, M. (2001): ‘Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses’,Proc. Nat. Acad. Sci. USA,98, pp. 13790–13795
Blake, C. L. andMerz, C. J. (1998): ‘UCI repository of machine learning databases [http://www.ics.uci.edu/≈mlearn/MLRepository. html]. Irvine, CA: University of California, Department of Information & Computer Science
Borg, I., andGroenen, P. (1997): ‘Modern multidimensional scaling: theory and application’ (Springer-Verlag, New York, Berlin, Heidelberg, 1997)
Hastie, T. J., andStuetzle, W. (1989): ‘Principal curves’,J. Am. Stat. Assoc.,84, pp. 502–516
Huber, P. J. (1985): ‘Projection pursuit’,Ann. Statist.,13, pp. 435–475
Jolliffe, I. T. (1986): ‘Principal component analysis’ (Springer-Verlag, New York, 1986)
Kohonen, T. (1997): ‘Self organizing maps’ (Springer, Berlin, 1997)
Tenenbaum, J. B., Silvam, V. D., andLangford, J. C. (2000): ‘A global geometric framework for nonlinear dimensionality reduction’,Science,290, pp. 2319–2323
Travis, W. D., Travis, L. B., andDevesa, S. S. (1995): ‘Lungcancer’,Cancer,75, pp. 191–202
Wolberg, W. H., andMangasarian, O. L. (1990): ‘Multisurface method of pattern separation for medical diagnosis applied to breast cytology’,Proc. Natl. Acad. Sci. USA,87, pp. 9193–9196
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Weng, S., Zhang, C., Lin, Z. et al. Mining the structural knowledge of high-dimensional medical data using isomap. Med. Biol. Eng. Comput. 43, 410–412 (2005). https://doi.org/10.1007/BF02345820
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF02345820