Optimal Gene Selection for Cancer Classification with Partial Correlation and k-Nearest Neighbor Classifier

Si-Ho Yoo²¹ &
Sung-Bae Cho²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3157))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

838 Accesses
1 Citations

Abstract

High density DNA microarrays are widely used in cancer research, monitoring thousands of genes at once. Due to small sample size and the large amount of genes in micrarray experiments, selection of significant genes via expression patterns is an important matter in cancer classification. Many gene selection methods have been investigated, but it is hard to find out the perfect one. In this paper we propose a new gene selection method based on partial correlation in regression analysis to find the informative genes to predict cancer. The genes selected by this method tend to have information about the cancer that is not overlapped by the genes selected previously. We have measured the sensitivity, specificity, and recognition rate of the selected genes with k-nearest neighbor classifier for colon cancer dataset. In most of the cases, the proposed method has produced better results than the gene selection methods based on correlation coefficients, showing high accuracy of 90.3% for colon cancer dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

An Optimize Gene Selection Approach for Cancer Classification Using Hybrid Feature Selection Methods

A Comparative Study of Gene Selection Methods for Microarray Cancer Classification

Gene Selection and Classification Rule Generation for Microarray Dataset

References

Harrington, C.A., Rosenow, C., Retief, J.: Monitoring gene expression using DNA microarrays. Curr. Opin. Microbiol. 3, 285–291 (2000)
Article Google Scholar
Cho, S.-B., Ryu, J.: Classifying gene expression data of cancer using classifier ensemble with mutually exclusive features. Proc. of the IEEE 90(11), 1744–1753 (2002)
Article Google Scholar
Shannon, W.D., Watson, M.A., Perry, A., Rich, K.: Mantel statistics to correlate gene expression levels from microarrays with clinical covariates. Genetic Epidemiology 23(1), 96–97 (2002)
Article Google Scholar
Furey, T.S., Cristianini, N., Duffy, N., Bednarski, D.W., Schummer, M., Haussler, D.: Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 16(10), 906–914 (2000)
Article Google Scholar
Tamayo, P.: Interpreting patterns of gene expression with self-organizing map: Methods and application to hematopoietic differentiation. Proc. of the Natl. Acad. of Sci. USA 96, 2907–2912 (1999)
Article Google Scholar
Li, L., Weinberg, C.R., Darden, T.A., Pedersen, L.G.: Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics 17(12), 1131–1142 (2001)
Article Google Scholar
Lipshutz, R.J., Fodor, S.P.A., Gingeras, T.R., Lockhart, D.J.: High density synthetic oligonucleotide arrays. Nature Genetics 21, 20–24 (1999)
Article Google Scholar
Lee, K.E., Sha, N., Dougherty, E.R., Vannucci, M., Mallick, B.K.: Gene selection: a Bayesian variable selection approach. Bioinformatics 19(1), 90–97 (2002)
Article Google Scholar
West, M., Nevins, J.R., Marks, J.R., Spang, R., Blanchette, C., Zuzan, H.: DNA microarray data analysis and regression modeling for genetic expression profiling. In: ISDS Discussion, pp. 00–15 (2000)
Google Scholar
Bo, T.H., Jonassen, I.: New feature subset selection procedures for classification of expression profiles. Genome Biology 3(4), 17.1–17.11 (2002)
Google Scholar
Liu, J., Iba, H.: Selecting informative genes with parallel genetic algorithms in tissue classification. Genome Informatics 12, 14–23 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science, Yonsei University, 134 Shinchon-dong, Sudaemoon-ku, Seoul, 120-749, Korea
Si-Ho Yoo & Sung-Bae Cho

Authors

Si-Ho Yoo
View author publications
You can also search for this author in PubMed Google Scholar
Sung-Bae Cho
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Engineering and Information Technology, Centre for Quantum Computation and Intelligent Systems, and Australian ACS National Committee for Artificial Intelligence, University of Technology, Sydney, Australia
Chengqi Zhang
Department of Computer Science, Auckland University of Technology, 1020, Auckland, New Zealand
Hans W. Guesgen
Artificial Intelligence Technology Centre, Auckland University of Technology, Auckland, New Zealand
Wai-Kiang Yeap

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yoo, SH., Cho, SB. (2004). Optimal Gene Selection for Cancer Classification with Partial Correlation and k-Nearest Neighbor Classifier. In: Zhang, C., W. Guesgen, H., Yeap, WK. (eds) PRICAI 2004: Trends in Artificial Intelligence. PRICAI 2004. Lecture Notes in Computer Science(), vol 3157. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28633-2_75

Download citation

DOI: https://doi.org/10.1007/978-3-540-28633-2_75
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22817-2
Online ISBN: 978-3-540-28633-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Optimal Gene Selection for Cancer Classification with Partial Correlation and k-Nearest Neighbor Classifier

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

An Optimize Gene Selection Approach for Cancer Classification Using Hybrid Feature Selection Methods

A Comparative Study of Gene Selection Methods for Microarray Cancer Classification

Gene Selection and Classification Rule Generation for Microarray Dataset

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Optimal Gene Selection for Cancer Classification with Partial Correlation and k-Nearest Neighbor Classifier

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

An Optimize Gene Selection Approach for Cancer Classification Using Hybrid Feature Selection Methods

A Comparative Study of Gene Selection Methods for Microarray Cancer Classification

Gene Selection and Classification Rule Generation for Microarray Dataset

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation