DOI: 10.1109/BIBM.2011.9

Stratified Random Forest for Genome-wide Association Study

Published: 12 November 2011

Abstract

In high-dimensional genome-wide association (GWA) case-control data for complex diseases, a large proportion of single-nucleotide polymorphisms (SNPs) are usually irrelevant to the disease. When a random forest chooses feature subspaces by simple random sampling with the default mtry parameter, too many of the selected subspaces contain no informative SNPs. An exhaustive search for an optimal mtry is therefore often required to include the useful, relevant SNPs and exclude the vast number of non-informative ones. However, such a search is very time-consuming and is impractical for high-dimensional GWA studies. This paper proposes a stratified sampling method for feature subspace selection when generating the decision trees of a random forest for high-dimensional GWA data. We use two genome-wide SNP data sets (Parkinson case-control data comprising 408,803 SNPs and Alzheimer case-control data comprising 380,157 SNPs) to demonstrate that the proposed stratified sampling method is effective: it generates better random forests, with higher accuracy and a lower error bound, than Breiman's random forest generation method.
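
The abstract does not spell out how the strata are built, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes each SNP already has some informativeness score (for example from a per-SNP association test), that the features are split into just two strata (a "strong" group of the top-scoring SNPs and a "weak" group of the rest), and that a fixed number is drawn from each stratum. The names `prob_no_informative`, `stratified_subspace`, `n_strong`, `mtry_strong`, and `mtry_weak` are hypothetical. The first function illustrates the problem the abstract describes: the chance that a simple random subspace contains no informative SNP at all.

```python
import math
import random


def prob_no_informative(n_features, n_informative, mtry):
    """Probability that a simple random subspace of size mtry, drawn without
    replacement, contains no informative feature (hypergeometric zero term)."""
    def log_comb(n, r):
        # log of the binomial coefficient C(n, r), via lgamma to avoid overflow
        return math.lgamma(n + 1) - math.lgamma(r + 1) - math.lgamma(n - r + 1)

    return math.exp(log_comb(n_features - n_informative, mtry)
                    - log_comb(n_features, mtry))


def stratified_subspace(scores, n_strong, mtry_strong, mtry_weak, rng):
    """Draw one feature subspace: mtry_strong indices from the n_strong
    highest-scoring features and mtry_weak indices from the remainder.
    The two-strata split is an assumption made for this sketch."""
    # Feature indices ordered from highest score to lowest
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    strong, weak = order[:n_strong], order[n_strong:]
    # Sample without replacement inside each stratum
    return rng.sample(strong, mtry_strong) + rng.sample(weak, mtry_weak)


# Toy data: 10,000 hypothetical SNP scores, the first 50 artificially inflated
rng = random.Random(0)
scores = [rng.random() for _ in range(10_000)]
for i in range(50):
    scores[i] += 1.0  # pretend these SNPs are the informative ones

# Simple random sampling: chance that a 100-feature subspace misses all 50
p_miss = prob_no_informative(10_000, 50, 100)

# Stratified sampling: every subspace is guaranteed 5 strong features
subspace = stratified_subspace(scores, n_strong=50, mtry_strong=5,
                               mtry_weak=95, rng=rng)
```

With simple random sampling, a sizeable share of subspaces in this toy setting contains none of the inflated features, whereas the stratified draw includes a fixed number of them by construction. How many strata to use and how to allocate mtry between them are design choices this sketch leaves open.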

Published In

BIBM '11: Proceedings of the 2011 IEEE International Conference on Bioinformatics and Biomedicine
November 2011
651 pages
ISBN:9780769545745

Publisher

IEEE Computer Society

United States

Author Tags

  1. Genome-wide association study
  2. random forest classifier
  3. significant SNP selection
  4. stratified sampling
