DOI: 10.1145/3121138.3121175
research-article

The High Performance Computing Applications for Bioinformatics Research

Published: 22 June 2017

Abstract

This study reviews current high performance computing applications for bioinformatics. First, we introduce two popular high performance computing architectures: single instruction, multiple data (SIMD) and multiple instruction, multiple data (MIMD). For SIMD, we use CUDA as the example and detail three CUDA-based applications for bioinformatics research: GPU-BLAST, SOAP3 and CLAST. For MIMD, we use Hadoop as the example and detail three Hadoop-based applications for bioinformatics research: CloudBurst, CloudAligner and SEAL. Finally, we summarize the aim of the research.
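
As a rough illustration of the Hadoop-style data flow the abstract refers to, the sketch below imitates the map, shuffle and reduce phases in plain Python to find candidate read positions by exact k-mer (seed) matching. It is a toy under stated assumptions, not the implementation of any tool named above: the function names, the seed length `K` and the in-memory `shuffle` are all hypothetical, and a real Hadoop job would distribute these phases across cluster nodes.

```python
from collections import defaultdict

K = 4  # seed length; an illustrative assumption, not taken from the paper


def map_kmers(name, seq, source):
    """Map step: emit (k-mer, (source, name, offset)) pairs for one sequence."""
    for i in range(len(seq) - K + 1):
        yield seq[i:i + K], (source, name, i)


def shuffle(pairs):
    """Shuffle step: group emitted values by key, as a MapReduce runtime would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_seeds(kmer, values):
    """Reduce step: pair each read hit with each reference hit sharing this k-mer."""
    refs = [(n, i) for s, n, i in values if s == "ref"]
    reads = [(n, i) for s, n, i in values if s == "read"]
    for read_name, read_off in reads:
        for ref_name, ref_off in refs:
            # Implied start of the read on the reference if this seed is correct.
            yield read_name, ref_name, ref_off - read_off


def candidate_positions(reference, reads):
    """Run map, shuffle and reduce in-process and return sorted unique candidates."""
    pairs = list(map_kmers("ref1", reference, "ref"))
    for name, seq in reads.items():
        pairs.extend(map_kmers(name, seq, "read"))
    hits = set()
    for kmer, values in shuffle(pairs).items():
        hits.update(reduce_seeds(kmer, values))
    return sorted(hits)


if __name__ == "__main__":
    print(candidate_positions("ACGTACGGTCA", {"r1": "CGGTC"}))
```

Each k-mer group can be reduced independently of the others, which is what lets a cluster process groups on different nodes at once. A full read mapper would follow the seed step with an extension or verification step, which is omitted here.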

Cited By

  • (2024) An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models. BMC Bioinformatics 25:1. DOI: 10.1186/s12859-024-05879-3. Online publication date: 29-Jul-2024
  • (2023) G-SAIP: Graphical Sequence Alignment Through Parallel Programming in the Post-Genomic Era. Evolutionary Bioinformatics 19. DOI: 10.1177/11769343221150585. Online publication date: 20-Jan-2023

    Published In

    ICBBS '17: Proceedings of the 6th International Conference on Bioinformatics and Biomedical Science
    June 2017
    184 pages
    ISBN:9781450352222
    DOI:10.1145/3121138
    In-Cooperation

    • National University of Singapore

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. CUDA
    2. Hadoop
    3. bioinformatics
    4. high performance computing

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICBBS '17
