Abstract
NGS has become a popular research field in biologists because it was able to produce inexpensive and accuracy short biology sequences very fast. NGS technique has been improved to produce long length sequences, more than 100bp, recently with the same quality, accuracy and speed. Thus, tools for short sequences may be not suitable for long length sequences. We propose a new tool called UNION for re-sequencing applications by mapping long length sequences to a reference genome. UNION uses the UniMarker with a non-overlapping interval indexing strategy and a tool, CORAL, to do sequence alignments. For the experiments we randomly cut ten thousands sequences with a length of 512bp from the genome of Trichomonas and also produce mutations/sequence errors for these sequences to simulate different similarities. UNION has been compared with GMAP in terms of speed and accuracy and achieves better performance than that of GMAP.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)
Chao, K.M., Zhang, J., Ostell, J., Miller, W.: A tool for aligning very similar DNA sequences. Comput. Appl. Biosci. 13, 75–80 (1997)
Zhang, J., Madden, T.L.: PowerBLAST: A new network BLAST application for interactive or automated sequence analysis and annotation. Genome Res. 7, 649–656 (1997)
Florea, L., Hartzell, G., Zhang, Z., Rubin, G.M., Miller, W.: A computer program for aligning a cDNA sequence with a genomic DNA sequence. Genome Res. 8, 967–974 (1998)
Delcher, A.L., Kasif, S., Fleischmann, R.D., Peterson, J., White, O., Salzberg, S.L.: Alignment of whole genomes. Nucleic Acids Res. 27, 2369–2376 (1999)
Zhang, Z., Schwartz, S., Wagner, L., Miller, W.: A greedy algorithm for aligning DNA sequences. J. Comput. Biol. 7, 203–214 (2000)
Chen, L.Y.Y., Lu, S.H., Shih, E.S.C., Hwang, M.J.: Single nucleotide polymorphism mapping using genome-wide unique sequences. Genome Res. 12, 1106–1111 (2002)
Hsu, F.R., Chen, J.F.: Aligning ESTs to Genome Using Multi-Layer Unique Makers. In: Proceedings of the Computational Systems Bioinformatics, CSB 2003 (2003)
Wu, T.D., Watanabe, C.K.: GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 21(9), 1859–1875 (2005)
James Kent, W.: BLAT—The BLAST-Like Alignment Tool. Genome Research 12, 656–664 (2002)
Hung, C.L., Lin, C.Y., Chung, Y.C., Hsieh, S.J., Tang, C.Y.: Comparative Exon Prediction based on Heuristic Coding Region Alignment. In: Proceedings of the International Symposium on Parallel Architectures, Algorithms, and Networks, pp. 14–19 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hung, CL., Lin, CY., Hu, YC. (2011). UNION: An Efficient Mapping Tool Using UniMark with Non-overlapping Interval Indexing Strategy. In: Kim, Th., et al. Database Theory and Application, Bio-Science and Bio-Technology. BSBT DTA 2011 2011. Communications in Computer and Information Science, vol 258. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27157-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-27157-1_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27156-4
Online ISBN: 978-3-642-27157-1
eBook Packages: Computer ScienceComputer Science (R0)