Abstract
With recent advances in sequencing technologies, a huge amount of DNA sequences become available year after year. In order to obtain useful information on these sequences, we need to process them in search of biologically meaningful regions. The genes are amongst the most important regions of a genome and the task of locating them in a DNA of interest is called the gene prediction problem. This problem can be addressed in several ways, and one of the most promising methods relies on homology information between the genomic DNA and previous annotated sequences (proteins, cDNAs and ESTs). In this paper we generalize a traditional formulation of the gene prediction problem and use this new formulation in the development of three gene identification tools. All these programs were tested on a benchmark of 240 human genomic sequences and the results obtained compare favorably with those achieved by other gene prediction tools available in the literature.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Burge, C., Karlin, S.: Prediction of Complete Gene Structures in Human Genomic DNA. Journal of Molecular Biology 268(1), 78–94 (1997)
Burset, M., Guigó, R.: Evaluation of Gene Structure Prediction Programs. Genomics 34(3), 353–367 (1996)
Florea, L., Hartzell, G., Zhang, Z., Rubin, G., Miller, W.: A Computer Program for Aligning a cDNA Sequence with a Genomic Sequence. Genome Research 8(9), 967–974 (1998)
Gelfand, M.S., Mironov, A.A., Pevzner, P.A.: Gene Recognition via Spliced Sequence Alignment. Proc. Natl. Acad. Sci (USA) 93(17), 9061–9066 (1996)
Guigó, R.: Assembling Genes from Predicted Exons in Linear Time with Dynamic Programming. Journal of Computational Biology 5(4), 681–702 (1998)
Gusfield, D.: Efficient Methods for Multiple Sequence Alignment with Guaranteed Error Bounds. Bulletin of Mathematical Biology 5(1), 141–154 (1993)
Kishi, R.M.: Identificação de Genes e o Problema do Alinhamento Spliced Múltiplo. Master Thesis. Faculdade de Computaoção/UFMS (December 2010)
Knight, J., Myers, E.W.: Super-pattern and Matching. Algorithmica 13, 211–243 (1995)
Majoros, W.H.: Methods for Computational Gene Prediction. Cambridge University Press, Cambridge (2007)
Mathé, C., Sagot, M.-F., Schiex, T., Rouzé, P.: Current Methods of Gene Prediction, their Strengths and Weaknesses. Nucleic Acids Research 30(19), 4103–4117 (2002)
Mott, R.: EST_Genome: a Program to Align Spliced DNA Sequences to Unspliced Genomic DNA. Comput. Appl. Biosci. 13(4), 477–478 (1997)
Nicolas, F., Rivals, E.: Hardness Results for the Center and Median String Problems Under the Weighted and Unweighted Edit Distances. Journal of Discrete Algorithms 3(2-4), 390–415 (2004)
The ENCODE Project Consortium: The ENCODE (Encyclopedia of DNA Elements) Project. Science 306(5696), 636–640 (2004)
Sayers, E.W., et al.: Database Resources of the National Center for Biotechnology Information. Nucleic Acids Research 37(suppl. 1), D5–D15 (2008)
Stanke, M., Schöffmman, O., Morgenstern, B., Waack, S.: Gene Prediction with a Hidden Markov Model and a New Intron Submodel. Bioinformatics 19(suppl. 2), II215–II225 (2003)
Usuka, J., Zhu, W., Brendel, V.: Optimal Spliced Alignment of Homologous cDNA to a Genomic DNA Template. Bioinformatics 16(3), 203–211 (2000)
Wheelan, S.J., Church, D.M., Ostell, J.M.: Spidey: a Tool for mRNA-to-genomic Alignments. Genome Research 11(11), 1952–1957 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kishi, R.M., dos Santos, R.F., Adi, S.S. (2011). Gene Prediction by Multiple Spliced Alignment. In: Norberto de Souza, O., Telles, G.P., Palakal, M. (eds) Advances in Bioinformatics and Computational Biology. BSB 2011. Lecture Notes in Computer Science(), vol 6832. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22825-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-22825-4_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22824-7
Online ISBN: 978-3-642-22825-4
eBook Packages: Computer ScienceComputer Science (R0)