Abstract
The comparative methods for predicting RNA secondary structure can be facilitated by taking structural alignments of homologous sequences as input. However, it is very difficult to construct a well structural alignment of RNA sequences without knowing the secondary structures. In this paper, we present a stem-based method for constructing structural alignment of RNA sequences with unknown structures. The method can be summarized by: 1) we detect possible stems in the RNA sequence using the so-called position matrix with which some possibly paired positions are uncovered; 2) we detect conserved stems across multiple sequences by multiplying the position matrices; 3) we assess the conserved stems using the Signal-to-Noise and the new SCFG model; 4) we construct structural alignment of RNA sequences by incorporating conserved stems with Clustal W which is a popular program for multiple sequence alignment. We tested our method on data sets composed of known structural alignments which are downloaded from the Rfam database. The accuracy, measured as sensitivity and true positive rate, of our method is much greater than alignments by Clustal W.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Eddy, S.R.: Non-coding RNA genes and modern RNA world. Nat. Rev. Genet. 2, 919–929 (2001)
Huttenhofer, A., Schattner, P., Polacek, N.: Non-coding RNAs: hope or hype. TRENDS in Genetics 21, 289–297 (2005)
Pace, N.R., Thomas, B.C., Woese, C.R.: Probing RNA structure, function, and history by comparative analysis. In: Gesteland, R.F., Cech, T.R., Atkins, J.F. (eds.) The RNA World, 2nd edn., pp. 113–141. Cold Spring Harbor Laboratory Press, NY (1999)
Knudsen, B., Hein, J.: Pfold: RNA secondary structure prediction using stochastic context-free grammars. Nucleic Acids Research 31, 3423–3428 (2003)
Hofacker, I., Fekete, M., Stadler, P.: Secondary structure prediction for aligned RNA sequences. Journal of Molecular Biology 319, 1059–1066 (2002)
Ruan, J., Stormo, G., Zhang, W.: An iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots. Bioinformatics 20, 58–66 (2004)
Knight, R., et al.: BayesFold: rational 2 degrees folds that combine thermodynamic, covariation, and chemical data for aligned RNA sequences. RNA 10, 1323–1336 (2004)
Thompson, J., Higgins, D., Gibson, T.: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nucleic Acids Research 22, 4673–4680 (1994)
Notredame, C., Higgins, D., Heringa, J.: T-Coffee: A novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology 302, 205–217 (2000)
Sankoff, D.: Simultaneous solution of the RNA folding, alignment and protosequence problems. SIAM Journal on Applied Mathematics 45, 810–825 (1985)
Hofacker, I., Bernhart, S., Stadler, P.: Alignment of RNA base pairing probability matrices. Bioinformatics 20, 2222–2227 (2004)
Mathews, D., Turner, D.: Dynalign: An algorithm for finding the secondary structure common to two RNA sequences. Journal of Molecular Biology 317(2), 191–203 (2002)
Perriquet, O., Touzet, H., Dauchet, M.: Finding the common structure shared by two homologous RNAs. Bioinformatics 19, 108–116 (2003)
Ji, Y., Xu, X., Stormo, G.: A graph theoretical approach for predicting common RNA secondary structure motifs including pseudoknots in unaligned sequences. Bioinformatics 20, 591–602 (2004)
Bafna, V., Tang, H., Zhang, S.: Consensus folding of unaligned RNA sequences revisited. Journal of computational biology 13, 283–295 (2006)
Tabei, Y., et al.: SCARNA: fast and accurate structural alignment of RNA sequences by matching fixed-length stem fragments. Bioinformatics 22, 1723–1729 (2006)
Durbin, R., et al.: Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University press, Cambridge (1998)
Sam, G.J., Alex, B., et al.: Rfam: an RNA family database. Nucleic Acids Research 31, 439–441 (2003)
Fang, X., et al.: The detection and assessment of possible RNA secondary structure using multiple sequence alignment. In: The 22nd Annual ACM Symposium on Applied Computing, Seoul, Korea, March 11-15, pp. 11–15. ACM Press, New York (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fang, X., Luo, Z., Yuan, B., Wang, Z., Ding, F. (2007). Constructing Structural Alignment of RNA Sequences by Detecting and Assessing Conserved Stems. In: Li, K., Li, X., Irwin, G.W., He, G. (eds) Life System Modeling and Simulation. LSMS 2007. Lecture Notes in Computer Science(), vol 4689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74771-0_24
Download citation
DOI: https://doi.org/10.1007/978-3-540-74771-0_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74770-3
Online ISBN: 978-3-540-74771-0
eBook Packages: Computer ScienceComputer Science (R0)