Abstract
Maximum likelihood (ML) (Felsenstein, 1981) is an increasingly popular optimality criterion for selecting evolutionary trees. Finding optimal ML trees appears to be a very hard computational task – in particular, algorithms and heuristics for ML take longer to run than algorithms and heuristics for maximum parsimony (MP). However, while MP has been known to be NP-complete for over 20 years, no such hardness result has been obtained so far for ML.
In this work we make a first step in this direction by proving that ancestral maximum likelihood (AML) is NP-complete. The input to this problem is a set of aligned sequences of equal length and the goal is to find a tree and an assignment of ancestral sequences for all of that tree’s internal vertices such that the likelihood of generating both the ancestral and contemporary sequences is maximized. Our NP-hardness proof follows that for MP given in (Day, Johnson and Sankoff, 1986) in that we use the same reduction from Vertex Cover; however, the proof of correctness for this reduction relative to AML is different and substantially more involved.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Day, W.: Computational Complexity of Inferring Phylogenies from Dissimilarity Matrices. Bulletin of Mathematical Biology 49(4), 461–467 (1987)
Day, W., Johnson, D., Sankoff, D.: The Computational Complexity of Inferring Rooted Phylogenies by Parsimony. Mathematical Biosciences 81, 33–42 (1986)
Day, W., Sankoff, D.: Computational Complexity of Inferring Phylogenies by Compatability. Systematic Zoology 35(2), 224–299 (1986)
Felsenstein, J.: Evolutionary Trees from DNA Sequences: A Maximum Likelihood Approach. Journal of Molecular Evolution 17, 368–376 (1981)
Fitch, W.: Towards defining the course of evolution: minimum change for a specific tree topology. Systematic Zoology 20, 406–416 (1971)
Foulds, L., Graham, R.: The Steiner problem in phylogeny is NP-complete. Advances in Applied Mathematics 3, 43–49 (1982)
Goldman, N.: Maximum likelihood inference of phylogenetic trees, with special reference to a Poisson process model of DNA substitution and to parsimony analyses. Systematic Zoology 39(4), 345–361 (1990)
Graham, R., Foulds, L.: Unlikelihood that minimal phylogenies for a realistic biological study can be constructed in reasonable computational time. Mathematical Biosciences 60, 133–142 (1982)
Koshi, M., Goldstein, R.: Probabilistic reconstruction of ancestral protein sequences. Journal of Molecular Evolution 42, 313–320 (1996)
Neyman, J.: Molecular studies of evolution: A source of novel statistical problems. In: Gupta, S., Jackel, Y. (eds.) Statistical Decision Theory and Related Topics, pp. 1–27. Academic Press, New York (1971)
Pupko, T., Pe’er, I., Shamir, R., Graur, D.: A Fast Algorithm for Joint Reconstruction of Ancestral Amino Acid Sequences. Molecular Biology and Evolution 17(6), 890–896 (2000)
Pupko, T., Pe’er, I., Hasegawa, M., Graur, D., Friedman, N.: A branch-and-bound algorithm for the inference of ancestral amino-acid sequences when the replacement rate varies among sites: Application to the evolution of five gene families. Bioinformatics 18(8), 1116–1123 (2002)
Sankoff, D., Cedergren, R.: Simultaneous comparison of three or more sequences related by a tree. In: Sankoff, D., Kruskal, J. (eds.) Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison, pp. 253–263. Addison-Wesley Publishing Company, Reading (1983)
Steel, M.: The complexity of reconstructing trees from qualitative characters and subtrees. Journal of Classification 9, 71–90 (1992)
Swofford, D., Maddison, W.: Parsimony, Character-State Reconstructions, and Evolutionary Inferences. In: Mayden, R. (ed.) Systematics, Historical Ecology, and North American Freshwater Fishes, pp. 186–223. Stanford University Press, Stanford (1992)
Swofford, D., Olsen, G., Waddell, P., Hillis, D.: Phylogenetic Inference. In: Hillis, D., Moritz, C., Mable, B. (eds.) Molecular Systematics, 2nd edn., pp. 407–514. Sinauer Associates, Sunderland (1996)
Wareham, T.: On the Computational Complexity of Inferring Evolutionary Trees. Technical Report 93-01, Department of Computer Science, Memorial University of Newfoundland (1993)
Yang, Z., Kumar, S., Nei, M.: A new method of inference of ancestral nucleotide and amino acid sequences. Genetics 141, 1641–1650 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Addario-Berry, L., Chor, B., Hallett, M., Lagergren, J., Panconesi, A., Wareham, T. (2003). Ancestral Maximum Likelihood of Evolutionary Trees Is Hard. In: Benson, G., Page, R.D.M. (eds) Algorithms in Bioinformatics. WABI 2003. Lecture Notes in Computer Science(), vol 2812. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39763-2_16
Download citation
DOI: https://doi.org/10.1007/978-3-540-39763-2_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20076-5
Online ISBN: 978-3-540-39763-2
eBook Packages: Springer Book Archive