Abstract
Sequence comparisons suggest that all living green plants belong to one of two major phyla1,2,3: Streptophyta4 (land plants and their closest green algal relatives, the charophytes); and Chlorophyta5 (the rest of green algae). Because no green algae are known that pre-date the Streptophyta/Chlorophyta split, and also because the earliest diverging green algae show considerable morphological variation, the nature of the unicellular flagellate ancestor of the two green plant phyla is unknown1,6,7. Here we report that the flagellate Mesostigma viride belongs to the earliest diverging green plant lineage discovered to date. We have sequenced the entire chloroplast DNA (118,360 base pairs) of this green alga and have conducted phylogenetic analyses of sequences derived from this genome. Mesostigma represents a lineage that emerged before the divergence of the Streptophyta and Chlorophyta, a position that is supported by several features of its chloroplast DNA. The structure and gene organization of this genome indicate that chloroplast DNA architecture has been extremely well conserved in the line leading to land plants.
Main
Mesostigma is a scaly, green biflagellate that belongs to the Prasinophyceae8, a morphologically heterogeneous class that includes descendants of the earliest diverging green algae2,7,8,9. No unique character unites the Prasinophyceae to the exclusion of other green algal classes. In phylogenetic trees inferred from nuclear small subunit (SSU) ribosomal DNA1,2,3,7,9 and actin-coding sequences10, all prasinophytes examined so far, except Mesostigma, form independent lineages at the base of Chlorophyta. Mesostigma represents the earliest divergence within Streptophyta: in SSU rRNA trees1, this position is supported by low bootstrap values (52%), whereas in actin-gene trees10, it is more strongly supported (>81%). A specific affinity between Mesostigma and charophytes was previously predicted on the basis of the identical orientation of multilayered structures relative to flagellar roots1,7,11.
The 135 genes in Mesostigma chloroplast DNA (cpDNA) (Fig. 1) represent the largest gene repertoire ever reported among green algal and land plant cpDNAs. We analysed the concatenated protein sequences (10,629 amino-acid positions) from 53 genes that are common to the cpDNAs of Mesostigma, three land plants (Marchantia polymorpha, Pinus thunbergii and Nicotiana tabacum) and three chlorophyte green algae (Nephroselmis olivacea, Chlorella vulgaris and Pedinomonas minor). The homologous proteins from the glaucocystophyte Cyanophora paradoxa, the red alga Porphyra purpurea and the cyanobacterium Synechocystis sp. PCC6803 were used to root the green algal phylogeny in these analyses. It is generally accepted that all chloroplasts were derived from a single primary endosymbiotic event involving the capture of a cyanobacterium12. The chloroplasts of glaucocystophytes, red algae and green algae are thought to be direct products of this primary endosymbiotic event12. Phylogenetic evidence indicates that glaucocystophyte chloroplasts evolved before those of red and green algae13.
Trees constructed with distance, maximum parsimony and maximum likelihood inference methods using Cyanophora proteins as an outgroup were found to be congruent in showing a strongly supported topology, ‘T1’ (Fig. 2a), in which Mesostigma emerges before the divergence of Streptophyta and Chlorophyta. Only T1 was detected in quartet-puzzling and distance-based analyses under the amino-acid substitution models of Jones et al.14 (JTT-F) and Dayhoff et al.15, with either a uniform or a gamma-distributed rate of substitution across sites. In maximum-parsimony analysis, bootstrap support for T1 was 97%, and two alternative topologies, ‘T2’ and ‘T3’ (Fig. 2b), differing with respect to the position of Mesostigma, were each recovered in 1.5% of the bootstrap samples. T1, T2, and T3 were also the only topologies recovered in maximum-likelihood analysis under the JTT-F model with a uniform rate of substitution; T1 was found in 98.9% of bootstrap samples, and both T2 and T3 were significantly worse than T1 (T2, P < 0.05; T3, P < 0.01) in the Kishino-Hasegawa test16. Removal of constant sites from the data set had negligible effect on bootstrap support for T1 in distance and maximum-likelihood analyses, and on the confidence limit of tree topologies under the Kishino-Hasegawa test (see Supplementary Information). Considering that most constant sites (5,585 out of 5,628) were estimated to be invariable using SPLITSTREE, these observations eliminate the possibility that misleading effects of invariable sites17 contributed to the recovery of T1. Even when the Cyanophora sequences were excluded from the data set, or when other outgroup sequences (Porphyra proteins, or a combination of Cyanophora, Porphyra and Synechocystis proteins) were used, trees compatible with T1 remained the best topologies (see Supplementary Information).
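The removal of constant sites mentioned above can be illustrated with a minimal sketch. It assumes an alignment stored as a dictionary of equal-length strings; the function name `drop_constant_sites` and the toy sequences are hypothetical and are not taken from the original analysis, which relied on the program packages listed in Methods. Note that this filter only drops columns in which every taxon carries the same residue; it does not estimate which of those columns are truly invariable.

```python
def drop_constant_sites(alignment):
    """Keep only alignment columns that contain more than one residue.

    `alignment` maps taxon name -> aligned sequence (all the same length).
    """
    names = list(alignment)
    # Transpose the alignment so that each tuple is one column.
    columns = zip(*(alignment[name] for name in names))
    variable = [col for col in columns if len(set(col)) > 1]
    # Rebuild one sequence per taxon from the retained columns.
    return {
        name: "".join(col[i] for col in variable)
        for i, name in enumerate(names)
    }


# Toy example (hypothetical data): columns 2 and 4 vary, columns 1 and 3 do not.
toy = {"taxonA": "MKLV", "taxonB": "MRLV", "taxonC": "MKLI"}
filtered = drop_constant_sites(toy)
```

Re-running a bootstrap analysis on the filtered alignment and comparing support values with those from the full alignment is one way to check whether constant sites influence the recovered topology.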
Systematic biases in amino-acid composition can give rise to incorrect relationships in reconstructed phylogenetic trees18. Chi-squared analysis of amino-acid composition at variable sites reveals that the 53 concatenated proteins of Mesostigma do not significantly differ (P < 0.05) from those of Cyanophora, Marchantia, Pedinomonas and Pinus. Our failure to detect a systematic bias in amino-acid composition thus suggests that the basal position of Mesostigma is the result of a genuine phylogenetic signal. In support of this conclusion, neighbour-joining analysis of LogDet distances calculated after removal of all constant sites retrieved the T1 topology in 93% of bootstrap samples, with T2 being the only alternative topology detected. LogDet distances allow the recovery of the correct tree when sequences differ markedly in amino-acid frequencies for cases when substitution processes are otherwise uniform across the underlying tree19.
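As a rough illustration of a compositional comparison of this kind, the sketch below computes a Pearson chi-squared statistic on a 2 × k table of residue counts for two sequences. It is an assumption-laden example, not the procedure used in the study: the function name `composition_chi2` is hypothetical, the input is taken to be two sequences already restricted to variable sites, and the P value would still have to be read from a chi-squared distribution with the returned degrees of freedom.

```python
from collections import Counter


def composition_chi2(seq_a, seq_b):
    """Pearson chi-squared statistic comparing residue counts of two sequences.

    Returns (statistic, degrees_of_freedom) for a 2 x k contingency table,
    where k is the number of distinct residues observed in either sequence.
    Both sequences are assumed to be non-empty.
    """
    counts_a, counts_b = Counter(seq_a), Counter(seq_b)
    residues = sorted(set(counts_a) | set(counts_b))
    n_a, n_b = len(seq_a), len(seq_b)
    total = n_a + n_b
    statistic = 0.0
    for residue in residues:
        column_total = counts_a[residue] + counts_b[residue]
        for observed, row_total in ((counts_a[residue], n_a),
                                    (counts_b[residue], n_b)):
            # Expected count under identical composition in both sequences.
            expected = column_total * row_total / total
            statistic += (observed - expected) ** 2 / expected
    return statistic, len(residues) - 1


# Identical composition gives a statistic of zero.
same = composition_chi2("AAGG", "GGAA")
# Completely different composition gives a large statistic.
different = composition_chi2("AAAA", "GGGG")
```

A statistic below the critical value for the chosen significance level would indicate no detectable difference in composition between the two sequences.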
We also carried out maximum-likelihood analyses of concatenated chloroplast SSU and large subunit (LSU) rDNA sequences (4,016 positions, of which 1,026 are variable) from the taxa shown in Fig. 2, using the HKY model of nucleotide substitution16 (for analyses with other models, see Supplementary Information). The reconstructed trees were found to be congruent with those inferred from chloroplast proteins in providing unequivocal support for T1. T1 and T3 accounted for 99.0% and 1.0% of bootstrap samples, respectively (T2 was not recovered), and T3 was significantly worse than T1 (P < 0.05) in the Kishino-Hasegawa test. Removal of all constant sites from the data had little effect on support for T1.
In favouring the placement of Mesostigma within Streptophyta (a position corresponding to T3), actin-gene trees10 contrast with those that we inferred from chloroplast sequence data. We used the Kishino-Hasegawa test under the HKY model to assess the confidence limit of actin-tree topologies (see Supplementary Information) and found that the phylogenetic information in the actin data set (198 variable sites) is not sufficient to eliminate the hypothesis that Mesostigma branches at the base of Chlorophyta and Streptophyta (T1). The actin trees that support this hypothesis represented a meaningful proportion (12.7%) of bootstrap samples in maximum-likelihood analysis and were not significantly worse (P ⩾ 0.30) than those compatible with T3 (best maximum-likelihood trees). From this result, we conclude that the discrepancy between the chloroplast and the actin tree is only apparent.
The positioning of Mesostigma as the earliest divergence in the phylogeny of green plants is supported by our finding that this prasinophyte has retained more ancestral cpDNA features than previously examined green plants. The most relevant of these ancestral features are (1) a quadripartite structure in which common genes reside in corresponding genomic regions, a feature that is also shared with the cpDNAs of some non-green algae, most land plants and Nephroselmis20; (2) the presence of five genes (trnA(ggc), odpB, ycf20, ycf61 and ycf65) that were previously identified only in Porphyra cpDNA; (3) the presence of all genes that were lost specifically in Chlorophyta or Streptophyta20, with the exception of one (rpl21) of the eight genes specifically missing from Chlorophyta and three (rne, rnpB and rpl12) of the sixteen genes missing from Streptophyta; (4) the presence of nine gene clusters that are found in some non-green algal cpDNAs but not in the Synechocystis genome (Fig. 1), of which only subsets have been retained in other green plants (Table 1); and (5) the absence of introns, a feature that is also shared with the cpDNAs of Nephroselmis20 and most non-green algae12.
Many genes in Mesostigma cpDNA form clusters that are shared exclusively with chlorophytes and/or streptophytes (Table 1), thus strengthening the notion that Mesostigma represents the most basal green plant lineage. These clusters provide clues into how the green plant chloroplast genome diverged from the ancestral pattern of gene organization during the evolution of Chlorophyta and Streptophyta.
The gene organization of Mesostigma cpDNA is highly similar to that of land plant cpDNAs, with 81% of its genes being found in clusters that are shared with land plant cpDNAs (Fig. 1). This observation indicates that the chloroplast genome from the common ancestor of chlorophytes and streptophytes has been highly preserved in structure and gene order during the long evolutionary period (∼800 Myr (ref. 6)) separating this ancestor from land plants. Such an exceptional conservation of gene order was unexpected, as Nephroselmis cpDNA, the green algal cpDNA previously known to display the most ancestral characters, shares a limited number of gene clusters with its land plant counterparts20. Our results predict that cpDNA rearrangements occurred relatively infrequently in Streptophyta and that analysis of these events will be useful not only to study how the chloroplast genome evolved during the transition from the most ancestral green flagellate to land plants, but also to clarify phylogenetic relationships among streptophytes.
The morphological characteristics8 of Mesostigma support the view that the most ancestral green flagellate was a biflagellated and asymmetric cell that had an underlayer of square scales (a character unique to green algae), an eye spot and a cruciate flagellar root system with multilayered structures1,7. The number of flagella is highly variable among prasinophytes and other green algae. Square-shaped scales are found in members of both Chlorophyta and Streptophyta; however, the presence of an eye spot has been noted only in chlorophytes. Multilayered structures are found in chlorophytes and streptophytes, and appear to be an ancestral character of algae, as similar structures have been described in other algal groups6.
Methods
DNA sequencing
Chloroplast DNA from Mesostigma viride (NIES-296) was isolated from total cellular DNA as an AT-rich fraction by CsCl-bisbenzimide isopycnic centrifugation20. This DNA preparation was sheared by nebulization, and 1,500–3,000-bp fragments were recovered by electroelution after agarose gel electrophoresis. These fragments were treated with Escherichia coli Klenow fragment and T7 DNA polymerase, and cloned into the SmaI site of Bluescript II KS+. After hybridization of the clones with the original DNA used for cloning, DNA templates from positive clones were prepared with the QIAprep 8 Miniprep kit (Qiagen). Nucleotide sequences were determined with the PRISM dye terminator cycle sequencing kit (Applied Biosystems) on a DNA sequencer (model 373; Applied Biosystems) using T3 and T7 primers. Sequences were assembled and analysed as described20. Short genomic regions not represented in the clones analysed were sequenced from PCR-amplified fragments.
Phylogenetic analysis
Genome sequences were retrieved from GenBank. Pedinomonas cpDNA sequences are from our unpublished data. Individual protein and rRNA gene sequences were aligned with CLUSTALW 1.74 (ref. 21), alignments were concatenated, and ambiguously aligned regions containing gaps were excluded. The alignments and data sets are available in Supplementary Information. The program packages MOLPHY 2.3b3 (ref. 16), PHYLIP 3.573c (ref. 22), PUZZLE 4.0.2 (ref. 23) and SPLITSTREE 2.4 (ref. 24) were used for phylogenetic analyses. Symmetric distance matrices were computed with PUZZLE and PROTDIST (ref. 22), whereas LogDet distances were calculated with SPLITSTREE. Distance trees were constructed with NEIGHBOR (ref. 22) and/or FITCH (ref. 22), maximum-parsimony trees were obtained with PROTPARS (ref. 22), and quartet-puzzling trees were generated with PUZZLE. The robustness of distance and maximum-parsimony trees was assessed by bootstrap percentages after 100 replications. In the case of quartet-puzzling trees, reliability percentages of the occurrence of the nodes were estimated after 10,000 puzzling steps. Maximum-likelihood analyses of protein and DNA sequences were carried out with PROTML (ref. 16) and NUCML (ref. 16), respectively, and local bootstrap probability was estimated by resampling of the estimated log likelihood (ref. 16).
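The data-preparation steps described above (concatenating per-gene alignments, excluding gapped positions and drawing bootstrap replicates) can be sketched as follows. This is a simplified illustration under stated assumptions rather than the pipeline used in the study: all function names and the toy sequences are hypothetical, alignments are plain dictionaries of equal-length strings, and every column containing a gap is dropped, which is cruder than excluding whole ambiguously aligned regions.

```python
import random

GAP = "-"


def concatenate(alignments, taxa):
    """Join per-gene alignments end to end, taxon by taxon."""
    return {taxon: "".join(aln[taxon] for aln in alignments) for taxon in taxa}


def drop_gapped_columns(alignment):
    """Remove every column in which at least one taxon has a gap."""
    names = list(alignment)
    columns = zip(*(alignment[name] for name in names))
    kept = [col for col in columns if GAP not in col]
    return {name: "".join(col[i] for col in kept)
            for i, name in enumerate(names)}


def bootstrap_replicate(alignment, rng):
    """Resample columns with replacement to the original alignment length."""
    length = len(next(iter(alignment.values())))
    picks = [rng.randrange(length) for _ in range(length)]
    return {name: "".join(seq[i] for i in picks)
            for name, seq in alignment.items()}


# Toy example with two hypothetical genes and two taxa.
gene1 = {"taxonA": "MK-L", "taxonB": "MKAL"}
gene2 = {"taxonA": "VV", "taxonB": "VI"}
combined = concatenate([gene1, gene2], ["taxonA", "taxonB"])
clean = drop_gapped_columns(combined)
replicate = bootstrap_replicate(clean, random.Random(0))
```

Each replicate would then be passed to a tree-building program, and the fraction of replicates that recover a given topology gives its bootstrap percentage.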
References
Melkonian, M., Marin, B. & Surek, B. in Biodiversity and Evolution (eds Arai, R., Kato, M. & Doi, Y.) 153–176 (The National Science Museum Foundation, Tokyo, 1995).
Friedl, T. The evolution of the green algae. Plant Syst. Evol. 11 (Suppl.), 87–101 (1997).
Chapman, R. L. et al. in Molecular Systematics of Plants II DNA Sequencing (eds Soltis, D. E., Soltis, P. S. & Doyle, J. J.) 508–540 (Kluwer Academic, Norwell, 1998).
Bremer, K. Summary of green plant phylogeny and classification. Cladistics 1, 369–385 (1985).
Sluiman, H. G. A cladistic evaluation of the lower and higher green plants (Viridiplantae). Plant Syst. Evol. 149, 217–232 (1985).
O'Kelly, C. J. in The Cytoskeleton of the Algae (ed. Menzel, D.) 315–345 (CRC, Boca Raton, 1992).
Melkonian, M. & Surek, B. Phylogeny of the Chlorophyta: congruence between ultrastructural and molecular evidence. Bull. Soc. Zool. Fr. 120, 191–208 (1995).
Melkonian, M. in Handbook of Protoctista. The Structure, Cultivation, Habitats and Life Histories of the Eukaryotic Microorganisms and their Descendants Exclusive of Animals, Plants and Fungi (eds Margulis, L., Corliss, J. O., Melkonian, M. & Chapman, D. J.) 600–607 (Jones and Bartlett, Boston, 1990).
Nakayama, T. et al. The basal position of scaly green flagellates among the green algae (Chlorophyta) is revealed by analyses of nuclear-encoded SSU rRNA sequences. Protist 149, 367–380 (1998).
Bhattacharya, D., Weber, K., An, S. S. & Berning-Koch, W. Actin phylogeny identifies Mesostigma viride as a flagellate ancestor of the land plants. J. Mol. Evol. 47, 544–550 (1998).
Melkonian, M. Flagellar apparatus ultrastructure in Mesostigma viride (Prasinophyceae). Plant Syst. Evol. 164, 93–122 (1989).
Palmer, J. D. & Delwiche, C. F. in Molecular Systematics of Plants II DNA Sequencing (eds Soltis, D. E., Soltis, P. S. & Doyle, J. J.) 375–409 (Kluwer Academic, Norwell, 1998).
Martin, W. et al. Gene transfer to the nucleus and evolution of chloroplast. Nature 393, 162–165 (1998).
Jones, D. T., Taylor, W. R. & Thornton, J. M. The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci. 8, 275–282 (1992).
Dayhoff, M. O., Schwartz, R. M. & Orcutt, B. C. in Atlas of Protein Sequence and Structure (ed. Dayhoff, M. O.) 345–352 (National Biomedical Research Foundation, Washington, D.C., 1978).
Adachi, J. & Hasegawa, M. MOLPHY version 2.3: programs for molecular phylogenetics based on maximum likelihood method. Comput. Sci. Monogr. 28, 1–150 (1996).
Lockhart, P. J., Larkum, A. W. D., Steel, M. A., Waddell, P. J. & Penny, D. Evolution of chlorophyll and bacteriochlorophyll: The problem of invariant sites in sequence analysis. Proc. Natl Acad. Sci. USA 93, 1930–1934 (1996).
Lockhart, P. J., Howe, C. J., Barbrook, A. C., Larkum, A. W. D. & Penny, D. Spectral analysis, systematic bias, and the evolution of chloroplast. Mol. Biol. Evol. 16, 573–576 (1999).
Lockhart, P. J., Steel, M. A., Hendy, M. D. & Penny, D. Recovering evolutionary trees under a more realistic model of sequence evolution. Mol. Biol. Evol. 11, 605–612 (1994).
Turmel, M., Otis, C. & Lemieux, C. The complete chloroplast DNA sequence of the green alga Nephroselmis olivacea: insights into the architecture of ancestral chloroplast genomes. Proc. Natl Acad. Sci. USA 96, 10248–10253 (1999).
Thompson, J. D., Higgins, D. G. & Gibson, T. J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680 (1994).
Felsenstein, J. PHYLIP (phylogeny inference package) Version 3.5 (Dept. Genet., Univ. Washington, Seattle, 1995).
Strimmer, K. & von Haeseler, A. Quartet puzzling: a quartet maximum-likelihood method for reconstructing tree topologies. Mol. Biol. Evol. 13, 964–969 (1996).
Huson, D. H. SplitsTree: a program for analyzing and visualizing evolutionary data. Bioinformatics 14, 68–73 (1998).
Stirewalt, V., Michalowski, C., Löffelhardt, W., Bohnert, H. & Bryant, D. Nucleotide sequence of the cyanelle genome from Cyanophora paradoxa. Plant Mol. Biol. Reporter 13, 327–332 (1995).
Reith, M. & Munholland, J. Complete nucleotide sequence of the Porphyra purpurea chloroplast genome. Plant Mol. Biol. Reporter 13, 333–335 (1995).
Douglas, S. E. & Penny, S. L. The plastid genome of the cryptophyte alga, Guillardia theta: complete sequence and conserved synteny groups confirm its common ancestry with red algae. J. Mol. Evol. 48, 236–244 (1999).
Kowallik, K. V., Stoebe, B., Schaffran, I., Kroth-Panic, P. & Freier, U. The chloroplast genome of a chlorophyll a+c-containing alga, Odontella sinensis. Plant Mol. Biol. Reporter 13, 336–342 (1995).
Ohyama, K. et al. Chloroplast gene organization deduced from complete sequence of liverwort Marchantia polymorpha chloroplast DNA. Nature 322, 572–574 (1986).
Wakasugi, T. et al. Complete nucleotide sequence of the chloroplast genome from the green alga Chlorella vulgaris: the existence of genes possibly involved in chloroplast division. Proc. Natl Acad. Sci. USA 94, 5967–5972 (1997).
Acknowledgements
We thank B. F. Lang and L. Forget for help with the preparation of the library, and M. W. Gray for editorial suggestions. This work was supported by grants from the Natural Sciences and Engineering Research Council of Canada (to M.T. and C.L.). M.T. and C.L. are associates in the Program in Evolutionary Biology at the Canadian Institute for Advanced Research.
Rights and permissions
This article is distributed under the terms of the Creative Commons Attribution-Non-Commercial-Share Alike licence (http://creativecommons.org/licenses/by-nc-sa/3.0/), which permits distribution, and reproduction in any medium, provided the original author and source are credited. This licence does not permit commercial exploitation, and derivative works must be licensed under the same or similar licence.
Cite this article
Lemieux, C., Otis, C. & Turmel, M. Ancestral chloroplast genome in Mesostigma viride reveals an early branch of green plant evolution. Nature 403, 649–652 (2000). https://doi.org/10.1038/35001059