- Main
Long-read sequence assembly of the gorilla genome
- Gordon, David;
- Huddleston, John;
- Chaisson, Mark JP;
- Hill, Christopher M;
- Kronenberg, Zev N;
- Munson, Katherine M;
- Malig, Maika;
- Raja, Archana;
- Fiddes, Ian;
- Hillier, LaDeana W;
- Dunn, Christopher;
- Baker, Carl;
- Armstrong, Joel;
- Diekhans, Mark;
- Paten, Benedict;
- Shendure, Jay;
- Wilson, Richard K;
- Haussler, David;
- Chin, Chen-Shan;
- Eichler, Evan E
- et al.
Published Web Location
https://doi.org/10.1126/science.aae0344Abstract
Accurate sequence and assembly of genomes is a critical first step for studies of genetic variation. We generated a high-quality assembly of the gorilla genome using single-molecule, real-time sequence technology and a string graph de novo assembly algorithm. The new assembly improves contiguity by two to three orders of magnitude with respect to previously released assemblies, recovering 87% of missing reference exons and incomplete gene models. Although regions of large, high-identity segmental duplications remain largely unresolved, this comprehensive assembly provides new biological insight into genetic diversity, structural variation, gene loss, and representation of repeat structures within the gorilla genome. The approach provides a path forward for the routine assembly of mammalian genomes at a level approaching that of the current quality of the human genome.
Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.
Main Content
Enter the password to open this PDF file:
-
-
-
-
-
-
-
-
-
-
-
-
-
-