Article
Published: 29 October 2014

The contribution of de novo coding mutations to autism spectrum disorder

Ivan Iossifov¹^na1,
Brian J. O’Roak^2,3^na1,
Stephan J. Sanders^4,5^na1,
Michael Ronemus¹^na1,
Niklas Krumm²,
Dan Levy¹,
Holly A. Stessman²,
Kali T. Witherspoon²,
Laura Vives²,
Karynne E. Patterson²,
Joshua D. Smith²,
Bryan Paeper²,
Deborah A. Nickerson²,
Jeanselle Dea⁴,
Shan Dong^5,6,
Luis E. Gonzalez⁷,
Jeffrey D. Mandell⁴,
Shrikant M. Mane⁸,
Michael T. Murtha⁷,
Catherine A. Sullivan⁷,
Michael F. Walker⁴,
Zainulabedin Waqar⁷,
Liping Wei^6,9,
A. Jeremy Willsey^4,5,
Boris Yamrom¹,
Yoon-ha Lee¹,
Ewa Grabowska^1,10,
Ertugrul Dalkic^1,11,
Zihua Wang¹,
Steven Marks¹,
Peter Andrews¹,
Anthony Leotta¹,
Jude Kendall¹,
Inessa Hakker¹,
Julie Rosenbaum¹,
Beicong Ma¹,
Linda Rodgers¹,
Jennifer Troge¹,
Giuseppe Narzisi^1,10,
Seungtai Yoon¹,
Michael C. Schatz¹,
Kenny Ye¹²,
W. Richard McCombie¹,
Jay Shendure²,
Evan E. Eichler^2,13,
Matthew W. State^4,5,7,14 &
…
Michael Wigler¹

Nature volume 515, pages 216–221 (2014)Cite this article

59k Accesses
1742 Citations
316 Altmetric
Metrics details

Subjects

Abstract

Whole exome sequencing has proven to be a powerful tool for understanding the genetic architecture of human disease. Here we apply it to more than 2,500 simplex families, each having a child with an autistic spectrum disorder. By comparing affected to unaffected siblings, we show that 13% of de novo missense mutations and 43% of de novo likely gene-disrupting (LGD) mutations contribute to 12% and 9% of diagnoses, respectively. Including copy number variants, coding de novo mutations contribute to about 30% of all simplex and 45% of female diagnoses. Almost all LGD mutations occur opposite wild-type alleles. LGD targets in affected females significantly overlap the targets in males of lower intelligence quotient (IQ), but neither overlaps significantly with targets in males of higher IQ. We estimate that LGD mutation in about 400 genes can contribute to the joint class of affected females and males of lower IQ, with an overlapping and similar number of genes vulnerable to contributory missense mutation. LGD targets in the joint class overlap with published targets for intellectual disability and schizophrenia, and are enriched for chromatin modifiers, FMRP-associated genes and embryonically expressed genes. Most of the significance for the latter comes from affected females.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Rates of *de novo* events by mutational type in the SSC.**

**Figure 2: Recurrently hit genes and non-verbal IQ.**

**Figure 3: Number of vulnerable genes and class vulnerability.**

**Figure 4: Estimated contributions of CNVs, LGDs and missense DN mutations to simplex ASD.**

Large-scale targeted sequencing identifies risk genes for neurodevelopmental disorders

Article Open access 01 October 2020

Rates of contributory de novo mutation in high and low-risk autism families

Article Open access 01 September 2021

Analysis of recent shared ancestry in a familial cohort identifies coding and noncoding autism spectrum disorder variants

Article Open access 21 February 2022

References

Jeste, S. S. & Geschwind, D. H. Disentangling the heterogeneity of autism spectrum disorder through genetic findings. Nature Rev. Neurol. 10, 74–81 (2014)
Article Google Scholar
Sanders, S. J. et al. Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism. Neuron 70, 863–885 (2011)
Article CAS Google Scholar
Levy, D. et al. Rare de novo and transmitted copy-number variation in autistic spectrum disorders. Neuron 70, 886–897 (2011)
Article CAS Google Scholar
Marshall, C. R. et al. Structural variation of chromosomes in autism spectrum disorder. Am. J. Hum. Genet. 82, 477–488 (2008)
Article CAS Google Scholar
Sebat, J. et al. Strong association of de novo copy number mutations with autism. Science 316, 445–449 (2007)
Article ADS CAS Google Scholar
Sanders, S. J. et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485, 237–241 (2012)
Article ADS CAS Google Scholar
O’Roak, B. J. et al. Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature 485, 246–250 (2012)
Article ADS Google Scholar
Iossifov, I. et al. De novo gene disruptions in children on the autistic spectrum. Neuron 74, 285–299 (2012)
Article CAS Google Scholar
Ronemus, M., Iossifov, I., Levy, D. & Wigler, M. The role of de novo mutations in the genetics of autism spectrum disorders. Nature Rev. Genet. 15, 133–141 (2014)
Article CAS Google Scholar
Zhao, X. et al. A unified genetic theory for sporadic and inherited autism. Proc. Natl Acad. Sci. USA 104, 12831–12836 (2007)
Article ADS CAS Google Scholar
Fischbach, G. D. & Lord, C. The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron 68, 192–195 (2010)
Article CAS Google Scholar
Campbell, C. D. et al. Estimating the human mutation rate using autozygosity in a founder population. Nature Genet. 44, 1277–1281 (2012)
Article CAS Google Scholar
Michaelson, J. J. et al. Whole-genome sequencing in autism identifies hot spots for de novo germline mutation. Cell 151, 1431–1442 (2012)
Article CAS Google Scholar
Schrider, D. R., Hourmozdi, J. N. & Hahn, M. W. Pervasive multinucleotide mutational events in eukaryotes. Curr. Biol. 21, 1051–1054 (2011)
Article CAS Google Scholar
Kong, A. et al. Rate of de novo mutations and the importance of father’s age to disease risk. Nature 488, 471–475 (2012)
Article ADS CAS Google Scholar
Neale, B. M. et al. Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 485, 242–245 (2012)
Article ADS CAS Google Scholar
Darnell, J. C. et al. FMRP stalls ribosomal translocation on mRNAs linked to synaptic function and autism. Cell 146, 247–261 (2011)
Article CAS Google Scholar
Kang, H. J. et al. Spatio-temporal transcriptome of the human brain. Nature 478, 483–489 (2011)
Article ADS CAS Google Scholar
Voineagu, I. et al. Transcriptomic analysis of autistic brain reveals convergent molecular pathology. Nature 474, 380–384 (2011)
Article CAS Google Scholar
Bayés, A. et al. Characterization of the proteome, diseases and evolution of the human postsynaptic density. Nature Neurosci. 14, 19–21 (2011)
Article Google Scholar
Blake, J. A., Bult, C. J., Kadin, J. A., Richardson, J. E. & Eppig, J. T. The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics. Nucleic Acids Res. 39, D842–D848 (2011)
Article CAS Google Scholar
Feldman, I., Rzhetsky, A. & Vitkup, D. Network properties of genes harboring inherited disease mutations. Proc. Natl Acad. Sci. USA 105, 4323–4328 (2008)
Article ADS CAS Google Scholar
Willsey, A. J. et al. Coexpression networks implicate human midfetal deep cortical projection neurons in the pathogenesis of autism. Cell 155, 997–1007 (2013)
Article CAS Google Scholar
Newschaffer, C. J. et al. The epidemiology of autism spectrum disorders. Annu. Rev. Public Health 28, 235–258 (2007)
Article Google Scholar
de Ligt, J. et al. Diagnostic exome sequencing in persons with severe intellectual disability. N. Engl. J. Med. 367, 1921–1929 (2012)
Article ADS CAS Google Scholar
Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature 506, 179–184 (2014)
Article ADS CAS Google Scholar
Lee, S. H. et al. Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs. Nature Genet. 45, 984–994 (2013)
Article CAS Google Scholar
McCarthy, S. E. et al. De novo mutations in schizophrenia implicate chromatin remodeling and support a genetic overlap with autism and intellectual disability. Mol. Psychiatry 19, 652–658 (2014)
Article CAS Google Scholar
Rauch, A. et al. Range of genetic mutations associated with severe non-syndromic sporadic intellectual disability: an exome sequencing study. Lancet 380, 1674–1682 (2012)
Article CAS Google Scholar
O’Roak, B. J. et al. Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science 338, 1619–1622 (2012)
Article ADS Google Scholar
Nishiyama, M., Skoultchi, A. I. & Nakayama, K. I. Histone H1 recruitment by CHD8 is essential for suppression of the Wnt-β-catenin signaling pathway. Mol. Cell. Biol. 32, 501–512 (2012)
Article CAS Google Scholar
Birchler, J. A. & Veitia, R. A. Gene balance hypothesis: connecting issues of dosage sensitivity across biological disciplines. Proc. Natl Acad. Sci. USA 109, 14746–14753 (2012)
Article ADS CAS Google Scholar
Cooper, D. N., Krawczak, M., Polychronakos, C., Tyler-Smith, C. & Kehrer-Sawatzki, H. Where genotype is not predictive of phenotype: towards an understanding of the molecular basis of reduced penetrance in human inherited disease. Hum. Genet. 132, 1077–1130 (2013)
Article CAS Google Scholar
Darnell, J. C. Defects in translational regulation contributing to human cognitive and behavioral disease. Curr. Opin. Genet. Dev. 21, 465–473 (2011)
Article CAS Google Scholar
Veitia, R. A., Bottani, S. & Birchler, J. A. Gene dosage effects: nonlinearities, genetic interactions, and dosage compensation. Trends Genet. 29, 385–393 (2013)
Article CAS Google Scholar
Weischenfeldt, J., Symmons, O., Spitz, F. & Korbel, J. O. Phenotypic impact of genomic structural variation: insights from and for human disease. Nature Rev. Genet. 14, 125–138 (2013)
Article CAS Google Scholar
Zhang, F., Gu, W., Hurles, M. E. & Lupski, J. R. Copy number variation in human health, disease, and evolution. Annu. Rev. Genomics Hum. Genet. 10, 451–481 (2009)
Article CAS Google Scholar
Eckersley-Maslin, M. A. & Spector, D. L. Random monoallelic expression: regulating gene expression one allele at a time. Trends Genet. 30, 237–244 (2014)
Article CAS Google Scholar
Jeffries, A. R. et al. Random or stochastic monoallelic expressed genes are enriched for neurodevelopmental disorder candidate genes. PLoS ONE 8, e85093 (2013)
Article ADS Google Scholar
O’Roak, B. J. et al. Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nature Genet. 43, 585–589 (2011)
Article Google Scholar
Boyle, E. A., O’Roak, B. J., Martin, B. K., Kumar, A. & Shendure, J. MIPgen: optimized modeling and design of molecular inversion probes for targeted resequencing. Bioinformatics 30, 2670–2672 (2014)
Article CAS Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009)
Article CAS Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010)
Article CAS Google Scholar
Narzisi, G. et al. Accurate de novo and transmitted indel detection in exome-capture data using microassembly. Nature Methods 11, 1033–1036 (2014)
Article CAS Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009)
Article Google Scholar
Reichenberg, A. et al. Advancing paternal age and autism. Arch. Gen. Psychiatry 63, 1026–1032 (2006)
Article Google Scholar
Croen, L. A., Najjar, D. V., Fireman, B. & Grether, J. K. Maternal and paternal age and risk of autism spectrum disorders. Arch. Pediatr. Adolesc. Med. 161, 334–340 (2007)
Article Google Scholar
Gulsuner, S. et al. Spatial and temporal mapping of de novo mutations in schizophrenia to a fetal prefrontal cortical network. Cell 154, 518–529 (2013)
Article CAS Google Scholar
Xu, B. et al. Exome sequencing supports a de novo mutational paradigm for schizophrenia. Nature Genet. 43, 864–868 (2011)
Article CAS Google Scholar

Download references

Acknowledgements

Simons Foundation Autism Research Initiative grants to E.E.E. (SF191889), M.W.S. (M144095 R11154) and M.W. (SF235988) supported this work. Additional support was provided by the Howard Hughes Medical Institute (International Student Research Fellowship to S.J.S.) and the Canadian Institutes of Health Research (Doctoral Foreign Study Award to A.J.W.). E.E.E. is an Investigator of the Howard Hughes Medical Institute. We thank all the families at the participating SSC sites, as well as the principal investigators (A. L. Beaudet, R. Bernier, J. Constantino, E. H. Cook Jr, E. Fombonne, D. Geschwind, D. E. Grice, A. Klin, D. H. Ledbetter, C. Lord, C. L. Martin, D. M. Martin, R. Maxim, J. Miles, O. Ousley, B. Peterson, J. Piggot, C. Saulnier, M. W. State, W. Stone, J. S. Sutcliffe, C. A. Walsh and E. Wijsman) and the coordinators and staff at the SSC sites for the recruitment and comprehensive assessment of simplex families; the SFARI staff for facilitating access to the SSC; and the Rutgers University Cell and DNA Repository (RUCDR) for accessing biomaterials. We would also like to thank the CSHL Woodbury Sequencing Center, the Genome Institute at the Washington University School of Medicine, and Yale Center for Genomic Analysis (in particular J. Overton) for generating sequencing data; E. Antoniou and E. Ghiban for their assistance in data production at CSHL; and T. Brooks-Boone, N. Wright-Davis and M. Wojciechowski for their help in administering the project at Yale. The NHLBI GO Exome Sequencing Project and its ongoing studies produced and provided exome variant calls for comparison: the Lung GO Sequencing Project (HL-102923), the WHI Sequencing Project (HL-102924), the Broad GO Sequencing Project (HL-102925), the Seattle GO Sequencing Project (HL-102926) and the Heart GO Sequencing Project (HL-103010).

Author information

Ivan Iossifov, Brian J. O’Roak, Stephan J. Sanders and Michael Ronemus: These authors contributed equally to this work.

Authors and Affiliations

Cold Spring Harbor Laboratory, Cold Spring Harbor, 11724, New York, USA
Ivan Iossifov, Michael Ronemus, Dan Levy, Boris Yamrom, Yoon-ha Lee, Ewa Grabowska, Ertugrul Dalkic, Zihua Wang, Steven Marks, Peter Andrews, Anthony Leotta, Jude Kendall, Inessa Hakker, Julie Rosenbaum, Beicong Ma, Linda Rodgers, Jennifer Troge, Giuseppe Narzisi, Seungtai Yoon, Michael C. Schatz, W. Richard McCombie & Michael Wigler
Department of Genome Sciences, University of Washington School of Medicine, Seattle, 98195, Washington, USA
Brian J. O’Roak, Niklas Krumm, Holly A. Stessman, Kali T. Witherspoon, Laura Vives, Karynne E. Patterson, Joshua D. Smith, Bryan Paeper, Deborah A. Nickerson, Jay Shendure & Evan E. Eichler
Molecular & Medical Genetics, Oregon Health & Science University, Portland, 97208, Oregon, USA
Brian J. O’Roak
Department of Psychiatry, University of California, San Francisco, San Francisco, California 94158, USA,
Stephan J. Sanders, Jeanselle Dea, Jeffrey D. Mandell, Michael F. Walker, A. Jeremy Willsey & Matthew W. State
Department of Genetics, Yale University School of Medicine, New Haven, 06520, Connecticut, USA
Stephan J. Sanders, Shan Dong, A. Jeremy Willsey & Matthew W. State
Center for Bioinformatics, State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, 100871, China
Shan Dong & Liping Wei
Child Study Center, Yale University School of Medicine, New Haven, 06520, Connecticut, USA
Luis E. Gonzalez, Michael T. Murtha, Catherine A. Sullivan, Zainulabedin Waqar & Matthew W. State
Yale Center for Genomic Analysis, Yale University School of Medicine, New Haven, 06520, Connecticut, USA
Shrikant M. Mane
National Institute of Biological Sciences, Beijing, 102206, China
Liping Wei
New York Genome Center, New York, 10013, New York , USA
Ewa Grabowska & Giuseppe Narzisi
Department of Medical Biology, Bulent Ecevit University School of Medicine, 67600 Zonguldak, Turkey,
Ertugrul Dalkic
Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, 10461, New York, USA
Kenny Ye
Howard Hughes Medical Institute, Seattle, 98195, Washington, USA
Evan E. Eichler
Department of Psychiatry, Yale University School of Medicine, New Haven, 06520, Connecticut, USA
Matthew W. State

Authors

Ivan Iossifov
View author publications
You can also search for this author in PubMed Google Scholar
Brian J. O’Roak
View author publications
You can also search for this author in PubMed Google Scholar
Stephan J. Sanders
View author publications
You can also search for this author in PubMed Google Scholar
Michael Ronemus
View author publications
You can also search for this author in PubMed Google Scholar
Niklas Krumm
View author publications
You can also search for this author in PubMed Google Scholar
Dan Levy
View author publications
You can also search for this author in PubMed Google Scholar
Holly A. Stessman
View author publications
You can also search for this author in PubMed Google Scholar
Kali T. Witherspoon
View author publications
You can also search for this author in PubMed Google Scholar
Laura Vives
View author publications
You can also search for this author in PubMed Google Scholar
Karynne E. Patterson
View author publications
You can also search for this author in PubMed Google Scholar
Joshua D. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Bryan Paeper
View author publications
You can also search for this author in PubMed Google Scholar
Deborah A. Nickerson
View author publications
You can also search for this author in PubMed Google Scholar
Jeanselle Dea
View author publications
You can also search for this author in PubMed Google Scholar
Shan Dong
View author publications
You can also search for this author in PubMed Google Scholar
Luis E. Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey D. Mandell
View author publications
You can also search for this author in PubMed Google Scholar
Shrikant M. Mane
View author publications
You can also search for this author in PubMed Google Scholar
Michael T. Murtha
View author publications
You can also search for this author in PubMed Google Scholar
Catherine A. Sullivan
View author publications
You can also search for this author in PubMed Google Scholar
Michael F. Walker
View author publications
You can also search for this author in PubMed Google Scholar
Zainulabedin Waqar
View author publications
You can also search for this author in PubMed Google Scholar
Liping Wei
View author publications
You can also search for this author in PubMed Google Scholar
A. Jeremy Willsey
View author publications
You can also search for this author in PubMed Google Scholar
Boris Yamrom
View author publications
You can also search for this author in PubMed Google Scholar
Yoon-ha Lee
View author publications
You can also search for this author in PubMed Google Scholar
Ewa Grabowska
View author publications
You can also search for this author in PubMed Google Scholar
Ertugrul Dalkic
View author publications
You can also search for this author in PubMed Google Scholar
Zihua Wang
View author publications
You can also search for this author in PubMed Google Scholar
Steven Marks
View author publications
You can also search for this author in PubMed Google Scholar
Peter Andrews
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Leotta
View author publications
You can also search for this author in PubMed Google Scholar
Jude Kendall
View author publications
You can also search for this author in PubMed Google Scholar
Inessa Hakker
View author publications
You can also search for this author in PubMed Google Scholar
Julie Rosenbaum
View author publications
You can also search for this author in PubMed Google Scholar
Beicong Ma
View author publications
You can also search for this author in PubMed Google Scholar
Linda Rodgers
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Troge
View author publications
You can also search for this author in PubMed Google Scholar
Giuseppe Narzisi
View author publications
You can also search for this author in PubMed Google Scholar
Seungtai Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Michael C. Schatz
View author publications
You can also search for this author in PubMed Google Scholar
Kenny Ye
View author publications
You can also search for this author in PubMed Google Scholar
W. Richard McCombie
View author publications
You can also search for this author in PubMed Google Scholar
Jay Shendure
View author publications
You can also search for this author in PubMed Google Scholar
Evan E. Eichler
View author publications
You can also search for this author in PubMed Google Scholar
Matthew W. State
View author publications
You can also search for this author in PubMed Google Scholar
Michael Wigler
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

CSHL: I.I., M.R. and M.W. designed the study; I.I., D.L., B.Y., Y.L., E.G., E.D., P.A., A.L., J.K., G.N., S.Y., M.C.S., K.Y. and M.W. analysed the data; M.R., I.H., J.R., B.M., L.R., J.T. and W.R.M. generated the exome data at Cold Spring Harbor Laboratory; I.I., Z.W., S.M. and J.T. confirmed the variants; I.I., M.R. and M.W. wrote the paper. UCSF/Yale: S.J.S. and M.W.S. designed the study; S.J.S., S.D., L.W. and A.J.W. analysed the data; S.J.S., J.D., L.E.G., J.D.M., C.A.S., M.F.W. and Z.W. confirmed the variants; S.M.M. and M.T.M. generated the exome data at Yale Medical Center. UW: B.J.O., J.S. and E.E.E. designed the study; B.J.O. and N.K. analysed the data; B.J.O., H.A.S., K.T.W. and L.V. confirmed the variants; E.E.E. and J.S. revised the manuscript; K.E.P, J.D.S., B.P. and D.A.N. generated the exome data at the University of Washington.

Corresponding authors

Correspondence to Jay Shendure, Evan E. Eichler, Matthew W. State or Michael Wigler.

Ethics declarations

Competing interests

E.E.E. is on the scientific advisory board of DNAnexus, Inc. and was a scientific advisory board member of Pacific Biosciences, Inc. (2009–2013) and SynapDx Corp. (2011–2013). J.S. is a member of the scientific advisory board or serves as a consultant for Adaptive Biotechnologies, Ariosa Diagnostics, Stratos Genomics, GenePeeks, Gen9, Good Start Genetics, Ingenuity Systems and Rubicon Genomics. B.J.O. is an inventor on patent PCT/US2009/30620: Mutations in contactin-associated protein 2 are associated with increased risk for idiopathic autism.

Additional information

Sequence data used in these work are available from the National Database for Autism Research (http://ndar.nih.gov/), under study DOI:10.15154/1149697.

Extended data figures and tables

Extended Data Figure 1 Number of families sequenced by centre.

The numbers of families sequenced at the three centres are plotted as a Venn diagram. Families sequenced at more than one centre are indicated by the overlapping regions between circles. CSHL, Cold Spring Harbor Laboratory; UW, University of Washington, Seattle; YALE, Yale Medical Center.

Extended Data Figure 2 SSC sequencing by pedigree type and non-verbal IQ.

A summary of all SSC families sequenced is indicated across the ‘all’ row. Numbers of SSC families with complete exome sequencing data are displayed by centre in the following rows (see Extended Data Fig. 1 legend for centre designations). The top number in entries under the ‘families’ column indicates the total number of families sequenced, and the number in parentheses below indicates the total number of individuals. Family pedigree structures are shown across the top row with gender indicated by shape (square for male, circle for female) and affected status indicated by colour (white for unaffected, grey for affected). Distributions of non-verbal IQ within each cohort are shown for male probands (blue) and female probands (red).

Extended Data Figure 3 Rates of de novo LGD and missense mutations in the SSC by child status.

On the left we show the LGD rate per child in six types of children, labelled on the x axis, defined by their affected status, gender, and non-verbal IQ. We test for equal rates for every pair of child types and we show the ones with P > 0.05 with thin lines on the top of the figure. Although not significant, the rates in affected females and in affected males of lower non-verbal IQ are larger than the rate in males of higher non-verbal IQ. On the right, we show the missense rates per child for the same six groups of children.

Extended Data Figure 4 Paternal age and DN mutation rate at child birth.

Distribution of paternal age at birth of children (top) and rates of DN mutation in offspring as a function of paternal age are shown (bottom). Children were ordered by paternal age at birth and split into 20 groups of similar size, as shown in the bottom panel. The red curve shows the mean observed rates of de novo exomic substitutions in each of the 20 groups, with the x coordinate equal to the mean each of the fathers’ ages within each group. The blue line shows a linear fit to the observed rates. The dotted green line represents DN mutation rates from whole genome sequencing data¹⁵ scaled to rates per exome based on representation in the SeqCap EZ Human Exome Library v2.0 (Roche NimbleGen).

Extended Data Figure 5 Coding region size distribution for query sets of genes.

Probability density function (PDF) and cumulative distribution functions (CDF) (right bottom) of the distributions of the coding region length in base pairs of five sets of genes: a set of 1,200 genes picked uniformly from the set of exome-targeted genes (blue); a separate set of 1,200 genes picked with probabilities proportional to length of the coding region (green); the set of gene targets of neutral mutations, including synonymous mutations in probands and siblings, and missense mutation in siblings (red); genes with de novo missense mutations in probands (cyan); and genes with de novo LGDs in probands (magenta). Black within the histograms shows the distribution of lengths of the recurrently hit genes from each class. Coding region length distribution under a uniform model does not fit the lengths of the genes with observed mutations, and genes with LGD mutations are longer than predicted by a simple length-based model (bottom right).

Extended Data Figure 6 Distributions of sequencing depth.

Distributions of sequencing depth (number of sequence reads covering a given genomic position) per person per position for the three sequencing centres are plotted. Centre designations are as in Extended Data Fig. 1.

Extended Data Figure 7 Yield of DN LGD and missense mutations.

We plot the yield of DN LGD and missense mutations per sequencing centre (designations as in Extended Data Fig. 1). In each case we show the number of mutations we expect to see based on the estimated rates per child, indicated by the numbers above the bars. We also show what percentage of the expected number we have observed. Black refers to strong calls in the 40× target, grey refers to strong calls outside of 40× target, and magenta refers to weak (but valid) calls. The white region represents the difference between the expected and observed numbers of variants.

Extended Data Figure 8 Categorization of embryonically expressed genes.

We downloaded expression data¹⁸ from http://www.brainspan.org/static/download.html. The data set provides normalized expression levels for ∼17,000 genes across brain regions from 36 individuals, 18 of which were from embryos. Each brain was further subdivided into 14 anatomical regions for a total of 508 regions. We computed correlation values for the 17,000 genes, and generated a graph by connecting genes that had correlations >0.85. We then identified connected components and averaged the expression of genes within these components as a function of the annotated age of the brain and by region. Each region is sorted first by age, then by type. The averaged normalized expression of the 1,912 genes in the first component decreases after birth, and hence we call this set embryonic. See Supplementary Table 7 for the list of embryonic genes.

Supplementary information

Supplementary Information

This file contains Supplementary Table 3 (Experimental validation in the 40X target), Supplementary Table 4 (Multiple de novo events), Supplementary Table 8 (Compound non-synonymous hits in targets), Supplementary Table 11 (Validation summary by centre) and Supplementary Table 13 (Median gene lengths) as well as legends for Supplementary Tables 1, 2, 5–7, 9, 10 and 12. (PDF 191 kb)

Supplementary Data

This zipped file contains Supplementary Tables 1-2, 5-7, 9, 10 and 12. (ZIP 1639 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Iossifov, I., O’Roak, B., Sanders, S. et al. The contribution of de novo coding mutations to autism spectrum disorder. Nature 515, 216–221 (2014). https://doi.org/10.1038/nature13908

Download citation

Received: 04 July 2014
Accepted: 03 October 2014
Published: 29 October 2014
Issue Date: 13 November 2014
DOI: https://doi.org/10.1038/nature13908

This article is cited by

Statistical methods for assessing the effects of de novo variants on birth defects
- Yuhan Xie
- Ruoxuan Wu
- Hongyu Zhao
Human Genomics (2024)
Meta-analysis of 46,000 germline de novo mutations linked to human inherited disease
- Mónica Lopes-Marques
- Matthew Mort
- Luísa Azevedo
Human Genomics (2024)
CRISPR-dCas13d-based deep screening of proximal and distal splicing-regulatory elements
- Yocelyn Recinos
- Dmytro Ustianenko
- Chaolin Zhang
Nature Communications (2024)
Severity of Autism Spectrum Disorder Symptoms Associated with de novo Variants and Pregnancy-Induced Hypertension
- Xiaomeng Wang
- Zhengbao Ling
- Jinchen Li
Journal of Autism and Developmental Disorders (2024)
Brief Report: Differences in Naturalistic Attention to Real-World Scenes in Adolescents with 16p.11.2 Deletion
- Amanda J. Haskins
- Jeff Mentch
- Caroline E. Robertson
Journal of Autism and Developmental Disorders (2024)