Abstract
Free full text
Aberrant splicing of HTT generates the pathogenic exon 1 protein in Huntington disease
Huntington disease (HD) is a devastating, late-onset, inherited neurodegenerative disorder that manifests with personality changes, movement disorders, and cognitive decline. It is caused by a CAG repeat expansion in exon 1 of the HTT gene that translates to a polyglutamine tract in the huntingtin protein (HTT). The formation of HTT fragments has been implicated as an essential step in the molecular pathogenesis of HD and several proteases that cleave HTT have been identified. However, the importance of smaller N-terminal fragments has been highlighted by their presence in HD postmortem brains and by the fact that nuclear inclusions are only detected by antibodies to the N terminus of HTT. Despite an intense research effort, the precise length of these fragments and the mechanism by which they are generated remains unknown. Here we show that CAG repeat length–dependent aberrant splicing of exon 1 HTT results in a short polyadenylated mRNA that is translated into an exon 1 HTT protein. Given that mutant exon 1 HTT proteins have consistently been shown to be highly pathogenic in HD mouse models, the aberrant splicing of HTT mRNA provides a mechanistic basis for the molecular pathogenesis of HD. RNA-targeted therapeutic strategies designed to lower the levels of HTT are under development. Many of these approaches would not prevent the production of exon 1 HTT and should be reviewed in light of our findings.
Huntington disease (HD) is a devastating, late-onset, inherited neurodegenerative disease caused by a CAG repeat expansion in exon 1 of the HTT gene that translates to a polyglutamine tract in huntingtin protein (HTT). Individuals with (CAG)40 and above will develop HD within a normal lifespan, whereas repeats above (CAG)70 will invariably cause childhood onset (1). The hypothesis that an N-terminal fragment of mutant HTT represents the “toxic” species is well-established (2), and the search for the proteases that generate HTT fragments has identified caspase-3 (3), caspase-6 (3), calpains (4, 5), and matrix metalloproteinase 10 (6). However, the potential importance of smaller N-terminal fragments is highlighted by their presence in HD postmortem brains (7), their release by formic acid solubilization from brain tissue (8), and that nuclear inclusions are only detected by antibodies to N-terminal HTT epitopes (7, 9). In cell models, HTT can be cleaved into small fragments, namely cp-A and cp-B (8) or cp-1 and cp-2 (10, 11), but the proteases responsible for this have not been identified. We have recently shown that the smallest fragment present in the brains of HdhQ150 knock-in mice is an exon 1 HTT protein (12). Analysis of the R6 transgenic mouse lines has previously shown an exon 1 HTT protein is produced by translation of an exon 1 HTT transcript (13). This led us to investigate whether the exon 1 HTT protein in the HdhQ150 knock-in mice might have been generated by aberrant splicing rather than proteolytic cleavage.
Aberrant Splicing of Mouse Htt Exon 1 to Exon 2 Results in a Small Polyadenylated mRNA.
The HdhQ150 knock-in mouse model was generated by the insertion of ~150 CAGs into exon 1 of the mouse Htt gene (14). To determine whether exon 1 of Htt mRNA had spliced to exon 2, we used a series of RT-PCR assays (Fig. 1A) on cDNA prepared from the brains of homozygous (HdhQ150/Q150), heterozygous (Hdh+/Q150) and WT animals at 2 mo of age (Fig. 1B). Nonquantitative end-point RT-PCR products were comparable between all three genotypes for assays that amplified exon 2 and spanned the exon 1–exon 2 junction. In contrast, the levels of the RT-PCR products obtained for the exon 1–intron 1 boundary and for intron 1 sequences upstream of ~1.2 kb were more intense in cDNA prepared from the brains of HdhQ150/Q150 compared with Hdh+/Q150 animals, which were in turn more intense than those obtained from their WT littermates (Fig. 1B). Examination of the genomic sequence for Htt intron 1 identified a cryptic polyadenylation signal at position 677–685 bp and 3′RACE showed the presence of a polyadenylated short mRNA in HdhQ150/Q150 and Hdh+/Q150 but not WT brains (Fig. 1C). This unspliced exon 1–intron 1 transcript was present throughout the brain and in a range of peripheral tissues (Figs. S1 and S2). To independently verify the presence of the exon 1–intron 1 transcript, we used total RNA sequencing (RNA-Seq) on RNA prepared from the cortex of HdhQ150/Q150 and WT mice. Considerable read densities mapping to the first 1.2 kb of intron 1 were observed in HdhQ150/Q150, but not in WT littermate samples (Fig. 1D and Fig. S3). To approximate the percentage of nonspliced transcripts, we compared the 3′UTR read densities by the Mixture-of-Isoforms (MISO) software and found a 20% reduction in the full-length mRNA produced in the HdhQ150/Q150 samples (Fig. S3). Finally, we failed to detect any evidence for higher intronic reads in homozygotes compared with WT at any other position along the mouse Htt gene (Fig. S3C).
Aberrant Splicing Occurs in All HD Knock-In Mouse Models and Is CAG Repeat Length–Dependent.
As stated previously, individuals with mutations of (CAG)40 and above will develop HD within a normal lifespan, whereas repeats above (CAG)70 will invariably cause childhood onset (1). Therefore, we used a series of knock-in mouse lines: HdhQ20, Q50, Q80, Q100, Q150, and zQ175 (Figs. 1A and and2A)2A) to determine whether aberrant splicing occurred in the context of a wide range of CAG repeats (Table S1 shows CAG repeats lengths). We analyzed RNA at 2 mo of age for each of the knock-in lines and were able to detect the same 3′RACE product in all cases except for HdhQ20 (Fig. 2B). Quantitative RT-PCR (indicated that the spliced full-length Htt mRNA levels, as determined by amplifying the exon 1–exon 2 junction and exon 2 sequences, were largely comparable between genotypes for Q20, Q50, Q80, and Q100. However, the full-length Htt transcripts were underrepresented in the Q150 and zQ175 mice (Fig. 2 C and D), consistent with the RNA-Seq data (Fig. S3B) and a previous report (15). It was not possible to normalize the levels of any intronic sequences to WT because in these mice, intron 1 sequences in processed mRNA have been removed by splicing and any signal represents noise (Fig. 2E). However, quantitative RT-PCR showed that transcripts containing early intron 1 sequences were highly increased in a gene–dose-dependent manner for all lines except HdhQ20 (Fig. 2E). The level of these transcripts also increased in a polyQ length–dependent manner when comparing the Q50, Q100, and Q150 lines (mouse Htt sequences only) or the Q80 and zQ175 lines (human-mouse chimeric Htt) (Fig. 2 A and E).
Aberrant Splicing Occurs in the Context of Human HTT.
To extrapolate these findings to human HTT, we performed in silico analysis and identified nine predicted polyadenylation signals in intron 1 (Fig. S4). 3′RACE on brain RNA from 2-mo-old YAC128 (16) and BACHD (17) mice, both of which are transgenic for human HTT, showed that the signal with the highest predictive score (~7.3 kb) results in a polyadenylated transcript (Fig. 2F). A second polyadenylated transcript, resulting from a signal at ~2.7 kb, was also found in the YAC128 mice (Fig. S5).
Next we sought to determine whether the aberrantly spliced mRNA was translated. We performed RT-PCR on RNA isolated from polysome gradients prepared from homozygous zQ175 and WT brains at 2 mo of age (Fig. 3A and Fig. S6). RT-PCR with primers to Hsf1 and Atp5b demonstrated that the polysomes isolated from both genotypes were intact (Fig. S6 A–C). RNA isolated from the polysome gradients from WT mice showed the expected profiles: Htt exonic sequences, but not early intron 1 sequences were associated with polysomes (fractions 12–18) (Fig. 3A and Fig. S6D). In contrast, early intron 1 sequences from zQ175 mice were present in the polysome fractions (Fig. 3A), which is consistent with the nonspliced RNA being translated.
Aberrantly Spliced Htt Transcript Is Translated and Produces an Exon 1 HTT Protein.
An unspliced exon 1 is followed by an evolutionarily conserved stop codon (Fig. S7), resulting in the production of an exon 1 protein that, in all vertebrates, terminates in a proline residue. We have previously shown that the smallest N-terminal fragment generated in the HdhQ150 lines corresponds to exon 1 HTT (12). To determine whether this is also present in the other HD models, we performed immunoprecipitation with 3B5H10, and Western blots were immunoprobed with S830, MW8 [exon 1 HTT C-terminal neo-epitope (12)], and 1H6 (Fig. 3B). Comparison of the S830 and MW8 blots (which only detect an exon 1 HTT protein on Western blots) reveals that an exon 1 HTT protein (dotted lines) is present in the zQ175, HdhQ150, HdhQ100, HdhQ80, and YAC128 brains but not in those from their WT littermates or in the IgG controls (Fig. 3 C–F and Fig. S8), as would be predicted from the presence of the nonspliced short transcript.
Splicing Factor SRSF6 Binds to Expanded CAG Repeats in Htt Transcripts.
To investigate the underlying mechanism, we used bioinformatics to predict regulatory motifs in exon 1 of Htt (Fig. 4A) and mapped the binding site of SRSF6 to a CAG or CAGCAA repeat (Fig. 4B). RNA coimmunoprecipitation with an antibody to SRSF6 captured Htt 5′UTR and early intron 1 sequences from homozygous zQ175 mice with a much higher efficiency than those from WT brain lysates (Fig. 4C). Exon 2 transcripts were not immunoprecipitated from either zQ175 or WT lysates (Fig. 4C), which is consistent with SRSF6 binding to the expanded CAG repeat in the zQ175 mice and inhibiting the splicing of exon 1 to exon 2.
The Aberrantly Spliced Transcript Is Present in HD Patient Tissues.
To determine whether the aberrant splicing also occurs in humans, we performed RT-PCR on RNA from two juvenile HD and two control fibroblast lines, peripheral blood mononuclear cells from HD and control subjects, postmortem sensory motor cortex from two HD and two control subjects, and cortical tissue from two individuals with juvenile HD (Table S2). In the fibroblast lines, the signals were relatively comparable for exon 1, the exon 1–exon 2 junction, and exon 2 transcripts. In contrast, the early intron 1 transcripts were elevated in the HD fibroblast compared with the control lines (Fig. 4D). However, the pattern was more difficult to interpret in the postmortem brain samples, probably reflecting differences in the extent to which the RNA had degraded in these tissues. We performed 3′RACE for the same cryptic polyadenylation signal as used in the YAC128 and BACHD mice. The polyadenylated transcript was clearly apparent in the two fibroblast lines as well as in the sensory motor cortex of an HD patient with 42 CAGs and the cortex of a juvenile HD individual with 72 CAGs (Fig. 4E). We were only able to detect a 3′RACE product in one of eight peripheral blood mononuclear cells samples, suggesting that the missplicing events are very rare in these cells.
We have identified a small exon 1–intron 1 polyadenylated mRNA transcript in the brains of all HD mouse models expressing mutant Htt (mouse) or HTT (human). Furthermore, we have shown that the same transcript is also present in fibroblast lines derived from HD patients and in postmortem HD brains. We have shown that the SRSF6 splicing factor binds to the 5′end of the Htt gene with an expanded CAG repeat, consistent with its known recognition motif (18). SRSF6 regulates splicing and facilitates translation of partially spliced transcripts (19, 20). SR proteins might also sequester U1 small nuclear ribonucleoprotein complexes (21), a phenomenon that promotes polyadenylation from cryptic polyA signals within introns (22). Therefore, an increased association of SRSF6 with expanded CAG repeats could account for the CAG repeat length–dependent production of the exon 1–intron 1 transcript. Translation of this transcript produces an exon 1 HTT protein.
The pathological consequences of the expression of this aberrantly spliced product in mice have been demonstrated in multiple experiments. The R6/ mouse lines are transgenic for a genomic fragment that spans the 5′ end of the HTT gene, exon 1, and a portion of intron 1 (13). Therefore, the R6 lines can be considered to be models of aberrant splicing in HD and demonstrate that an exon 1 HTT protein is highly pathogenic, a result that was recapitulated in a set of independent experiments (23). Surprisingly, the late-stage phenotypes of R6/2 mice closely resemble those present in the HdhQ150 knock-in mice (24–27), the main difference between these two models being the age of phenotype onset and rate of disease progression. This suggests that the pathogenic process in HdhQ150 mice could be driven by the same exon 1 HTT fragment differing only in its abundance and accumulation over time.
Our discovery that exon 1 HTT is generated by aberrant splicing will provide the opportunity to test the extent to which an exon 1 protein contributes to disease pathogenesis in the knock-in lines and the extent to which full-length HTT and/or other N-terminal HTT fragments, generated by proteolytic cleavage, are also pathogenic. RNA-targeted therapeutic approaches designed to lower the levels of HTT through the use of antisense oligonucleotides, RNAi, or small hairpin RNAs are under development (28). Many of these approaches would not prevent the production of exon 1 HTT and should be reviewed in the light of our findings.
Mouse Maintenance and Breeding.
The HdhQ50 and HdhQ100 lines were generated by selective breeding for alterations in germ-line repeat size starting with a C57BL/6 congenic of the HdhQ150 lines (14). The Hdh+/Q20, Hdh+/Q80 (29, 30), and zQ175 (31) knock-in mice were supplied from CHDI colonies maintained at The Jackson Laboratory. The HdhQ20, HdhQ50, HdhQ80, HdhQ100, and zQ175 lines were maintained by backcrossing to C57BL/6J (Charles River). All experimental procedures were approved by the King's College London Ethical Review Committee and performed in accordance with United Kingdom Home Office regulations.
RNA Sequencing.
Samples were prepared using a modified, strand-specific version of the Illumina Tru-Seq protocol. Final PCR amplification was performed with KAPA HiFi polymerase and GC buffer (Kapa Biosystems). The paired-end, strand-specific cDNA libraries were multiplexed onto the Illumina HiSEq (40-bp reads). Read data were mapped to the mm9 build with the Bowtie alignment program using the “best” setting. For quality control statistics of the RNA-Seq libraries, see Table S4. Differential expression of the Htt gene was analyzed with the R DESeq package. Gene expression was calculated as reads per kilobase of exon per million mapped reads (RPKMs). Splicing was analyzed using the Python/C version of MISO.
Polysome Gradients.
A 10–40% (wt/vol) sucrose gradient was prepared in 50 mM TrisCl (pH 6.6), 140 mM NaCl, 12 mM magnesium chloride, 200 µg/mL cycloheximide, and 1 mM DTT. Mouse brain tissue was lysed in freshly prepared polysome buffer [10 mM TrisCl (pH 7.4), 140 mM NaCl, 12 mM magnesium chloride, 1% (wt/vol) Triton X-100, 1 mM DTT, 200 µg/mL cycloheximide, 0.5 U/µL RNAsin, and 10 mM ribonucleoside vanadyl complex]. A volume corresponding to 250 µg absorbance at 260 nm was layered on the 10–40% sucrose gradients. The gradients were centrifuged at 39,000 rpm at 4 °C for 1 h 40 min in a SW41-Ti swing out rotor. Fractions were collected and RNA was extracted.
SRSF6 RNA Coimmunoprecipitation.
Mouse brain tissue was lysed in freshly prepared Triton buffer [50 mM Hepes/NaOH pH 7.6, 160 mM NaCl, 7 mM magnesium chloride, 3 mM calcium chloride, 5 mM potassium chloride, 1% (wt/vol) Triton X-100, 1 mM phenylmethylsulfonyl fluoride, 0.5 U/µL RNAsin and complete protease and phosphatase inhibitor mixture]. Two milligrams of total protein was immunoprecipitated for 5 h on a rotating wheel at 4 °C with 9 μL of Dynabeads protein G and 3 μg of anti-SRSF6 antibody (LS-B5712; LifeSpan BioSciences). The beads were washed four times with 0.5 mL Triton buffer followed by RNA extraction.
Detailed descriptions of mouse maintenance, breeding, genotyping, and CAG repeat sizing, quantitative RT-PCR, mouse/human RT-PCR (see also Table S5) and 3′RACE, RNA sequencing, polysome gradients, antibodies, immunoprecipitation, Western blotting, SRSF6 RNA-IP, bioinformatics, and statistical analysis are provided in SI Materials and Methods.
We thank Jenny Morton for providing YAC128 mice, Yvon Trottier for the 1H6 antibody, the Massachusetts Institute of Technology BioMicro Center for RNA-Seq preparation and sequencing, and the Neurological Foundation of New Zealand for Huntington disease postmortem brain tissue. This work was supported by Grant G0801314 from the Medical Research Council and a grant from the CHDI Foundation.
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/lookup/suppl/10.1073/pnas.1221891110/-/DCSupplemental.
Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences
Full text links
Read article at publisher's site: https://doi.org/10.1073/pnas.1221891110
Read article for free, from open access legal sources, via Unpaywall: https://www.pnas.org/content/pnas/110/6/2366.full.pdf
Citations & impact
Impact metrics
Citations of article over time
Alternative metrics
Smart citations by scite.ai
Explore citation contexts and check if this article has been
supported or disputed.
https://scite.ai/reports/10.1073/pnas.1221891110
Article citations
Age-dependent somatic expansion of the ATXN3 CAG repeat in the blood and buccal swab DNA of individuals with spinocerebellar ataxia type 3/Machado-Joseph disease.
Hum Genet, 143(11):1363-1378, 08 Oct 2024
Cited by: 0 articles | PMID: 39375222 | PMCID: PMC11522074
m<sup>6</sup>A modification of mutant huntingtin RNA promotes the biogenesis of pathogenic huntingtin transcripts.
EMBO Rep, 25(11):5026-5052, 11 Oct 2024
Cited by: 0 articles | PMID: 39394467 | PMCID: PMC11549361
Therapeutic targeting of RNA for neurological and neuromuscular disease.
Genes Dev, 38(15-16):698-717, 19 Sep 2024
Cited by: 0 articles | PMID: 39142832
Review
Huntingtin contains an ubiquitin-binding domain and regulates lysosomal targeting of mitochondrial and RNA-binding proteins.
Proc Natl Acad Sci U S A, 121(32):e2319091121, 29 Jul 2024
Cited by: 2 articles | PMID: 39074279 | PMCID: PMC11317567
Beyond CAG Repeats: The Multifaceted Role of Genetics in Huntington Disease.
Genes (Basel), 15(6):807, 19 Jun 2024
Cited by: 2 articles | PMID: 38927742
Review
Go to all (294) article citations
Data
Data behind the article
This data has been text mined from the article, or deposited into data resources.
BioStudies: supplemental material and supporting data
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Aberrantly spliced HTT, a new player in Huntington's disease pathogenesis.
RNA Biol, 10(11):1647-1652, 11 Oct 2013
Cited by: 41 articles | PMID: 24256709 | PMCID: PMC3907474
A series of N-terminal epitope tagged Hdh knock-in alleles expressing normal and mutant huntingtin: their application to understanding the effect of increasing the length of normal Huntingtin's polyglutamine stretch on CAG140 mouse model pathogenesis.
Mol Brain, 5:28, 14 Aug 2012
Cited by: 14 articles | PMID: 22892315 | PMCID: PMC3499431
Ciliogenesis is regulated by a huntingtin-HAP1-PCM1 pathway and is altered in Huntington disease.
J Clin Invest, 121(11):4372-4382, 10 Oct 2011
Cited by: 97 articles | PMID: 21985783 | PMCID: PMC3223861
Selective degeneration in YAC mouse models of Huntington disease.
Brain Res Bull, 72(2-3):124-131, 16 Nov 2006
Cited by: 28 articles | PMID: 17352936
Review
Funding
Funders who supported this work.
Medical Research Council (2)
Investigation of the mechanism by which huntingtin fragments are produced and their pathogenic relevance to Huntington's disease
Prof Gillian Bates, King's College London
Grant ID: MR/L003627/1
Investigation of Heat shock Factor 1 as a Therapeutic Target for Huntington s Disease
Prof Gillian Bates, King's College London
Grant ID: G0801314
NCI NIH HHS (1)
Grant ID: P30 CA014051