Abstract
X-chromosome inactivation and X-upregulation are the fundamental modes of chromosome-wide gene regulation that collectively achieve dosage compensation in mammals, but the regulatory link between the two remains elusive and the X-upregulation dynamics are unknown. Here, we use allele-resolved single-cell RNA-seq combined with chromatin accessibility profiling and finely dissect their separate effects on RNA levels during mouse development. Surprisingly, we uncover that X-upregulation elastically tunes expression dosage in a sex- and lineage-specific manner, and moreover along varying degrees of X-inactivation progression. Male blastomeres achieve X-upregulation upon zygotic genome activation while females experience two distinct waves of upregulation, upon imprinted and random X-inactivation; and ablation of Xist impedes female X-upregulation. Female cells carrying two active X chromosomes lack upregulation, yet their collective RNA output exceeds that of a single hyperactive allele. Importantly, this conflicts the conventional dosage compensation model in which naïve female cells are initially subject to biallelic X-upregulation followed by X-inactivation of one allele to correct the X dosage. Together, our study provides key insights to the chain of events of dosage compensation, explaining how transcript copy numbers can remain remarkably stable across developmental windows wherein severe dose imbalance would otherwise be experienced by the cell.
Similar content being viewed by others
Introduction
In most mammals, the X chromosome is present as two copies in females but only one in males, and two X-chromosome-wide mechanisms ensure balanced expression dosage in the cell1. X-chromosome upregulation (XCU) evolved to resolve X-to-autosomal imbalances in XY males by hyperactivation of the single X-chromosome whereas X-chromosome inactivation (XCI) followed as a mechanism to silence one hyperactive allele in XX females, equalizing expression between the sexes1. This stands as the prevailing evolutionary hypothesis of mammalian dosage compensation2,3,4. In addition, XCU followed by XCI is also believed to be the developmental sequence of events. Specifically, the conventional model of mammalian dosage compensation assumes an initial state of biallelically upregulated X chromosomes in female embryonic cells upon which XCI subsequently corrects the expression dosage by silencing one X allele. However, despite being central for the mechanistic understanding of how cells achieve dosage compensation, the developmental dynamics of XCU have so far not been thoroughly characterized in a mammal. Previous studies have approximated XCU primarily by the relative measurement between nonallelic total expression levels of X and autosomes in steady-state XCU5,6,7,8,9,10. Such comparisons are however not only indirect since autosomes and sex chromosomes differ in multiple aspects such as gene content but also confounded as opposite dosage effects of XCU and XCI on each allele would be masked at a nonallelic level. It is therefore unsurprising that reports on the establishment, magnitude, maintenance, and potential reversal of XCU have been conflicting5,11,12. Disentangling the isolated dosage effects of XCU and XCI requires quantitative gene expression measurements at a cellular and allelic resolution only recently enabled by allele-resolved single-cell RNA-seq (scRNA-seq), and computationally accounting for stochastic processes affecting allelic measurements at the cellular level13, such as effects of transcriptional bursting and random XCI (rXCI) progressing heterogeneously in cells.
Here, we reveal the dynamics of XCU in mouse at the cellular and allelic resolution throughout mouse embryonic stem cell (mESC) priming in vitro as well as pre- and peri-implantation development in vivo. By combining allele-resolved scRNA-seq and scATAC-seq (single-cell assay for transposase-accessible chromatin using sequencing) in the same cells during XCU establishment, we directly contrast epigenetic features of the hyperactive and non-hyperactive state for the first time. Remarkably, our data show that XCU is neither a constant state of hyper transcription of the X chromosome, nor established in a single developmental event in female embryos, but is a flexible process that tunes RNA synthesis proportionally to the output of the second X allele across developmental states. These new insights alter the model of dosage compensation in early mammalian embryogenesis, bridging the allele-specific dynamics of XCU and XCI.
Results
Naïve female embryonic cells lack X-upregulation
Exit from pluripotency is accompanied by rXCI in females14. We modeled this process by differentiating naïve F1 mESCs of mixed genetic background (C57BL6/J × CAST/EiJ) cultured under 2i condition (Gsk3 + MEK inhibition) towards primed epiblast stem cells (EpiSCs; Activin A + FGF2) for up to 7 days, during which cells were captured for scRNA-seq (Fig. 1a and Supplementary Data 1). By using Smart-seq315 we obtained gene-copy-specific expression measurements, as full-length read coverage provided the allelic origin of transcripts via parental polymorphisms and unique molecular identifiers (UMIs) yielded original RNA counts (Supplementary Fig. 1a). After quality filtering, we captured the allelic expression of up to 576 X-linked and 18,043 autosomal genes in 687 deep-sequenced cells, highlighting that our data provided near genome-wide allelic resolution and the sensitivity to study allelic regulation on the gene level in single cells.
The mESC-to-EpiSC transition triggered distinct expression changes accompanied by the loss of pluripotency factors (e.g., Sox2 and Nanog) and induction of lineage-specific factors (e.g., Fgf5 and Krt18) together with related pathways (Fig. 1b, c, Supplementary Fig. 1b, c, and Supplementary Data 2), signifying successful stem cell priming. As expected, female naïve mESCs, carrying two active X alleles (XaXa state), demonstrated an elevated total X-gene expression dosage that diminished upon exit from pluripotency (Fig. 1d). This dose-diminishing effect has previously been attributed to the silencing of one out of two hyperactive X alleles by XCI7,16. We confirmed the elevated X dosage in naïve female XX mESCs compared to both male XY and female XO (Turner syndrome) mESCs in bulk RNA-seq17,18 (Fig. 1e). At the same time, we recorded female XX mESC expression to be less than the twofold higher expected relative to XY and XO if comparing a biallelic hyperactive XaXa state to a single hyperactive X allele in XY/XO cells (median XY = 1.42 and XO = 1.54-fold, relative to XX) (Fig. 1f). To dissect X dosage to allelic resolution in our scRNA-seq data, we stratified female cells according to XCI status into biallelic and monoallelic X-chromosome states (XaXa, and XaXi/XiXa, respectively) (Fig. 1g and Supplementary Fig. 1d) calculated from X-chromosome allelic expression imbalance, indeed confirming decreased total X-linked expression upon XCI (Supplementary Fig. 1e). Surprisingly, when resolved onto the separate alleles, X-linked expression in XaXa cells was on par with autosomal levels for each allele whereas female X-inactive states (XaXi/XiXa) and male (XY) cells exhibited distinct upregulation of the single active X-chromosome copy (Fig. 1h). Remarkably, this implies that XCI does not initiate silencing upon one of two hyperactive X chromosomes, which is distinct from what is widely believed and modeled for the dosage compensation process3,4,7,16. Notably, upregulation was observed X-chromosome-wide (Supplementary Fig. 1f) and female XaXa cells consistently lacked XCU regardless of days of EpiSC priming (Supplementary Fig. 1g). This was validated by reanalyzing published 3′ UMI-tagged allele-resolved scRNA-seq data of 2i withdrawal in mESCs19, under which XCI initiates at a lower rate compared to EpiSC priming14, allowing us to observe the effect of biallelic versus monoallelic X expression over a wide timespan (Supplementary Fig. 1h–j). Notably, our finding that two moderately expressed X alleles (XaXa state) achieve higher total expression dose than a single hyperactive X allele implies that XCU does not attain full twofold upregulation at the transcript-count level. Indeed, calculating relative gene-wise expression of the same active allele transitioning from XaXa into XaXi state (Fig. 1i), we observed a pronounced shift in X-linked expression yet with fold changes below two (median XC57 = 1.62, PC57 = 1.41 × 10−40 and median XCAST = 1.67, PCAST = 5.72 × 10−48, Wilcoxon signed-rank test). Indeed, even as the Xa allele became hyperactive after XCI, the combined output of both basally expressed X alleles in the XaXa state remained higher (Fig. 1j). While these findings conceptually conflict the model that XCI acts on a biallelically hyperactive XaXa state3,4, the scenario remains compatible with the notion that increased dosage of X-linked pluripotency factors in naïve female XaXa cells delays differentiation of female mESCs20.
Two waves of X-upregulation
Naïve mESCs are derived from, and mimic, the inner cell mass (ICM) of preimplantation embryos. XCU was previously proposed to be present prior to ICM formation5,11,12 but the lack of hyperactivation in naïve female mESCs that we observed suggested that XCU might be reversed during embryonic development. To map the X-upregulation dynamics in vivo, we leveraged the large allele-level Smart-seq datasets we recently generated across early murine embryogenesis14,21,22, covering key developmental time points from the oocyte/zygote up until gastrulation and spanning the known major events of embryonic X-chromosome regulation;1 i.e., imprinted XCI (iXCI), X-chromosome reactivation (XCR), and rXCI (Fig. 2a). Allelic expression was balanced for X and autosomes in mature (MII) oocytes up until two-cell stages where mRNAs originate only from the maternal genome. However, around the completion of zygotic genome activation (ZGA, ~4-cell stage), where biallelic autosomal transcription is achieved, XCU was specifically observed in male cells (Fig. 2b). Conversely, female four-cell embryos exhibited biallelic XaXa expression that lacked XCU, recapitulating the observations in naïve XaXa mESCs. Maternal-specific XCU was first detected around the 8–16-cell stage and was maintained throughout preimplantation blastocyst development (Fig. 2c), notably coinciding with the iXCI dynamics on the paternal allele. Because lineage specification commences during blastocyst development, we classified blastocyst cells into epiblast (EPI), trophectoderm (TE), and primitive endoderm (PrE) lineages (Supplementary Fig. 2a, b), revealing XCU to be present in all late-blastocyst lineages, including EPI cells prior to XCR (Supplementary Fig. 2c). Importantly, the sex-specific temporal dynamics of XCU closely followed events that would otherwise result in imbalanced chromosomal dosage, suggesting that XCU follows external X-chromosome-dosage imbalances. To test this hypothesis, we analyzed XCU in allele-resolved scRNA-seq data from XistpatΔ knockout embryos genetically designed to lack iXCI11. Indeed, female XistpatΔ knockout embryos did not initiate XCU (Fig. 2c), indicating that hyperactivation is initiated as a response to imbalanced dosage.
As cell lineages are transcriptionally distinct in postimplantation embryos22 (E5.5-6.5), we continued the analyses in a lineage-specific manner. Whereas extraembryonic lineages (visceral endoderm, VE; extraembryonic ectoderm, ExE) retain iXCI in female cells, EPI cells undergo XCR followed by rXCI (Fig. 2a). Strikingly, we found that cells of the extraembryonic lineages maintained XCU along with iXCI (Fig. 2d) whereas female EPI cells residing in reactivated XaXa state (XCR state) lacked XCU regardless of the embryonic age or inferred developmental pseudotime of the cells (Fig. 2d and Supplementary Fig. 2d–g), importantly demonstrating erasure of XCU in vivo. This was followed by a second wave of XCU in cells where rXCI was either underway or completed (Fig. 2d and Supplementary Fig. 2d, f), confirming that XCU regulation is highly dynamic also in vivo.
A quantitative relationship between X-upregulation and X-inactivation
Next, we investigated whether XCU responded quantitatively to the lowering of expression from the inactivating allele during XCI progression. If trans-acting factors gradually shift towards the Xa allele upon XCI, as hypothesized from our previous work23,24, XCU would not show an on/off pattern but would “tune” expression according to the cellular degree of XCI. To ensure that such tuning-like dynamics could be correctly measured on the allele-level across different levels of allelic imbalance, we constructed and sequenced mock libraries of equimolar concentration containing various spiked ratios of purified C57 and CAST RNA (Methods), indeed demonstrating that allelic ratios could be faithfully captured down to around 100,000 reads per sample or with as few as 100 genes using UMIs as well as read counts (Supplementary Fig. 3a, b).
As rXCI is an asynchronous process14,22, it represents an ideal system to evaluate the XCU modus. Indeed, as rXCI was established in EPI cells, the other allele displayed corresponding chromosome-wide compensation in accordance with a tuning-like mode (adjusted R2maternal = 0.80, Pmaternal = 2.92 × 10−69, adjusted R2paternal = 0.91, Ppaternal = 1.19 × 10−103, linear regression; Fig. 3a). This tuning-like behavior was further confirmed in independent UMI-count data of 2i withdrawal in mESCs19 (adjusted R2C57 = 0.84, adjusted R2CAST = 0.86, P < 2.2 × 10−16, linear regression; Supplementary Fig. 3c). Strikingly, variability of iXCI completeness in extraembryonic lineages and preimplantation stages also followed the trend projected from postimplantation EPI cells (Fig. 3a and Supplementary Fig. 3d), suggesting that the first iXCI-associated wave of XCU is achieved by the same mechanism as the second rXCI-associated wave. Together, these findings provide evidence for continuous feedback of XCU. Surprisingly, genes known to escape XCI followed similar tuning-like trends as other X-linked genes in female cells (Pmaternal = 0.62, Ppaternal = 0.71, Kolmogorov–Smirnov test; Fig. 3b, c). Furthermore, we observed that the subset of escapee genes having ancestral homologs remaining on the Y chromosome25 lacked dosage compensation in males whereas the corresponding X-linked homologs displayed XCU in female cells of all lineages (Males: P > 0.05, Females: PEPI = 6.02 × 10−30, PVE = 3.91 × 10−15, PExE = 4.84 × 10−2, one-sample Wilcoxon test, µ = 0.5; Fig. 3d and Supplementary Fig. 3e). Therefore, the combined effect of biallelic expression and gene-specific XCU may explain why escapee genes tend to be expressed at higher levels in females1 but at the lower output from the inactive X allele26,27,28. How could the tuning-like action of XCU have been overlooked in previous studies? We speculated that this was due to the lack of allele-resolution measurements of dosage compensation5,6,7,8,9,11,19,29. Indeed, all approaches we tested on our data utilizing total expression levels (total X expression, X:Autosomal ratios, Female:Male ratios) failed to identify tuning-like XCU as all non-XaXa cell states produced similarly balanced expression (Supplementary Fig. 3f–h), explicitly exposing the risk of inferring allelic processes from nonallelic measurements.
Steady-state mRNA levels are determined by synthesis and degradation. We have previously shown that expression-matched X-linked- and autosomal transcripts have similar decay rates24 and others have associated steady-state XCU with increased transcriptional initiation8,9,30, suggesting that XCU is primarily controlled at the level of transcription. Indeed, when dissecting expression levels into kinetic parameters of transcriptional bursting (two-state model [on/off]; burst frequency [kon] and burst size [ksyn/koff]; Supplementary Fig. 3i), we previously found XCU to be driven by increased transcriptional burst frequency24,31. To test whether bursting patterns underlie the tuning-like mode of XCU, we inferred allele-level kinetic parameters from our mESCs using molecule- (UMI) and read-count (TPM) Smart-seq3 data (Methods). The two XaXa alleles displayed moderate and balanced burst frequency (kon) whereas all states of monoallelic X-chromosome expression (i.e., XCU states) displayed markedly increased burst frequency (FDRTPM ≤3.97 × 10−10, FDRUMI ≤5.68 × 10−03, FDR-corrected Mann–Whitney U-test; Fig. 3e) whereas burst size (ksyn/koff) remained largely unchanged (FDR ≥0.08, Supplementary Fig. 3j). Next, we inferred transcriptional kinetics during rXCI establishment in vivo and indeed found that burst frequency on the Xa allele was progressively increased in a tuning-like manner in epiblasts (Fig. 3f and Supplementary Fig. 3k). Interestingly, the Xi allele lost burst frequency at a higher rate than gained at the Xa allele (Supplementary Fig. 3l), suggesting that residual mRNA molecules help buffer allelic imbalance. Male cells and extraembryonic cells subject to iXCI similarly displayed elevated burst frequency on the hyperactive X allele (Fig. 3f), pointing towards allelic tuning by transcriptional bursting as a general mechanism of XCU.
The epigenetic state of X-upregulation
With our new insights into allele-specific XCU dynamics, we sought to directly contrast the epigenetic state of the hyperactive and non-hyperactive X allele for the first time. To do so at allelic and cellular resolution, we utilized a single-cell multi-modal profiling assay combining Smart-seq3 and scATAC-seq in parallel for the same cell (Supplementary Data 1). We applied this method to our mESC priming model throughout rXCI and XCU establishment (Fig. 4a and Supplementary Fig. 4a), providing the first combined scRNA/ATAC-seq profiling with an allelic resolution to our knowledge. As expected, the mESC-to-EpiSC transition was accompanied by distinct changes in chromatin accessibility of pluripotency and differentiation markers (Fig. 4b, c), matching differential expression in our initial Smart-seq3-only experiment (Odds ratio = 2.66, P = 8.34 × 10−15, Fisher’s exact test). Next, we integrated single-cell expression and chromatin accessibility measurements, which further confirmed the agreement between the RNA and DNA modalities (Fig. 4d and Supplementary Fig. 4b). To assess whether the combined scRNA/ATAC-seq assay could accurately detect gene- and allele-specific accessibility, we investigated imprinted autosomal genes32 which showed skewing to the expected parental alleles in both expression and accessibility (Fig. 4e, f). Next, we grouped cells into different XCI states based on the allelic expression of the RNA modality, which demonstrated concurrent loss of chromatin accessibility on the Xi allele (Fig. 4g) whereas autosomes remained biallelically accessible (Supplementary Fig. 4c). Surprisingly, unlike the distinct shift in X-linked RNA levels, accessibility of the active allele was not notably increased upon XCU neither in male nor female cells (Supplementary Fig. 4d). This was also the case when matching expression and accessibility for the same gene where only RNA-level upregulation was observed relative to the biallelic XaXa state, regardless of the degree of XCU (Supplementary Fig. 4e, f), indicating that the transcriptional action of XCU transpires on a basal state of chromatin openness. To address this further, we reanalyzed allele-resolved native histone ChIP-seq (H2AK119Ub, H3K27me3, H3K4me1, H3K4me3, H3K27ac, H3K9ac, and H4ac) timeseries data for mESCs with DOX-inducible XCI33. In agreement with our combined scRNA/ATAC-seq data, the number of enriched regions for all histone modifications remained constant on the Xa allele throughout XCI/XCU progression (Supplementary Fig. 4g), as did modification density at both promoters and enhancers (Supplementary Fig. 4h). To explore potential spatial patterns of XCU accessibility, we separated alleles based on the degree of XCI completion (per modality) which indeed revealed unchanged accessibility across the Xa allele whereas Xi accessibility was preferentially lost at regions gaining the heterochromatin histone modification H3K27me3 (Fig. 4h). This corroborates our initial finding that the chromatin accessibility landscape remains nearly unchanged upon XCU. Intrigued by these observations at the chromatin level, we hypothesized that the burst-frequency-driven XCU may not be modulated by enhancer state per se but through enhancer-to-gene contacts34,35. To explore this, we analyzed allele-resolved timeseries in situ Hi-C data for mESCs undergoing differentiation for up to 10 days36 (Methods). Unlike the Xi allele, that assumes a distinct bipartite mega-domain structure upon XCI36,37,38, Xa retained its global long-range chromosome conformation (Supplementary Fig. 5a). However, the shorter-range chromatin domains on Xa became increasingly distinct as the cells underwent XCU (Supplementary Fig. 5b, c), suggesting that XCU is associated with increased chromatin contacts, consistent with the observed increase in transcriptional burst frequency. While this observation is preliminary and warrants further study, we do note that the Xa allele has also been found to be more structured in cells with XCI in two recent studies39,40, in agreement with our findings on XCU.
Discussion
In this study, we uncovered unanticipated flexibility of XCU in controlling X-linked expression, with fundamental implications to our understanding of dosage compensation. By tracing expression levels at allelic resolution in single cells during early murine embryonic development, we identified key sex- and lineage-specific events and timing for initiation, maintenance, erasure, and reestablishment of allelic X-upregulation to compensate otherwise imbalanced RNA levels. Notably, we showed that XCU is achieved by transcriptional burst frequency increase as a universal kinetic drive, co-occurring with increased chromatin compartmentalization. In contrast to the tightly controlled initiation of XCI20, we found that XCU acts in a tuning-like manner as a direct response to imbalanced X dosage, i.e., requiring ZGA or XCI for establishment in XY males and XX females, respectively (Fig. 2b, c). This is distinct from previous reports suggesting XCU be active already in the zygote5 or progressively after the four-cell stage in both sexes11,12. These discrepancies may be explained by the inability of previous nonallelic gene expression measurements to correctly distinguish XCU in the presence of parallel confounding allelic processes such, as ZGA and XCI, whereas our present study directly attributes XCU to the active X allele. Our surprising finding that female cells with biallelic expression (naïve female mESCs, four-cell embryos, and uncommitted epiblasts; XaXa state) lack previously assumed XCU7,8,41 is in line with observations of balanced X:A dosage in haploid cells (including MII oocytes; Fig. 2b) or other cellular states harboring two active X chromosomes, such as primary oocytes and primordial germ cells5,29,42,43. Furthermore, the gene expression dosage of two Xa alleles is known to be incompatible with sustained embryonic development20,44 which fits with the observation of incomplete compensation by XCU at the RNA level (~1.6-fold) reported by us and others5,6,7,8,9,24,30 (Fig. 1). Using allele-resolved chromatin analyses, including joint scRNA/ATAC-seq, we characterized epigenetic features of XCU. In contrast to readily observable differentiation- and XCI-related changes, we found the broad chromatin landscape of XCU to be unaltered which, in combination with the increased transcriptional burst frequency upon XCU, may explain why active histone modifications do not scale proportionally with RNA levels on the hyperactive X allele9,30.
These revelations prompt a revised model of the sequence of events by which mammalian X-chromosome dosage compensation is achieved in the presence of other allelic processes during development (i.e., ZGA, iXCI, XCR, and rXCI; Fig. 5a). This conflicts with the widely-held belief that XCI acts on hyperactive Xa alleles in females7,8,41 (“Default XCU” model; Fig. 5b top). Instead, our data fit a model where the two Xa alleles are moderately expressed and that XCU gradually tunes Xa expression levels throughout XCI proportional to dosage imbalances (“Elastic XCU” model; Fig. 5b bottom). The transcription-driven XCU we observe may act on top of other layers of regulation at post-transcriptional- and translational levels1, that could be now be investigated in detail as the XCU timing and dynamics are mapped. Although our current data do not pinpoint the upstream regulatory events and elements of the burst-frequency-driven XCU, it is worthwhile to hypothesize on their nature. Our findings hint that increased transcription factor concentrations at the hyperactive Xa allele could play a role in its regulation. Not only would the single X allele in XY males and XO females be subjected to higher doses of factors transcribed from diploid autosomes, but increased chromatin contacts might also increase local transcription factor concentrations through loop-mediated trapping45. As transcriptional burst frequency is controlled by both local factor concentrations46 and enhancer-promoter contacts31,34,35, this could result in the increased transcriptional burst frequency in XCU we observe (Fig. 3e, f). Furthermore, the same model can operate for XCI in XX females as transcription factors are rapidly excluded from the Xi allele when repressive compartments are established47. As XCI is a gradual process17, factor complexes could progressively shift to the Xa allele in line with the tuning-like mode of XCU. Collectively, this would result in balanced RNA levels, explaining how the total dosage of X-encoded transcripts are maintained so surprisingly stable throughout development and XCI progression in mammalian species5,7,10,11,48 and why XCI escapees are expressed at higher levels from the Xa allele26,27,28 whereas X-Y homologs are not dosage compensated in males as both the X- and Y-chromosome is active (Fig. 3d and Supplementary Fig. 3e). As two active X chromosomes are considered a hallmark of the naïve female stem cell state and a gold standard in reprogramming studies49, our finding that the biallelic XaXa state lacks XCU alters the interpretation of X-chromosome expression level measurements for assessment of reprogramming success and naïveness.
In conclusion, our study exposes X-upregulation as a remarkably elastic process that tunes RNA dosage throughout development, leading to a refined model of mammalian dosage compensation by unifying the allele-specific dynamics of X-inactivation and X-upregulation in achieving balanced transcript-copy numbers in the cell.
Methods
Animal housing and ethics statement
Mice were housed in specific pathogen-free at Comparative Medicine Biomedicum (KM-B), Karolinska Institutet, according to Swedish national regulations for laboratory animal work food and water ad libitum, cage enrichment, and 12 h light and dark cycles. All animal experimental procedures were performed in accordance with Karolinska Institutet’s guidelines and approved by the Swedish Board of Agriculture (permits 17956-2018 and 18729-2019 Jordbruksverket).
Derivation and culturing of cell lines
Male and female cell lines were established as previously described in ref. 14. In brief, mESCs were derived from E4 blastocysts of F1 embryos (female C57BL/6 J × male CAST/EiJ) and adapted to 2i condition by growing them in gelatin-coated flasks in N2B27 medium (50% neurobasal medium [Gibco], 50% DMEM/F12 [Gibco], 2 mM l-glutamine [Gibco], 0.1 mM β-mercaptoethanol, NDiff Neuro-2 supplement [Millipore], B27 serum-free supplement [Gibco]) supplemented with 1000 units/mL LIF, 3 µM Gsk3 inhibitor CT-99021, 1 µM MEK inhibitor PD0325901, and passaged with accutase [Gibco]. To induce differentiation toward EpiSCs, mESCs grown in serum/LIF were plated on FBS coated tissue culture plates (coated overnight at 37 °C) in N2B27 medium supplemented with 8 ng/mL Fgf2 (R&D) and 20 ng/mL Activin A (R&D) at a cell density of 1 × 104 cells/cm2 and cultured for up to 7 days. Cells were split and replated in the same condition after 1, 2, 4, and 7 days of differentiation. At each split, an aliquot of cells was collected for single-cell sorting into 96-well plates containing Smart-seq3 lysis buffer.
Single-cell RNA sequencing (Smart-seq3)
scRNA-seq libraries were constructed as previously described in ref. 15 with slight modification. Briefly, cells were single-cell sorted into 96-well low-bind PCR-plates [Eppendorf] containing 3 µl of lysis buffer (0.5 units/µl RNase inhibitor [Takara], 0.15% Triton X-100 [Sigma], 0.5 mM (each) dNTP [Thermo Scientific], 1 µM oligo-dT primer [5′-biotin-ACGAGCATCAGCAGCATACGAT30VN-3′; IDT], 5% PEG [Sigma]). Sorting was performed using an SH800 [Sony]. Plates were briefly centrifuged immediately after sorting, sealed, and stored at −80 °C. For cell lysis and RNA denaturation, plates were incubated at 72 °C for 10 min and immediately placed on ice. Next, 5 µl of reverse transcription mix (50 mM Tris-HCl, pH 8.3 [Sigma], 75 mM NaCl [Ambion], 1 mM GTP [Thermo Scientific], 3 mM MgCl2 [Ambion], 10 mM DTT [Thermo Scientific], 1 units/µl RNase inhibitor [Takara], 2 µM of template-switch oligo [5′-biotin-AGAGACAGATTGCGCAATGNNNNNNNNrGrGrG-3′; IDT], and 2 U/µl of Maxima H-minus reverse transcriptase [Thermo Scientific]) was added to each sample. Reverse transcription was carried out at 42 °C for 90 min followed by ten cycles of 50 °C for 2 min and 42 °C for 2 min and the reaction was terminated at 85 °C for 5 min. PCR pre-amplification was performed directly after reverse transcription by adding 17 µl of PCR mix (containing DNA polymerase, forward and reverse primer) bringing the final concentration in the 25 µl reaction to 1x KAPA HiFi ReadyMix [Roche] 0.1 µM forward primer [5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGATTGCGCAATG-3′; IDT] and 0.1 µM reverse primer [5′-ACGAGCATCAGCAGCATACGA-3′; IDT]). Thermocycling was performed as follows: 3 min at 98 °C, 22 cycles of 20 s at 98 °C, 30 s at 65 °C and 6 min at 72 °C, and final elongation at 6 min at 72 °C. After PCR pre-amplification, samples were purified with AMpure XP beads [Beckman Coulter] at a volume ratio of 0.8:1. Library size distributions were monitored using high-sensitivity DNA chips (Agilent Bioanalyzer 2100) and cDNA concentrations were quantified using the Quant-iT PicoGreen dsDNA Assay Kit [Thermo Scientific]. cDNA was subsequently diluted to 100–200 pg/µl.
Tagmentation was performed using in-house produced Tn550. Two nanograms of cDNA in 5 µl water was mixed with 15 µl tagmentation mix (0.2 µl Tn5, 2 µl 10x TAPS MgCl2 Tagmentation buffer; 5 µl 40% PEG-8000; 7.8 µl water, per reaction) and incubated 8 min at 55 °C in a thermal cycler. Tn5 was inactivated and released from the DNA by the addition of 4 µl 0.2% SDS and 5 min incubation at room temperature. Library amplification was performed by adding 5 µl mix of 1 µM of forward and reverse custom-designed Nextera index primers [forward: 5′-CAAGCAGAAGACGGCATACGAGATNNNNNNNNNNGTCTCGTGGGCTCGG-3′, reverse: 5′-AATGATACGGCGACCACCGAGATCTACACNNNNNNNNNNTCGTCGGCAGCGTCIDT-3′, where N represents the 10-bp index bases; IDT] and 15 µl PCR mix (1 µl KAPA HiFi DNA polymerase [Roche]; 10 µl 5× KAPA HiFi buffer; 1.5 µl 10 mM dNTPs; 3.5 µl H2O, per reaction), and thermal cycling: 3 min 72 °C, 30 s 95 °C, 13 cycles of 10 s 95 °C; 30 s 55 °C; 30 s 72 °C, followed by final elongation at 5 min 72 °C; 4 °C hold. DNA sequencing libraries were purified using 0.8:1 volume of AMPure XP beads [Beckman Coulter]. Libraries were sequenced using NextSeq 550 and High Output kits [Illumina].
Single-cell joint accessibility and RNA expression (scATAC + Smart-seq3)
The joint single-cell ATAC and Smart-Seq3 analysis was performed as in an early version of Smart3-ATAC51. Briefly, single cells were FACS sorted into 384 well plates containing 3 μl lysis buffer (0.03 ul 1 M Tris-pH7.4, 0.0078 μl 5 M NaCl, 0.075 μl 10% IGEPAL, 0.075 μl RNase Inhibitor, 0.075 μl 1:1.2 M ERCC, 2.7372 μl H2O). Immediately after sorting, plates were centrifuged at 1800 × g 4 °C 5 min, placed on ice 5 min, vortexed 3000 RPM 3 min, and centrifuged again at 1800 × g 4 °C 5 min to lyse the cells and spin down the nucleus. Two microliters of the supernatant was then carefully moved to a new 384 well plate for Smart-seq3 mRNA library preparation and the nucleus remained in the original well for scATAC library preparation. The scATAC in situ tagmentation was performed with 2 μl of Tn5 tagmentation mix (0.06 μl 1 M Tris-pH 8.0, 0.0405 μl 1 M MgCl2, 2 μl Tn5) and incubating at 37 °C for 30 min. After the tagmentation, 2 μl of the supernatant was aspirated and the nuclei were then washed once with 10 μl ice-cold washing buffer (0.1 μl 1 M Tris-pH7.4, 0.02 μl 5 M NaCl, 0.03 μl 1 M MgCl2, 9.85 μl H2O). The remaining Tn5 was inactivated by the addition of 2 μl 0.2% SDS-Triton X-100 followed by incubation at room temperature for 15 min and 55 °C for 10 min. Barcoding PCR was done by KAPA HiFi PCR Kit [Roche] in a final 25 μl reaction (11.5 μl H2O, 5 µl 5X reaction buffer, 0.75 μl dNTP, 0.5 μl KAPA HiFi DNA Polymerase, 2 μl barcoding primers). The thermal cycling program was 15 min 72 °C, 45 s 95 °C, 22 cycles of 15 s 98 °C; 30 s 67 °C; 1 min 72 °C; followed by final elongation at 5 min 72 °C; 4 °C hold. After PCR, 2 μl of each well was pooled and cleaned-up twice using AMPure XP beads (at 1.3X volume).
Smart-seq3 was performed as described above but with 28 PCR cycles to amplify cDNA. Libraries were sequenced as described above on a NextSeq 550.
Allelic RNA dilution series
Liver tissue was isolated from 12-week-old male C57BL6/J and CAST/EiJ mice, dissected into 1–2 mm2-wide samples, and transferred to 1 ml TRIzol isolation reagent (Thermo Scientific). The samples were thoroughly homogenized in TRIzol using a metallic tissue grinder before proceeding to RNA extraction. RNA extraction was performed according to the manufacturer’s instructions. In brief, 0.2 ml chloroform was added to 1 ml of TRIzol and the samples were vigorously shaken. The samples were incubated at room temperature and the RNA-containing upper aqueous phase was isolated and precipitated with 0.5 ml isopropyl alcohol. The samples were incubated at room temperature for 10 min and centrifuged at 12,000 × g at 4 °C for 10 min. The pellet was washed once with 75% ethanol and the RNA pellet was air-dried for 10 min. The RNA pellets were resuspended in 50 µl of RNase-free water and incubated in a heat block at 55–60 °C for 15 min before measuring the concentration using a Nanodrop 2000 spectrophotometer. RNA from pure C57BL6/J and CAST/EiJ strains was combined at varying ratios (0, 12.5, 25, 37.5, 50, 62.5, 75, 87.5, and 100% C57) for a total of 200 pg RNA which was subjected to Smart-seq3 with slight modification from above. Briefly, tagmentation was done using 0.1 µl Nextera XT ATM Tn5 [Illumina] for 10 min, index primers were used as 0.2 µM each and a 0.6:1 AMPure XP bead ratio was used for cleanup.
Smart-seq3 data analysis
A hybrid mouse genome index was constructed by N-masking the reference genome (GRCm38_68) for CAST/EiJ SNPs from the Mouse Genomes Project (mgp.v5.merged.snps_all.dbSNP142)52 using SNPsplit (0.3.2)53. Raw Smart-seq3 data was processed using zUMIs (2.8.0)54. Briefly, sample barcodes were filtered and data was aligned with STAR (2.7.2a)55 [options: --clip3pAdapterSeq CTGTCTCTTATACACATCT] and reads were assigned to both intron and exon features (Mus_musculus.GRCm38.97.chr.gtf) using FeatureCounts56. Next, barcodes were collapsed using 1 hamming distance and gene expression was calculated for both reads and UMIs. Finally, allele-level expression was calculated from the zUMIs output as previously described (github.com/sandberg-lab/Smart-seq3/tree/master/allele_level_expression)15. Seventy-seven cells were excluded due to low read depth (>3 MADs) in the original experiments and 486 cells were excluded due to low read depth in the multi-omics experiments.
Processing of published C57BL/6J × CAST/EiJ scRNA-seq data
Preprocessed Smart-seq data for MII oocyte—16-cell stages were obtained21 and two cells were excluded (8cell_8-3 and 16cell_4-2) due to potential sample problems (high paternal X ratio in male and non-ZGA with clustering together with Zygote samples, respectively). Preprocessed blastocyst Smart-seq2 data were obtained21 and 18 late blastocysts were excluded due to low read depth (>3 MADs). Preprocessed Smart-seq2 data for postimplantation embryos was obtained14,22.
Genes escaping X-inactivation and X-Y homolog gene lists
A list of known mouse escapee genes (1810030O07Rik, 2010000l03Rik, 2010308F09Rik, 2610029G23Rik, 5530601H04Rik, 6720401G13Rik, Abcd1, Araf, Atp6ap2, BC022960, Bgn, Car5b, D330035K16Rik, D930009K15Rik, Ddx3x, Eif1ax, Eif2s3x, Fam50a, Flna, Ftsj1, Fundc1, Gdi1, Gemin8, Gpkow, Huwe1, Idh3g, Igbp1, Ikbkg, Kdm5c, Kdm6a, Lamp2, Maged1, Mbtps2, Med14, Mid1, Mmgt1, Mpp1, Msl3, Ndufb11, Nkap, Ogt, Pbdc1, Pdha1, Prdx4, Rbm3, Renbp, Sh3bgrl, Shroom4, Sms, Suv39h1, Syap1, Tbc1d25, Timp1, Trap1a, Uba1, Usp9x, Utp14a, Uxt, Xist, and Yipf6) was compiled from previous work11,27,32,57 and excluded from analyses where specified.
A list of ancestral X-Y homologs was obtained25 and sequence similarity of expressed X-Y homolog transcripts was calculated using the nucleotide BLAST webservice58 using default settings (performed 2020-03-24). The ancestral homologs had an average sequence similarity of 83% and were detected in a maximum of 2.5% female cells whereas maximum male-specific detection was 98.3%, indicating that X-Y homologs show correct sex-specific mapping.
Expression calculations
Allelic reference ratios were calculated as CountsC57/CountsTotal after exclusion of the top 10% expressed X-linked genes to avoid bias from highly expressed X genes. XCI status was determined as active (Xa/Xa), semi-inactivated (Xa/Xs) or fully inactivated (Xa/Xi) for allelic ratios in the intervals (0.4, 0.6), [0.6, 0.9), and [0.9, ∞) and the inverse, respectively. Due to differences in X-controlling elements (Xce) between C57 and CAST strains, the C57 allele is preferentially inactivated during random XCI32,59,60,61 which is observable in our data as the number of XaC57XiCAST vs. XiC57XaCAST cells. TPM was calculated for gene i in cell j as TPMij = (FPKMi / ∑j FPKMj) × 106 and allelic TPM was calculated by scaling TPM by reference ratios per gene and cell. Relative UMI counts were calculated relative to total UMIs per sample (percentage of total counts). A gene was considered expressed in a dataset/lineage if the average TPM expression was >0. To obtain a robust estimate for cell-level expression, 20% trimmed means was calculated per cell.
Expression ratios
Chromosome:Autosomal expression ratios were calculated for expressed genes (>1 TPM)7,8,30 as relative to median of autosomes after excluding genes that escape XCI. Additionally, a bootstrapping method11 was used to account for a different number of genes between chromosomes. For bootstrapped ratios, random autosomal gene sets of the same size as the test set were selected as a background, repeated n = 103 times.
Female:male ratios were calculated after exclusion of X escapees as TPM relative to gene average per embryonic day and lineage, as well as for the active X allele for allelic data.
Differential expression of scRNA-seq data
For Smart-seq3 data, global count data was size-factor normalized using scater (1.12.2)/scran (1.12.1)62 and genes expressed in >10% of cells were kept. Differential expression was calculated along days of differentiation as a continuous variable using likelihood ratio tests as implemented in MAST (1.10.0)63. Gene set enrichment for mouse GO biological process gene sets obtained from MGI (http://www.informatics.jax.org/downloads/reports/index.html#go; accessed 2020-06-23) was performed on the differential expression model against a bootstrapped (n = 100) control model as implemented in MAST.
Dimensionality reduction, clustering, and trajectory inference
For Smart-seq3 data, highly variable genes (HVGs) were identified from size-factor-normalized counts using scater/scran and ordered by biological variance and FDR. Data was visualized for the top 1000 HVGs using diffusion maps64.
For in vivo blastocyst data, HVGs were obtained as explained above and dimensionality reduction was performed for top 1000 HVGs using UMAP65 and cells were Louvain clustered based on top 1000 HVG ranks using scran.
For in vivo postimplantation data, pseudotime trajectories were inferred using Slingshot (1.2.0)66 from normalized counts following Mclust (5.4.5)67 clustering on diffusion map coordinates.
Kinetics inference
Missing allelic data points were set to 0 if the gene was detected on the other allele. Kinetic parameters were calculated per lineage/genotype/XCI status/growth condition (depending on dataset) using txburst (github.com/sandberg-lab/txburst)31. Genes not passing filtering steps were excluded and relative burst frequency (kon) and burst size (ksyn/koff) was calculated relative to the median of expressed autosomal genes (per lineage). Only groups with >20 cells were kept to increase the reliability of the statistical inference.
scATAC data analysis
Raw fastq files were tagged with cell names using GNU sed (4.4), quality and adapter trimmed using fastp (0.20.0)68, and aligned to the N-masked reference genome using bowtie2 (2.4.1)69 [options: --very-sensitive -N 1 -X 2000 -k 10]. Duplicates were marked using biobambam2 (2.0.87; gitlab.com/german.tischler/biobambam2) and data were merged and split into respective alleles using SNPsplit (0.3.2)53 [options: --paired --no_sort]. Downstream processing and analysis was performed using ArchR (1.0.1)70. Briefly, non-duplicate properly paired and mapped primary alignments were loaded and mitochondrial and Y chromosomes were excluded. Nonallelic data were filtered for cells with at least 1000 fragments and a minimum TSS enrichment of four and predicted doublets were excluded for a total of 437 excluded cells. Enriched peaks were called grouped by differentiation day using the ArchR implementation of Macs271. Expression data were integrated using Smart-seq3 UMI counts, and dimensionality reduction was performed using the ArchR LSI implementation (default parameters for scATAC and top 1000 variable features based on the variance-to-mean ratio for scRNA) and visualized using UMAP with 15 nearest neighbors. Accessibility X:A ratios was calculated as meanchrX/meanAutosome on ArchR gene scores to account for the binary nature of scATAC-seq. Allelic data was filtered to match the cells in the nonallelic data and allelic count matrices were recalculated. Allelic ratios per cell was calculated using the paired expression data, as described above, and X-inactivation completion was calculated per modality as |0.5–(CountsC57/CountsTotal)|/0.5. A list of known imprinted mouse genes (Sgce, Peg3, Peg12, Plagl1, Zrsr1, Peg13, Airn, Impact, Nckap5, Dlx5, Gm5422, Grb10, Meg3, Rian, Mirg, Igf2r, Igf2, and H19) was obtained32 and only genes detected by both scRNA-seq and scATAC-seq was kept for analysis.
Re-analysis of published data
Raw bulk RNA-seq data including both male and female mESCs was obtained18,72 and quantified to protein-coding transcripts from GENCODE vM2273 using pseudoalignment with Salmon (0.14.1)74. Transcript abundance estimates were summarized to gene-level using tximport (1.12.3)75 and differential expression was calculated using likelihood ratio tests in DESeq2 (1.24.0)76. A full model including cellular genotype (XX, XY, or XO), cell culture condition (2i or serum) and study accession was tested against a reduced model without the genotype term. Lowly expressed genes (<100 average normalized counts) were excluded from the final plots. Serum growth conditions only showed a minor effect on global X expression compared to 2i (not shown), consistent with a low degree of cells exhibiting complete XCI14.
Preprocessed scRNA-seq data for wild-type and XistpatΔ knockout embryos was obtained;11 processed data from GSE80810. Briefly, allelic reference ratios were used to split normalized expression values (RPRT reads per retro-transcribed length per million mapped reads) into alleles and average expression per cell was calculated using trimmed means for expressed genes (average total RPRT >0) as described above.
Preprocessed scRNA-seq data for mESCs adapting to serum/LIF conditions were obtained;19 processed data from GSE151009. Briefly, Total UMI count matrices were size-factor normalized as described above and allelic UMI count matrices were used as is. Reference ratios and XCI status was calculated as described above.
Raw native ChIP-seq data for hybrid mESCs was obtained33, quality and adapter trimmed using fastp (0.20.0)68, aligned to a CAST/EiJ N-masked genome using Bowtie2 (2.4.1)69 [options: -N 1] and aligned data was split to respective alleles using SNPsplit (0.3.2)53. Peaks were called against Input samples as controls using MACS2 (2.1.2)71 [options: --broad --broad-cutof 0.01 -f BAMPE -g 2652783500], peaks overlapping ENCODE v2 problematic regions77 were excluded and consensus peaks were defined as peaks with a reciprocal overlap of at least 25% of peak width between replicates. Normalized coverage was calculated using deepTools (3.3.0)78 [options bamCoverage –normalizeUsing RPGC –effectiveGenomeSize 2652783500 --skipNAs --ignoreDuplicates --centerReads --blackListFileName mm10-blacklist.v2.ENSEMBL.bed.gz] and profiles were calculated using deepTools for chrX TSSs (Mus_musculus.GRCm38.97.chr.gtf) [options: computeMatrix reference-point --upstream 5000 --downstream 5000 --skipZeros --nanAfterEnd] or chrX enhancers (intersection between H3K4me1 and H3K27ac consensus peaks excluding peaks withing 1 kb of a TSS) [options: computeMatrix reference-point --referencePoint center --upstream 5000 --downstream 5000 --skipZeros] and resulting coverage profiles were normalized against Input.
Raw in situ Hi-C data for hybrid mESCs was obtained36. An N-masked dual hybrid mouse genome index was constructed for 129S1/svImJ × CAST/EiJ (mgp.v5.merged.snps_all.dbSNP142) using SNPsplit (0.3.2)53 and further in silico MboI digested for downstream tools. Data were aligned using HiCUP (0.8.0) [options: --bowtie2 --shortest 50 --longest 700 --digest –zip]79 then split into alleles using SNPsplit, converted to juicer format using HiCUP (hicup2juicer), and reads uniquely corresponding to either allele (requiring at least one mate to map to the genome) were merged. Hi-C contact maps were generated using juicer tools pre (1.22.01) [options: -d -f -q 10 -r 1000000]80 and KR-normalized 1 Mb matrices were extracted using juicer tools (dump observed/Pearsons/eigenvector). Observed contact counts were further normalized for sequencing depth (per million) per chromosome and timepoint.
Statistics and data visualization
All statistical tests were performed in R (3.6.1) as two-tailed unless otherwise stated. Heatmaps were visualized using ComplexHeatmap (2.0.0)81 and all other plots were made using ggplot2 (3.2.1)82. Box plots are presented as median, first and third quartiles, and 1.5x inter-quartile range (IQR). For median ± confidence interval plots, bootstrapped 95% confidence intervals (n = 1000) were calculated using the percentile method83 as implemented in the R boot package (1.3-23).
Reporting Summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
Raw and preprocessed data generated in this study have been deposited in ArrayExpress database under accession codes E-MTAB-9324 (Smart-seq3), E-MTAB-10709 (Allelic dilution series), and E-MTAB-10714 (Combined Smart-seq3+scATAC). Previously published raw data is available at Gene Expression Omnibus under accessions GSE45719, GSE74155, GSE109071, GSE116480, GSE23943, GSE80810, GSE90516, GSE116649, and GSE151009. Data generated during this study are available at github: github.com/reiniuslab/Lentini_XCU_in_vivo.
Code availability
Code generated during this study are available at github: github.com/reiniuslab/Lentini_XCU_in_vivo.
References
Deng, X., Berletch, J. B., Nguyen, D. K. & Disteche, C. M. X chromosome regulation: diverse patterns in development, tissues and disease. Nat. Rev. Genet. 15, 367–378 (2014).
Ohno, S. Monographs on Endocrinology (Springer, 1967).
Graves, J. A. M. Sex chromosome specialization and degeneration in mammals. Cell 124, 901–914 (2006).
Gribnau, J. & Grootegoed, J. A. Origin and evolution of X chromosome inactivation. Curr. Opin. Cell Biol. 24, 397–404 (2012).
Nguyen, D. K. & Disteche, C. M. Dosage compensation of the active X chromosome in mammals. Nat. Genet. 38, 47–53 (2006).
Gupta, V. et al. Global analysis of X-chromosome dosage compensation. J. Biol. 5, 3 (2006).
Lin, H. et al. Dosage compensation in the mouse balances up-regulation and silencing of X-linked genes. PLoS Biol. 5, 2809–2820 (2007).
Deng, X. et al. Evidence for compensatory upregulation of expressed X-linked genes in mammals, Caenorhabditis elegans and Drosophila melanogaster. Nat. Genet. 43, 1179–1185 (2011).
Deng, X. et al. Mammalian X upregulation is associated with enhanced transcription initiation, RNA half-life, and MOF-mediated H4K16 acetylation. Dev. Cell 25, 55–68 (2013).
Mahadevaiah, S. K., Sangrithi, M. N., Hirota, T. & Turner, J. M. A. A single-cell transcriptome atlas of marsupial embryogenesis and X inactivation. Nature https://doi.org/10.1038/s41586-020-2629-6 (2020).
Borensztein, M. et al. Xist-dependent imprinted X inactivation and the early developmental consequences of its failure. Nat. Struct. Mol. Biol. 24, 226–233 (2017).
Wang, F. et al. Regulation of X-linked gene expression during early mouse development by Rlim. eLife 5, e19127 (2016).
Reinius, B. & Sandberg, R. Random monoallelic expression of autosomal genes: stochastic transcription and allele-level regulation. Nat. Rev. Genet. 16, 653–664 (2015).
Chen, G. et al. Single-cell analyses of X chromosome inactivation dynamics and pluripotency during differentiation. Genome Res. 26, 1342–1354 (2016).
Hagemann-Jensen, M. et al. Single-cell RNA counting at allele and isoform resolution using Smart-seq3. Nat. Biotechnol. 38, 708–714 (2020).
Lin, H. et al. Relative overexpression of X-linked genes in mouse embryonic stem cells is consistent with Ohno’s hypothesis. Nat. Genet. 43, 1169–1170 (2011).
Marks, H. et al. Dynamics of gene silencing during X inactivation using allele-specific RNA-seq. Genome Biol. 16, 9–10 (2015).
Werner, R. J. et al. Sex chromosomes drive gene expression and regulatory dimorphisms in mouse embryonic stem cells. Biol. Sex. Differ. 8, 28 (2017).
Pacini, G. et al. Integrated analysis of Xist upregulation and X-chromosome inactivation with single-cell and single-allele resolution. Nat. Commun. 12, 3638 (2021).
Schulz, E. G. et al. The two active X chromosomes in female ESCs block exit from the pluripotent state by modulating the ESC signaling network. Cell Stem Cell 14, 203–216 (2014).
Deng, Q., Ramsköld, D., Reinius, B. & Sandberg, R. Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells. Science 343, 193–196 (2014).
Cheng, S. et al. Single-cell RNA-seq reveals cellular heterogeneity of pluripotency transition and X chromosome dynamics during early mouse development. Cell Rep. 26, 2593–2607.e3 (2019).
Deng, X. & Disteche, C. M. Rapid transcriptional bursts upregulate the X chromosome. Nat. Struct. Mol. Biol. 26, 851–853 (2019).
Larsson, A. J. M., Coucoravas, C., Sandberg, R. & Reinius, B. X-chromosome upregulation is driven by increased burst frequency. Nat. Struct. Mol. Biol. 26, 963–969 (2019).
Soh, Y. Q. S. et al. Sequencing the mouse y chromosome reveals convergent gene acquisition and amplification on both sex chromosomes. Cell 159, 800–813 (2014).
Carrel, L. & Willard, H. F. X-inactivation profile reveals extensive variability in X-linked gene expression in females. Nature 434, 400–404 (2005).
Yang, F., Babak, T., Shendure, J. & Disteche, C. M. Global survey of escape from X inactivation by RNA-sequencing in mouse. Genome Res. 20, 614–622 (2010).
Tukiainen, T. et al. Landscape of X chromosome inactivation across human tissues. Nature 550, 244–248 (2017).
Sangrithi, M. N. et al. Non-canonical and sexually dimorphic X dosage compensation states in the mouse and human germline. Dev. Cell 40, 289–301.e3 (2017).
Yildirim, E., Sadreyev, R. I., Pinter, S. F. & Lee, J. T. X-chromosome hyperactivation in mammals via nonlinear relationships between chromatin states and transcription. Nat. Struct. Mol. Biol. 19, 56–62 (2012).
Larsson, A. J. M. et al. Genomic encoding of transcriptional burst kinetics. Nature 565, 251–254 (2019).
Reinius, B. et al. Analysis of allelic expression patterns in clonal somatic cells by single-cell RNA-seq. Nat. Genet. 48, 1430–1435 (2016).
Zylicz, J. J. et al. The implication of early chromatin changes in X chromosome inactivation. Cell 176, 182–197.e23 (2019).
Bartman, C. R., Hsu, S. C., Hsiung, C. C.-S., Raj, A. & Blobel, G. A. Enhancer regulation of transcriptional bursting parameters revealed by forced chromatin looping. Mol. Cell 62, 237–247 (2016).
Fukaya, T., Lim, B. & Levine, M. Enhancer control of transcriptional bursting. Cell 166, 358–368 (2016).
Froberg, J. E., Pinter, S. F., Kriz, A. J., Jégu, T. & Lee, J. T. Megadomains and superloops form dynamically but are dispensable for X-chromosome inactivation and gene escape. Nat. Commun. 9, 5004 (2018).
Deng, X. et al. Bipartite structure of the inactive mouse X chromosome. Genome Biol. 16, 152 (2015).
Giorgetti, L. et al. Structural organization of the inactive X chromosome in the mouse. Nature 535, 575–579 (2016).
Bauer, M. et al. Chromosome compartments on the inactive X guide TAD formation independently of transcription during X-reactivation. Nat. Commun. 12, 3499 (2021).
Takei, Y. et al. Single-cell nuclear architecture across cell types in the mouse brain. Science 374, 586–594 (2021).
Wang, M., Lin, F., Xing, K. & Liu, L. Random X-chromosome inactivation dynamics in vivo by single-cell RNA sequencing. BMC Genomics 18, 9–10 (2017).
Fukuda, A., Tanino, M., Matoba, R., Umezawa, A. & Akutsu, H. Imbalance between the expression dosages of X-chromosome and autosomal genes in mammalian oocytes. Sci. Rep. 5, 9–10 (2015).
Li, X. et al. Dosage compensation in the process of inactivation/reactivation during both germ cell development and early embryogenesis in mouse. Sci. Rep. 7, 3729 (2017).
Takagi, N. & Abe, K. Detrimental effects of two active X chromosomes on early mouse development. Dev. Camb. Engl. 109, 189–201 (1990).
Cortini, R. & Filion, G. J. Theoretical principles of transcription factor traffic on folded chromatin. Nat. Commun. 9, 9–10 (2018).
Brouwer, I. & Lenstra, T. L. Visualizing transcription: key to understanding gene expression dynamics. Curr. Opin. Chem. Biol. 51, 122–129 (2019).
Chaumeil, J. A novel role for Xist RNA in the formation of a repressive nuclear compartment into which genes are recruited when silenced. Genes Dev. 20, 2223–2237 (2006).
Fan, G. et al. X-chromosome dosage compensation dynamics in human early embryos. Preprint at bioRxiv https://doi.org/10.1101/2020.03.08.982694. (2020).
Augui, S., Nora, E. P. & Heard, E. Regulation of X-chromosome inactivation by the X-inactivation centre. Nat. Rev. Genet. 12, 429–442 (2011).
Picelli, S. et al. Tn5 transposase and tagmentation procedures for massively scaled sequencing projects. Genome Res 24, 2033–2040 (2014).
Cheng, H. et al. Smart3-ATAC: a highly sensitive method for joint accessibility and full-length transcriptome analysis in single cells. Preprint at bioRxiv https://doi.org/10.1101/2021.12.02.470912 (2021).
Keane, T. M. et al. Mouse genomic variation and its effect on phenotypes and gene regulation. Nature 477, 289–294 (2011).
Krueger, F. & Andrews, S. R. SNPsplit: allele-specific splitting of alignments between genomes with known SNP genotypes. F1000Res. 5, 1479 (2016).
Parekh, S., Ziegenhain, C., Vieth, B., Enard, W. & Hellmann, I. zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs. GigaScience 7, giy059 (2018).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinforma. Oxf. Engl. 29, 15–21 (2013).
Liao, Y., Smyth, G. K. & Shi, W. The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Res. 47, e47–e47 (2019).
Reinius, B. et al. Female-biased expression of long non-coding RNAs in domains that escape X-inactivation in mouse. BMC Genomics 11, 614 (2010).
Johnson, M. et al. NCBI BLAST: a better web interface. Nucleic Acids Res. 36, W5–W9 (2008).
Chadwick, L. H., Pertz, L. M., Broman, K. W., Bartolomei, M. S. & Willard, H. F. Genetic control of X chromosome inactivation in mice: definition of the Xce candidate interval. Genetics 173, 2103–2110 (2006).
Thorvaldsen, J. L., Krapp, C., Willard, H. F. & Bartolomei, M. S. Nonrandom X chromosome inactivation is influenced by multiple regions on the murine X chromosome. Genetics 192, 1095–1107 (2012).
Calaway, J. D. et al. Genetic architecture of skewed X inactivation in the laboratory mouse. PLoS Genet. 9, e1003853 (2013).
Lun, A. T. L., McCarthy, D. J. & Marioni, J. C. A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor. F1000Res. 5, 2122 (2016).
Finak, G. et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 16, 9–10 (2015).
Haghverdi, L., Buettner, F. & Theis, F. J. Diffusion maps for high-dimensional single-cell analysis of differentiation data. Bioinformatics 31, 2989–2998 (2015).
McInnes, L., Healy, J. & Melville, J. UMAP: uniform manifold approximation and projection for dimension reduction. Preprint at arvix:1802.03426v3 (2018).
Street, K. et al. Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics. BMC Genomics 19, 477 (2018).
Fraley, C. & Raftery, A. E. Bayesian regularization for normal mixture estimation and model-based clustering. J. Classif. 24, 155–181 (2007).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Granja, J. M. et al. ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis. Nat. Genet. 53, 403–411 (2021).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Marks, H. et al. The transcriptional and epigenomic foundations of ground state pluripotency. Cell 149, 590–604 (2012).
Frankish, A. et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 47, D766–D773 (2019).
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 14, 417–419 (2017).
Soneson, C., Love, M. I. & Robinson, M. D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res. 4, 1521 (2015).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Amemiya, H. M., Kundaje, A. & Boyle, A. P. The ENCODE blacklist: identification of problematic regions of the genome. Sci. Rep. 9, 9354 (2019).
Ramírez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016).
Wingett, S. W. et al. HiCUP: pipeline for mapping and processing Hi-C data. F1000Res. 4, 1310 (2015).
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Gu, Z., Eils, R. & Schlesner, M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847–2849 (2016).
Wickham, H. Ggplot2: Elegant Graphics for Data Analysis Vol. 174 (Springer, 2016).
Davison, A. C. & Hinkley, D. V. Bootstrap Methods and their Application (Cambridge Univ. Press, 1997)..
Acknowledgements
This study was made possible by grants from the Swedish Research Council (2017-01723), the Ragnar Söderberg Foundation (M16/17), the Knut and Alice Wallenberg Foundation (2021.0142), and SRA Stem Cells and Regenerative Medicine (Karolinska Institutet) to B.R. M.E. is supported by The Swedish Cancer Society, The Swedish Childhood Cancer Fund, Radiumhemmets Forskningsfonder, SFO StratRegen, The Swedish Research Council (2020-02940) and Cancer Research KI. A.L. is supported by a postdoctoral fellowship from the Swedish Society for Medical Research. We thank members of the Reinius lab and Colm Nestor for comments on the manuscript, Christoph Ziegenhain for support related zUMIs, and Björn Högberg and Ana Teixeira for use of an Illumina sequencer.
Funding
Open access funding provided by Karolinska Institute.
Author information
Authors and Affiliations
Contributions
A.L.: Investigation, methodology, formal analysis, conceptualization, visualization, data curation, writing —original draft, Writing—review and editing. H.C.: Investigation. J.C.N., N.P., C.C., and N.A.: Investigation. M.E. and Q.D.: Methodology and resources. B.R.: Investigation, methodology, conceptualization, supervision, resources, project administration, funding acquisition, writing—original draft, writing—review and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lentini, A., Cheng, H., Noble, J.C. et al. Elastic dosage compensation by X-chromosome upregulation. Nat Commun 13, 1854 (2022). https://doi.org/10.1038/s41467-022-29414-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-29414-1
This article is cited by
-
Detection of allele-specific expression in spatial transcriptomics with spASE
Genome Biology (2024)
-
Appropriate glycemic management protects the germline but not the uterine environment in hyperglycemia
EMBO Reports (2024)
-
RNA stability controlled by m6A methylation contributes to X-to-autosome dosage compensation in mammals
Nature Structural & Molecular Biology (2023)
-
A fine balancing act: how epitranscriptome regulates dosage compensation in mammals
Nature Structural & Molecular Biology (2023)