Abstract
Free full text
Architectural Proteins, Transcription, and the Three-dimensional Organization of the Genome
Abstract
Architectural proteins mediate interactions between distant sequences in the genome. Two well-characterized functions of architectural protein interactions include the tethering of enhancers to promoters and bringing together Polycomb-containing sites to facilitate silencing. The nature of which sequences interact genome-wide appears to be determined by the orientation of the architectural protein binding sites as well as the number and identity of architectural proteins present. Ultimately, long range chromatin interactions result in the formation of loops within the chromatin fiber. In this review, we discuss data suggesting that architectural proteins mediate long range chromatin interactions that both facilitate and hinder neighboring interactions, compartmentalizing the genome into regions of highly interacting chromatin domains.
Introduction
Multiple lines of evidence indicate that chromosome organization is a contributor to gene expression regulation [1, 2]. The use of 3C-derived approaches to detect intra- and inter-chromosome interactions has led to the observation that individual chromosomes are highly organized structures. Chromatin interactions decrease with increasing linear genomic distance and occur non-randomly across the chromosome length [3-6]. Based on the frequency of these interactions, chromosomes can be divided into distinct regions of highly interacting chromatin, named topologically associating domains (TADs), which engage in few long-range interactions with loci in other TADs [7]. Architectural proteins, also known as insulator proteins, appear to play a critical role in the three-dimensional organization of the genome. Here we discuss known architectural proteins in Drosophila and mammals, and describe evidence suggesting that architectural proteins regulate long range chromatin contacts and ultimately, gene expression. Current results suggest that architectural proteins have two inter-related functions, genome compartmentalization and the facilitation of interactions between regulatory elements. Finally, we end with a discussion of the molecular mechanisms regulating interactions between distant architectural protein binding sites.
Architectural Proteins
The roles of architectural proteins in genome organization and function can be explained by their ability to facilitate the formation of long-range contacts between DNA sequences. In Drosophila, 11 different DNA binding architectural proteins have been identified, each recognizing a unique DNA motif: CCCTC-binding factor (CTCF), Suppressor of Hairy-wing (Su(Hw)), Boundary Element Associated Factor 32 (BEAF-32), DNA Replication Related Element binding Factor (DREF), Transcription Factor IIIC (TFIIIC), Z4 (also called Putzig), Early Boundary Activity DNA-binding Factor (Elba), Pita (also called Spotted dick), Zinc Finger Interacting with CP190 (ZIPIC), Insulator binding factor 1 (Ibf1), and Insulator binding factor 2 (Ibf2) [8-15]. ChIP-seq experiments demonstrating co-occupancy in the genome as well as coimmunoprecipitation studies have demonstrated that DNA binding architectural proteins interact with accessory proteins, which do not recognize specific DNA motifs [9, 10, 16]. The accessory proteins identified in Drosophila include Centrosomal Protein 190 (CP190), Modifier of mdg4 (Mod(mdg4)), Rad21 (a component of the cohesin complex), Cap-H2 (a component of the condensin II complex), the long isoform of Female sterile homeotic on chromosome 1 (Fs(1)h-L), Lethal (3) malignant brain tumor (L3mbt), and Chromator (also called Chriz) [10]. Notably, depletion of either CTCF or CP190 reduced the chromatin interactions in the Abd-B locus by 3C analysis, suggesting that both DNA-binding and accessory architectural proteins can functionally contribute to chromatin looping interactions in cells [17]. Of particular interest, a recent in vitro analysis has provided a model for how DNA-binding and accessory architectural proteins function together to mediate long range chromatin interactions. Purified BEAF-32 protein was capable of binding its DNA motif, but only formed intermolecular interactions between two BEAF-32 motifs in the presence of the accessory proteins Chromator or CP190 [18]. However, how DNA-binding and accessory architectural proteins interact to mediate intrachromosomal interactions within a cell remains to be determined.
The number of architectural proteins characterized in mammals is not as extensive as in Drosophila. Multiple lines of evidence indicate that CTCF and cohesin are mediators of chromatin interactions and thus, architectural proteins. Functional studies evaluating the function of CTCF and cohesin in chromatin looping within individual genomic loci were the first studies to indicate these proteins are regulators of chromatin interactions [19-22]. More recently, CTCF and cohesin depletion studies have shown that loss of these architectural proteins reduces genome-wide chromatin interactions [23-25]. Furthermore, ChIA-PET analysis, a technique that maps the chromatin interactions occurring between loci occupied by a specific protein, characterized a subset of the chromatin interactions that occur between cohesin and CTCF occupied sites, indicative that cohesin and CTCF may play a role mediating chromatin interactions [26, 27]. In addition to CTCF and cohesin, the cohesin interacting proteins Nipbl (the protein responsible for loading cohesin rings onto DNA), and Mediator have also been implicated as mammalian architectural proteins due to their interactions with cohesin and enrichment at enhancer-promoter contact sites [28, 29]. Similar to Drosophila, a series of proteins that co-localize or directly interact with CTCF have been identified in mammals, including Yin Yang 1 (YY1), Kaiso, Chromodomain Helicase-DNA-binding protein 8 (CHD8), Poly ADP-Ribose Polymerase 1 (PARP1), MYC-associated zing-finger protein (MAZ), jun-D proto-oncogene (JUND), ZNF143, nucleophosmin, the PR domain zinc-finger protein 5 (PRDM5), and TFII-I [30, 31]. Through their association with CTCF, it is possible that CTCF interacting proteins also function as architectural proteins but additional experiments are required to address this hypothesis. Notably, there is a growing body of evidence that the CTCF-interacting protein ZNF143 is a mammalian architectural protein. The genomic occupancy of ZNF143 was shown to highly correlate with CTCF sites forming chromatin loops and ZNF143 depletion studies demonstrated a functional role for this protein in mediating long range chromatin interactions [32, 33]. Beyond the cohesin and CTCF interacting proteins, a number of other potential architectural proteins have been characterized as proteins required for chromatin loop formation within specific genomic loci like CHD6 in the CFTR locus or Ldb1 in the - globin locus [34, 35]. However, the significance of CHD6 and Ldb1 as genome-wide regulators of chromatin interactions is not currently known. In summary, these data suggest that, similar to Drosophila, mammals express a wide array of potential architectural proteins important for regulating long range interactions that should be subject to additional characterization.
In mammalian cells, the architectural protein cohesin is an important contributor to the regulation of nuclear size and organization. For example, depletion of cohesin in astrocytes caused an approximately 25% increase in the volume of the entire nucleus, possibly due to a loss of chromatin interactions and interphase chromatin organization [23]. In a reciprocal analysis, deletion of the cohesin regulator Wapl caused excessive cohesin and CTCF occupancy on chromatin and a striking hypercondensation of interphase DNA was observed [36]. It is interesting to speculate that the accumulation of cohesin and CTCF on chromatin resulted in excessive chromatin interactions responsible for the condensation phenotype. However, additional characterization is required to determine if these changes in nuclear morphology were direct effects of altered chromatin interactions.
General Mechanisms of Architectural Protein Function
Many functions of architectural proteins can be explained by their ability to mediate interactions between distant loci and form chromatin loops. Architectural proteins directly binding enhancers and promoters increase the contact frequency between regulatory elements by forming stable protein-protein interactions between them, consistent with the classical model of enhancer-promoter interactions (Figure 1A) [32, 37-40]. In addition, polymer simulations have suggested that an architectural protein interaction can facilitate neighboring interactions by two additional mechanisms [41, 42]. First, regulatory elements that are looped out by two interacting architectural protein-bound loci have higher contact frequencies, indicative that a chromatin loop highly interacts within itself (Figure 1B) [41, 42]. Secondly, the genomic elements flanking a chromatin loop are brought into closer proximity by architectural proteins, reducing the linear genomic distance between them and increasing their contact frequency (Figure 1C) [41]. However, additional evidence is required to determine if the simulation studies are representative of mechanisms of architectural protein-mediated facilitation in cells. In addition to facilitating some interactions, the establishment of a chromatin loop by architectural proteins also precludes other interactions. The original function ascribed to architectural proteins was their ability to insulate promoters from the effect of regulatory sequences such as enhancers, consistent with a function in hindering chromatin interactions [19, 43-51]. Furthermore, a reduction of contact frequency for interactions between sequences located within a chromatin loop and sequences present outside the loop was observed in polymer simulation studies, supporting the notion that an architectural protein-mediated interaction will hinder a subset of neighboring chromatin interactions (Figure 1D) [41]. Together, this data suggests that for a given architectural protein interaction, neighboring chromatin interactions will be either facilitated or inhibited depending on their location with respect to the chromatin loop [41].
The widespread distribution of architectural proteins in the genome raises the question of what regulates which architectural protein binding sites interact to form chromatin loops (Figure 2). Of the nearly 10,000 long range chromatin interactions mapped in human cells, only 68% of the loci anchoring these contacts interact with only one other locus [32]. Thus, nearly one third of the loci involved in long range interactions interact with multiple distant loci [32]. In Drosophila, one factor that determines which loci interact in 3D space is the number of architectural proteins present at specific sites in the genome and the distance between those sites. More specifically, highly occupied sites containing more than 8 architectural proteins appear to mediate interactions at a longer distance and form more stable loops that interfere with interactions between sequences inside and outside of the loops [3, 5, 6, 10, 52, 53] (Figure 3A). In mammals the regulation appears to be more complex, and recent evidence suggests that the orientation of the CTCF protein in the genome could be one critical mechanism regulating chromatin interactions. Two independent reports have shown a striking correlation between the orientation of CTCF motifs in the genome and the ability of these sites to interact. The anchors of the nearly 10,000 mapped chromatin loops in humans strongly correlate with binding of architectural proteins, including CTCF, Smc1, Rad21 and ZNF143[32]. A subset of these chromatin loops (4,322) contain CTCF at both loop anchors, with 92% of loop anchors containing the CTCF motifs in a convergent head-to-head orientation [32]. The distinct orientation of CTCF motifs was also observed at TAD borders (Figure 3A). The 5′ end of TAD domains were shown to contain a CTCF motif on the minus strand, while the 3′ border contains the CTCF motif on the plus strand in mice, monkeys, and dogs, supporting the hypothesis that CTCF orientation is a critical regulator of long range chromatin interactions in mammals [54]. Hadjur and colleagues speculate that the specific orientation of CTCF motifs also uniquely orients cohesin binding because Rad21 specifically interacts with the C-terminus of CTCF [54, 55]. Interestingly, the distinct orientation and spacing of cohesin/CTCF peaks has been characterized previously in primary mouse liver cells [56]. Together, these data suggest that the orientation of the CTCF protein in the genome as well as the number of architectural proteins bound are some of the mechanisms regulating which loci form long range chromatin interactions.
Architectural proteins mark the boundaries between TADs
Individual chromosomes are compartmentalized into TADs (880 kb and 107 kb median size in mouse and Drosophila, respectively) separated by regions called TAD borders or sites on contact insulation, which are depleted of interactions between adjacent sequences on either side of the border (Figure 3A) [3, 6, 23]. FISH analysis demonstrated that loci spanning the same linear genomic distance are more likely to interact in 3D space if they are located in the same TAD rather than in different TADs, indicative that TADs represent regions of highly interacting chromatin [7]. TAD borders are enriched in actively transcribed genes, especially housekeeping genes, and architectural proteins (Figure 3A) [3-7]. In general, TAD borders do not appear to separate regions of the genome with different chromatin environments, as defined by the presence of specific histone modifications (Figure 3B). A single TAD often contains several chromatin types and TAD borders do not change after complete loss of H3K9me2 or H3K27me3 chromatin environments [6, 7]. Instead, there is evidence suggesting that TADs represent a compartmentalization of regulatory elements (Figure 3B). For example deletion of the Xist-Tsix or Epha4 TAD borders resulted in novel chromatin interactions and thus, a partial merging of the two neighboring TADs and altered gene expression [2, 7]. Furthermore, ectopic interactions between the enhancers and genes outside their normal TAD were detected by 4C analysis, consistent with the aberrant expression profiles [2]. Of particular interest, the ectopic interactions did not extend past the next proximal TAD border, suggesting that TAD structure partially contributes to gene regulation [2]. [7]Thus, genes within a TAD appear to have a partially shared regulatory environment by interacting with similar regulatory elements. Consistent with this hypothesis, analysis of gene expression profiles during cellular differentiation demonstrated that coordinated gene expression was most correlative when analyzing genes within a single TAD [7, 57].
The strength of TAD borders, defined as the ratio between intra- and inter-TAD interactions around border sequences, correlates with the number of architectural proteins bound: sites with few architectural proteins are weak borders, while sites with many architectural proteins are strong borders (Figure 3A). Border strength also correlates with the ability of these sequences to function as classical insulators in transgene reporter experiments [10]. A potential role for architectural proteins in the formation of TAD borders is supported by a variety of experimental results. Depletion of Rad21 or CTCF results in increased inter-TAD interactions, consistent with reduced TAD border strength [23-25, 53]. Similarly, the redistribution of architectural proteins from TAD borders to loci within TADs is accompanied by a concomitant decrease in TAD border strength [53]. A recent study evaluating inversions and deletions affecting TAD borders found in patients with limb malformations provided additional evidence that architectural proteins at least partially regulate chromatin interactions at TAD borders. Only deletions of TAD borders containing multiple occupied CTCF sites resulted in aberrant gene expression, indicating that the altered chromatin interactions upon deletion were not merely caused by increased proximity between regulatory elements but being regulated by architectural protein occupancy [2]. Together, these observations suggest a strong association between architectural protein occupancy and TAD borders, implicating that architectural proteins may regulate TAD structure.
Architectural proteins regulate chromatin interactions and gene expression within TADs
Additional compartmentalized regulatory regions have been characterized within TADs. Intra-TAD regions of enriched chromatin interactions encompassing ~100-200 kb in mammals have been named subTADs or contact domains [29, 32]. The presence of subTADs is consistent with the observation that genes within the same TAD can exhibit different chromatin states and expression levels. Further evidence that intra-TAD contacts represent interactions regulating gene expression comes from cell differentiation studies. During cellular differentiation, chromatin interactions within TADs change significantly (Figure 4A) [7, 29, 58]. More precisely, 893/1062 of mouse cortex TAD borders were conserved in mouse embryonic stem cells, while only 260/425 of chromatin interactions were conserved between neural progenitor cells and embryonic stem cells in mice [3, 29]. Furthermore, a subTAD at the Evx2-HoxD locus was strongly reduced after activation of the HoxD genes by retinoic acid, suggesting that dynamic subTAD structure and intra-TAD interactions are representative of unique gene expression profiles [58].
A number of depletion studies have provided evidence suggesting that architectural proteins facilitate intra-TAD interactions. Depletion of the architectural proteins Rad21 or CTCF results in a striking loss of intra-TAD contacts, implicating architectural proteins as positive regulators of chromatin interactions (Figure 4A) [23-25, 27, 53]. Furthermore, these architectural protein-mediated interactions are functionally significant in gene expression and likely represent cell-type specific enhancer-promoter interactions responsible for differential gene expression. The loss of CTCF and Rad21-dependent chromatin interactions results in the misregulation of hundreds of genes, with a reduction in the expression of highly active genes and activation of poorly expressed genes [24, 25, 59]. Intriguingly, analysis of thymocytes after Rad21 depletion shows that nearly 50% of the misregulated genes overlap or are in close proximity to conventional enhancers or super-enhancers [59]. Misregulation of genes in close proximity to enhancers could be the result of reduced facilitating activity of architectural proteins. For example, in the absence of Rad21, enhancers may not be properly tethered to their normal gene targets, resulting in promiscuous interactions with neighboring genes. Together, these studies provide strong correlative evidence that architectural proteins are critical regulators of chromatin interactions within TADs to fine-tune gene expression and generate a robust range of transcriptional outputs (Figure 4B).
Despite the apparent redundant function of architectural proteins in the establishment of TADs, individual architectural proteins appear to have distinct roles in regulating interactions within TADs. For example, depletion of CTCF or Rad21 exhibit many similarities but result in the misregulation of unique subsets of genes [24]. Additional evidence for distinct functions of architectural proteins comes from studies in mouse embryonic and neural progenitor cells. The unique combinations of CTCF, Smc1 and/or mediator bound at the anchors of chromatin interactions influence the genomic distance spanned by these interactions [29, 60]. Furthermore, the architectural protein CAP-H2 has been shown to antagonize, rather than facilitate, enhancer-promoter interactions. Comparative analyses of Cap-H2 and Rad21 depletion in Drosophila show that CAP-H2 antagonizes while Rad21 promotes long range interactions, consistent with architectural proteins having distinct functions [53]. Overall, these results suggest that the identity of the architectural proteins bound to an intra-TAD locus may influence the type of long range interactions formed.
Architectural proteins regulate enhancer-promoter interactions
The molecular mechanisms governing which promoter-enhancer pairs interact in 3D space have been somewhat elusive. Only 7% enhancer-promoter loops occur between a specific regulatory element and the closest transcriptional start site (TSS) [61]. Furthermore, each gene promoter has been estimated to contact an average of 4.75 enhancers, while 25% of enhancers were assigned to 2 or more promoters, highlighting the complexity of long range 3D interactions [60]. Recent studies have demonstrated a strong correlation between cohesin and CTCF occupancy and long range enhancer-promoter interactions [62]. ChIP-seq analysis has shown that the two cohesin subunits, Rad21 and Smc1A, as well as CTCF, exhibit higher occupancy within conventional and super-enhancers than in the neighboring genomic regions [59]. Furthermore, ChIA-PET analysis has shown that many long range chromatin interactions are anchored by enhancer-promoter pairs bound by both CTCF and cohesin, indicating a possible role for CTCF and cohesin in mediating these interactions [26, 27].
Architectural proteins are associated with enhancer-promoter interactions that result in full transcription activation. Transcription factor clusters are found throughout the genome at promoter-proximal sites and distal regulatory regions and are highly correlated with cohesin occupancy [56, 63]. Consistent with the different functions of architectural proteins observed in subTAD interactions, it is clear that CTCF is preferentially absent from a subset of active enhancer-promoter interactions. ChIP-seq analysis demonstrated that transcription factor hotspots contain the architectural proteins cohesin, mediator and nipbl but lack CTCF [28]. Indicative of cell-type specific gene expression profiles, only actively transcribed genes exhibit enhancer-promoter occupancy of cohesin, mediator and nipbl and long range chromatin interactions as measured by 3C [28]. In contrast, sites bound by cohesin, mediator and CTCF were more conserved between cells types, indicative of housekeeping genes or poised long range interactions [28, 64]. Whether CTCF or other uncharacterized architectural proteins are truly excluded from transcription factor hotspots or are selectively released after full transcriptional activation has yet to be explored further.
Correlations between architectural protein occupancy and transcriptionally unproductive interactions between enhancers and promoters at paused genes have also been observed. For example, BEAF-32 and CP190 are enriched at sites of paused RNA polymerase II, indicative that architectural proteins can form long range interactions prior to transcription elongation [65]. It is well established that interactions between enhancers and promoters form prior to gene activation and can be maintained even after blocking transcription with pharmacological agents [35, 66-70]. Architectural proteins have been implicated in regulating poised enhancer-promoter interactions during Drosophila embryonic development. Most interactions observed in 3-4 and 6-8 hour embryos represent poised enhancer-promoter contacts because only 6% of interactions change between these two time points despite distinct changes in transcription [71]. Strikingly, nearly 50% of the poised interactions present in 6-8 hour embryos in Drosophila were bound by the architectural proteins BEAF-32, CP190, CTCF, GAF, Mod(mdg4) and Su(Hw) [71]. Notably, the 50% correlation is likely an underestimate of occupancy because many architectural proteins were not investigated, including cohesin. Overall, this data suggests that architectural proteins can facilitate enhancer-promoter interactions in the process of transcription initiation, although additional events must be required to release RNA Polymerase II into productive elongation.
Of particular interest, architectural proteins are also associated with genomic elements other than enhancers and promoters. In fact, thirty three percent of the long range interactions mediated by cohesin and CTCF were bound to genomic loci without enhancers or promoters [26]. This observation can be explained by one of three possibilities. First, the occupancy of CTCF or cohesin at these sites could be unrelated to gene expression such as sites involved in V(D)J or sister chromatid cohesion, respectively [72, 73]. Secondly, the co-occupied cohesin-CTCF sites are bound to enhancers that have not been properly annotated yet. Due to the cell-type specificity of enhancer activity, the global mapping of regulatory regions is still likely far from complete. Finally, it is also possible that CTCF and cohesin form interactions between other regulatory elements such as Polycomb (Pc) response elements (PREs). Mutational analysis has demonstrated that long range Mcp PRE-mediated interactions do not require the PRE sequence but rather nearby architectural protein binding sites occupied by CTCF and CP190 [74]. Furthermore, depletion of the architectural proteins CTCF, CP190 or cohesin reduced or ablated long range Mcp-interactions, supporting a role for architectural proteins in mediating Pc interactions [75]. In addition, interacting architectural protein-bound loci have been shown to flank super-enhancers and Pc-repressed genes, forming super-enhancer and polycomb domains [26]. Clustered regularly interspaced short palindromic repeat (CRISPR) genome-editing was utilized to delete a CTCF motif at one domain border, resulting in the misregulation of genes within and outside of the domains [26]. Thus, these CTCF sites are still important regulators of gene expression, despite not directly involving enhancers and promoters.
Regulation of Architectural Proteins
The mechanisms that regulate the genomic occupancy of architectural proteins are still poorly defined. Even though most architectural proteins are ubiquitously expressed, recent results suggest that the NIPB promoter may interact with enhancers typically found associated with housekeeping genes as well as cell type specific enhancers. Zabidi and colleagues utilized STARR-seq to identify all of the enhancers that interact with a housekeeping gene promoter compared to a developmentally regulated promoter, demonstrating a clear distinction in the nature of enhancers activating these two classes of genes [76]. Analysis of four housekeeping gene promoters and three developmentally regulated promoters suggests a clear pattern of enhancer interactions with the two distinct core promoter classes. Intriguingly, the NIPB promoter exhibited contacts with housekeeping and developmentally regulated enhancers, suggesting that Nipbl expression may be partially modulated in a cell type specific manner [76]. Thus, it is interesting to speculate that other architectural protein genes may contain a hybrid class of core promoter like NIPB and thus, exhibit a partially cell-type specific expression pattern which could contribute to cell-type specific chromatin interactions.
Another potential mechanism to regulate architectural protein function involves the modulation of DNA binding. In mammals, CTCF binding is negatively regulated by methylation of its DNA recognition sequence [77]. Thus, cell-type specific methylation can result in cell-type specific CTCF occupancy [78]. In addition, different classes of regulatory elements may recruit distinct members of the architectural protein family based on their intrinsic sequences. For example, Drosophila enhancers shown to interact with housekeeping gene core promoters are enriched for the DRE motif, which can act as a binding site for the architectural proteins DREF or BEAF-32 [9, 76]. Thus, DREF and BEAF-32 may be important regulators of enhancer-promoter interactions between housekeeping genes but not developmentally regulated genes.
Because functional chromatin interactions depend on DNA-binding proteins and the recruitment of accessory architectural proteins, altering the composition of architectural protein complexes can affect their function. This process may be controlled by posttranslational modifications that regulate the interaction of architectural proteins with DNA or with each other. For example, CP190, Mod(mdg4), and CTCF can be sumoylated, which regulates the formation of insulator bodies in Drosophila [79, 80]. CTCF sumoylation has also been detected in mammals, but how the modification affects its ability to interact with DNA or other proteins is still unclear [81]. In addition, CTCF has been shown to be modified by polyADP ribosylation, which regulates DNA binding of CTCF in mammals and interactions between CTCF and CP190 in Drosophila [82-84]. Thus, differential covalent modifications of architectural proteins may generate unique protein subcomplexes important in regulating 3D interactions.
The presence of lncRNAs in architectural protein complexes could represent another mechanism to regulate the composition of protein complexes formed at various genomic loci. The protein-protein interactions of CTCF as well as CP190 have been shown to be dependent on the presence of ncRNAs [85, 86]. In fact, interactions between Tsix and Xite RNAs are critical for targeting CTCF to the X chromosome, suggesting a role for RNA in recruiting architectural proteins to the proper genomic loci and protein complexes [87]. Expanding the identification of RNAs present in architectural protein complexes will be important to gain insight into the role of RNA in regulating long range architectural protein interactions.
Conclusions and Perspectives
The data discussed in this review supports the hypothesis that architectural proteins play an important role in the establishment of interphase chromatin organization. These proteins mediate long range chromatin interactions to form loops that facilitate communication between some regulatory elements while reducing contacts between others. We propose a model that architectural proteins contribute to the formation of TAD borders and mediate chromatin interactions between enhancer, promoters, and PREs within TADs [53].
The mechanisms regulating the specificity of long range chromatin interactions between architectural protein-bound genomic loci are just beginning to be characterized. CTCF orientation is likely one important mechanism regulating long range chromatin interactions in mammals but is observed at less than half of the chromatin loops mapped in humans [32]. Thus, it will be interesting to determine if other architectural proteins also exhibit distinct orientations in the genome with respect to the long range chromatin interactions they mediate. Furthermore, the number of the architectural proteins bound to a genomic locus promotes specific types of long range chromatin interactions, raising the question what are high occupancy architectural protein sites interacting with to hinder chromatin interactions across TAD borders? One possibility is that TAD borders interact with each other in 3D space, looping out DNA in a rosette structure to prevent inter-TAD contacts. The 10-30 insulator bodies observed in Drosophila cells would then be expected to represent clusters of TAD borders from the four chromosomes [6, 9, 45]. Finally, there is correlative evidence that the identity of the architectural proteins bound to a genomic locus regulate the long range chromatin interactions. Thus, future studies better characterizing distinct multimeric architectural protein complexes including the PTMs and ncRNAs present could enhance our understanding about architectural proteins promote distinct long range chromatin interactions. By refining our understanding of how architectural proteins interact in 3D space, we will gain great insight into the mechanisms that allow architectural proteins to both facilitate and hinder interactions between regulatory elements genome-wide.
Abbreviations
(ChIP-seq) | Chromatin immunoprecipitation-sequencing |
(3C) | Chromosome conformation capture |
(ChIA-PET) | Chromatin interaction analysis by paired-end tag sequencing |
(STARR-seq) | Self-transcribing active regulatory region sequencing |
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
REFERENCES
Full text links
Read article at publisher's site: https://doi.org/10.1016/j.febslet.2015.05.025
Read article for free, from open access legal sources, via Unpaywall: https://europepmc.org/articles/pmc4598269?pdf=render
Citations & impact
Impact metrics
Citations of article over time
Alternative metrics
Article citations
YY1-controlled regulatory connectivity and transcription are influenced by the cell cycle.
Nat Genet, 56(9):1938-1952, 29 Aug 2024
Cited by: 2 articles | PMID: 39210046
Spatial proximity and gene function: a new dimension in prokaryotic gene association network analysis with 3D-GeneNet.
Brief Bioinform, 25(4):bbae320, 01 May 2024
Cited by: 0 articles | PMID: 38975892 | PMCID: PMC11229033
Enhancer-promoter interactions can form independently of genomic distance and be functional across TAD boundaries.
Nucleic Acids Res, 52(4):1702-1719, 01 Feb 2024
Cited by: 6 articles | PMID: 38084924 | PMCID: PMC10899756
3D chromatin interactions involving Drosophila insulators are infrequent but preferential and arise before TADs and transcription.
Nat Commun, 14(1):6678, 21 Oct 2023
Cited by: 6 articles | PMID: 37865700 | PMCID: PMC10590426
The Conserved Chromatin Remodeler SMARCAD1 Interacts with TFIIIC and Architectural Proteins in Human and Mouse.
Genes (Basel), 14(9):1793, 13 Sep 2023
Cited by: 2 articles | PMID: 37761933 | PMCID: PMC10530723
Go to all (48) article citations
Data
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Insulator function and topological domain border strength scale with architectural protein occupancy.
Genome Biol, 15(6):R82, 30 Jun 2014
Cited by: 187 articles | PMID: 24981874 | PMCID: PMC4226948
CTCF: an architectural protein bridging genome topology and function.
Nat Rev Genet, 15(4):234-246, 11 Mar 2014
Cited by: 620 articles | PMID: 24614316 | PMCID: PMC4610363
Review Free full text in Europe PMC
Towards a predictive model of chromatin 3D organization.
Semin Cell Dev Biol, 57:24-30, 03 Dec 2015
Cited by: 8 articles | PMID: 26658098 | PMCID: PMC4892986
Review Free full text in Europe PMC
Topologically Associating Domains: An invariant framework or a dynamic scaffold?
Nucleus, 6(6):430-434, 29 Sep 2015
Cited by: 17 articles | PMID: 26418477 | PMCID: PMC4915497