Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3584371.3613033acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
abstract

Deciphering the contribution of 3D interactions between cis-regulatory elements and promoters to regulate gene expression using graph neural networks

Published: 04 October 2023 Publication History

Abstract

Gene expression is regulated by various factors including nearby histone modifications (HMs), binding of transcription factors (TFs), and interplay of diverse cis-regulatory elements (CREs) in a cooperative manner. Two previous methods (GC-MERGE and Chromoformer) have been developed to predict gene expression by constructing a promoter-centered network based on 3D physical contacts between the promoter and CREs. However, these published models lack transferability to various size of promoter-centered graphs and quantify the influences of 3D physical contacts in gene regulation. In our approach, by jointly modeling multi-omics data, we can determine the main factors that regulate gene expression and can identify important CREs for each gene.
We present a novel deep learning architecture CreGNN based on a graph neural network (GNN) to decipher the contribution of 3D interactions among cis-regulatory elements (CREs) in gene expression. The transformer attention mechanism was applied to each pair of nodes in a graph. We used ~800 ChIP-seq datasets of HMs and TFs, Hi-C/micro-C, ATAC-seq and RNA-seq data from human, mouse and fly genomes. Promoter-centered graphs were built for each gene based on the open accessible or enhancer regions across the genome that contact the promoter in 3D space.
First, we trained a model based on the HMs and calculated the Pearson correlation, AUC, and AUPRC between raw gene expression and predicted gene expression in four cell lines. CreGNN showed better prediction performance relative to other existing methods. Second, we trained another model based on TFs, which showed slightly lower performance relative to HMs. By ranking node importance, we found that TFs and HMs coregulate gene expression by binding to different CREs. Third, we test the effect of 3D contacts on gene expression with different input. The prediction performance of other CREs were significantly lower than the promoter and the model with promoter only can achieve similar performance with promoter-centered graphs. 3D contact frequency showed low correlation with attention scores. Our results suggest that gene expression is dictated by promoter region activation or repression by CREs and that 3D contacts are necessary in specific cases but provide a lower relative contribution to overall gene expression.
Compared to previously developed methods, CreGNN exhibited higher accuracy and good transferability. Our method provides new insights into the function of specific CREs on respective gene promoters based on multi-omics data, and we systematically estimated the feature importance and node importance of 3D genome contacts, CREs and epigenomic features. We aspire to use our method to identify novel specific 3d contacts among promoters and CREs that contribute to the regulation of specific genes. This predictive power should not only contribute to our knowledge of specific gene regulatory mechanisms on the 3D level but also allow us to design targeted strategies to modify gene expression within individual cells.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
BCB '23: Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
September 2023
626 pages
ISBN:9798400701269
DOI:10.1145/3584371
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 October 2023

Check for updates

Author Tags

  1. graph neural networks
  2. gene expression
  3. histone modifications
  4. transcription factors

Qualifiers

  • Abstract

Funding Sources

  • the Intramural Program of the National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health

Conference

BCB '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 885 submissions, 29%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 37
    Total Downloads
  • Downloads (Last 12 months)37
  • Downloads (Last 6 weeks)2
Reflects downloads up to 24 Sep 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media