Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3574198.3574211acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicbbeConference Proceedingsconference-collections
research-article

Prediction of protein interactions based on CT-DNN

Published: 15 March 2023 Publication History

Abstract

Protein-protein interactions (PPIs) play critical roles in many cellular biological processes, underlie the entire molecular machinery of living cells, and can be used to aid in drug target detection and therapeutic design. Traditional methods such as biochemical analysis and chromatography have low efficiency and low coverage for PPI identification. In recent years, the amino acid sequence information of proteins has developed rapidly, and machine learning methods based on amino acid sequences have been widely developed. It is difficult to improve the PPI prediction accuracy of traditional machine learning, and it is difficult to deal with the increasing protein sequence data, which hinders the application of traditional machine learning in protein interaction prediction. Deep learning methods simulate the learning mechanism of the human brain and have been successfully applied in speech recognition and image recognition, natural language understanding and other fields. Compared with traditional machine learning methods, deep learning algorithms can handle large-scale raw and complex data and can automatically learn useful abstract features. Deep neural networks (DNNs) have brought a turnaround in protein interaction prediction, which can process large-scale protein sequence data and automatically extract low-level or high-level features required for classification. In this paper, we propose a CT-DNN method to predict PPI, which takes the protein sequence as the research basis, considers the properties of an amino acid and its neighboring amino acids, and divides the amino acids into 7 amino acids according to the dipolar polarity and volume of the amino side chain category, take three consecutive amino acids as a unit to obtain the projection vector space of the protein sequence, and normalize it. By constructing protein interaction relationship pairs, a deep neural network (DNN) method is introduced for learning. By adjusting the learning rate, Different parameters such as activation function are used to improve the prediction performance of the model. The best model is trained through 10-fold cross-validation, on the test set, auc is 0.983 and aupr is 0.982. Compared with other commonly used models, it is proved that CT-DNN has has good learning and generalization ability, and can be effectively used for PPI prediction.

References

[1]
Howl J, Jones S. Insights into the molecular mechanisms of action of bioportides: a strategy to target protein-protein interactions[J]. Expert Reviews in Molecular Medicine, 2015, 17.
[2]
Bludau I, Aebersold R. Proteomic and interactomic insights into the molecular basis of cell functional diversity[J]. Nature Reviews Molecular Cell Biology, 2020, 21(6): 327-340.
[3]
Shlomai J. Redox control of protein–DNA interactions: from molecular mechanisms to significance in signal transduction, gene expression, and DNA replication[J]. Antioxidants & redox signaling, 2010, 13(9): 1429-1476.
[4]
Sieracki N A, Komarova Y A. Studying cell signal transduction with biomimetic point mutations[J]. Genetic Manipulation of DNA and Protein-Examples from Current Research, 2013: 381-392.
[5]
Rao V S, Srinivas K, Sujini G N, Protein-protein interaction detection: methods and analysis[J]. International journal of proteomics, 2014, 2014.
[6]
Keskin O, Tuncbag N, Gursoy A. Predicting protein–protein interactions from the molecular to the proteome level[J]. Chemical reviews, 2016, 116(8): 4884-4909.
[7]
Xiao H, Huang D, Pan Y, Fault diagnosis and prognosis of wastewater processes with incomplete data by the auto-associative neural networks and ARMA model[J]. Chemometrics and Intelligent Laboratory Systems, 2017, 161: 96-107.
[8]
Khan S H, He X, Porikli F, Forest change detection in incomplete satellite images with deep neural networks[J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(9): 5407-5423.
[9]
Shen J, Zhang J, Luo X, Predicting protein–protein interactions based only on sequences information[J]. Proceedings of the National Academy of Sciences, 2007, 104(11): 4337-4341.
[10]
Singhal M, Resat H. A domain-based approach to predict protein-protein interactions[J]. Bmc Bioinformatics, 2007, 8(1): 1-19.
[11]
Chen C, Zhang Q, Ma Q, LightGBM-PPI: Predicting protein-protein interactions through LightGBM with multi-information fusion[J]. Chemometrics and Intelligent Laboratory Systems, 2019, 191: 54-64.
[12]
Romero‐Molina S, Ruiz‐Blanco Y B, Harms M, PPI‐detect: A support vector machine model for sequence‐based prediction of protein–protein interactions[J]. Journal of computational chemistry, 2019, 40(11): 1233-1242.
[13]
Wang S, Sun S, Li Z, Accurate de novo prediction of protein contact map by ultra-deep learning model[J]. PLoS computational biology, 2017, 13(1): e1005324.
[14]
Luo P, Tian LP, Ruan J, Wu FX. Disease Gene Prediction by Integrating PPI Networks, Clinical RNA-Seq Data and OMIM Data. IEEE/ACM Trans Comput Biol Bioinform. 2019 Jan-Feb;16(1):222-232. Epub 2017 Nov 7. 29990218.
[15]
He T, Liu Y, Ko T H, Contextual correlation preserving multiview featured graph clustering[J]. IEEE transactions on cybernetics, 2019, 50(10): 4318-4331.
[16]
Velickovic P, Cucurull G, Casanova A, Graph attention networks[J]. stat, 2017, 1050: 20.
[17]
Kipf T N, Welling M. Semi-supervised classification with graph convolutional networks[J]. arXiv preprint arXiv:1609.02907, 2016.
[18]
Qiu W, Li S, Cui X, Predicting protein submitochondrial locations by incorporating the pseudo-position specific scoring matrix into the general Chou's pseudo-amino acid composition[J]. Journal of theoretical biology, 2018, 450: 86-103.
[19]
Dehzangi A, Heffernan R, Sharma A, Gram-positive and Gram-negative protein subcellular localization by incorporating evolutionary-based descriptors into Chou s general PseAAC[J]. Journal of theoretical biology, 2015, 364: 284-294.
[20]
Sastri M, Darshi M, Mackey M, Sub-mitochondrial localization of the genetic-tagged mitochondrial intermembrane space-bridging components Mic19, Mic60 and Sam50[J]. Journal of cell science, 2017, 130(19): 3248-3260.
[21]
Chen Y L, Li Q Z. Prediction of the subcellular location of apoptosis proteins[J]. Journal of Theoretical Biology, 2007, 245(4): 775-783.
[22]
Liu L, Tian J, Nan H, Porcine reproductive and respiratory syndrome virus nucleocapsid protein interacts with Nsp9 and cellular DHX9 to regulate viral RNA synthesis[J]. Journal of virology, 2016, 90(11): 5384-5398.
[23]
Du X, Sun S, Hu C, DeepPPI: boosting prediction of protein–protein interactions with deep neural networks[J]. Journal of chemical information and modeling, 2017, 57(6): 1499-1510.
[24]
Wang Y B, You Z H, Li X, Predicting protein–protein interactions from protein sequences by a stacked sparse autoencoder deep neural network[J]. Molecular BioSystems, 2017, 13(7): 1336-1344.
[25]
Pan X Y, Zhang Y N, Shen H B. Large-Scale prediction of human protein− protein interactions from amino acid sequence based on latent topic features[J]. Journal of proteome research, 2010, 9(10): 4992-5001.
[26]
Cutler A, Cutler D R, Stevens J R. Random forests[M]//Ensemble machine learning. Springer, Boston, MA, 2012: 157-175.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICBBE '22: Proceedings of the 2022 9th International Conference on Biomedical and Bioinformatics Engineering
November 2022
306 pages
ISBN:9781450397223
DOI:10.1145/3574198
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 March 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CT
  2. DNN
  3. PPI
  4. protein-protein interaction prediction

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • 2022 Guangdong Education Science Planning Project (Higher Education Project)

Conference

ICBBE 2022

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 31
    Total Downloads
  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)3
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media