Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3589437.3589439acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbbConference Proceedingsconference-collections
research-article

Prediction of Protein Interactions Based on Cnn-Lstm

Published: 20 July 2023 Publication History

Abstract

Protein is the material basis and the only form of all life activities, and it is also the material basis or drug for diagnosing and treating diseases. The number of human proteins not only far exceeds the number of genes, but also due to the variability and diversity of proteins, protein research techniques are far more complex and difficult than nucleic acid techniques. Protein-protein interactions (PPIs) play key roles in many cellular biological processes and underlie the entire molecular machinery of living cells, which can be used to aid in drug target detection and therapeutic design. Deep learning methods have produced many research results in the field of bioinformatics. Convolutional neural network (CNN) methods and LSTM methods have strong spatial and sequence feature representation learning capabilities, and have achieved outstanding results in the fields of images and text. In-depth research can be done in the field of PPI. In this paper, we propose a CNN-LSTM method to predict PPI. Taking the protein sequence as the research basis, the protein sequence is encoded in hexadecimal, and the protein interaction relationship pair is constructed, and the CNN method and the LSTM method are introduced for fusion learning. A 3-layer convolutional network is used for representation learning, and then connected to the LSTM layer. The prediction performance of the model is improved by adjusting different parameters such as learning rate and activation function. On the test set, Auc is 0.9212 and F1 is 0.9206, and compared with other commonly used models, it proves that CNN-LSTM has good learning and generalization capabilities, and can be effectively used for PPI prediction.

References

[1]
Richards A L, Eckhardt M, Krogan N J. Mass spectrometry‐based protein–protein interaction networks for the study of human diseases[J]. Molecular systems biology, 2021, 17(1): e8792.
[2]
Casadio R, Martelli P L, Savojardo C. Machine learning solutions for predicting protein–protein interactions[J]. Wiley Interdisciplinary Reviews: Computational Molecular Science, 2022: e1618.
[3]
Mahood E H, Kruse L H, Moghe G D. Machine learning: A powerful tool for gene function prediction in plants[J]. Applications in Plant Sciences, 2020, 8(7): e11376.
[4]
Zhang B, Li J, Quan L, Sequence-based prediction of protein-protein interaction sites by simplified long short-term memory network[J]. Neurocomputing, 2019, 357: 86-100.
[5]
Sun T, Zhou B, Lai L, Sequence-based prediction of protein protein interaction using a deep-learning algorithm[J]. BMC bioinformatics, 2017, 18(1): 1-8.
[6]
Debnath S, Mollah A F. A Supervised Machine Learning Approach for Sequence Based Protein-protein Interaction (PPI) Prediction[J]. arXiv preprint arXiv:2203.12659, 2022.
[7]
Hashemifar S, Neyshabur B, Khan A A, Predicting protein–protein interactions through sequence-based deep learning[J]. Bioinformatics, 2018, 34(17): i802-i810.
[8]
Li H, Gong X J, Yu H, Deep neural network based predictions of protein interactions using primary sequences[J]. Molecules, 2018, 23(8): 1923.
[9]
Yao Y, Du X, Diao Y, An integration of deep learning with feature embedding for protein–protein interaction prediction[J]. PeerJ, 2019, 7: e7126.
[10]
Chen M, Ju C J T, Zhou G, Multifaceted protein–protein interaction prediction based on Siamese residual RCNN[J]. Bioinformatics, 2019, 35(14): i305-i314.
[11]
Guo Y, Yu L, Wen Z, Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequences[J]. Nucleic acids research, 2008, 36(9): 3025-3030.
[12]
Altschul S F, Koonin E V. Iterated profile searches with PSI-BLAST—a tool for discovery in protein databases[J]. Trends in biochemical sciences, 1998, 23(11): 444-447.
[13]
O'Shea K, Nash R. An introduction to convolutional neural networks[J]. arXiv preprint arXiv:1511.08458, 2015.
[14]
Longwell S, Shimko T. Res2Vec: Amino acid vector embeddings from 3d-protein structure[J]. THRESHOLD, 30(22,344): 22,344.
[15]
Gui Y, Wang R, Wei Y, DNN-PPI: a large-scale prediction of protein–protein interactions based on deep neural networks[J]. Journal of Biological Systems, 2019, 27(01): 1-18.
[16]
Chen M, Ju C J T, Zhou G, Multifaceted protein–protein interaction prediction based on Siamese residual RCNN[J]. Bioinformatics, 2019, 35(14): i305-i314.
[17]
Jha K, Saha S. Amalgamation of 3D structure and sequence information for protein–protein interaction prediction[J]. Scientific Reports, 2020, 10(1): 1-14.
[18]
Alakus T B, Turkoglu I. Prediction of protein-protein interactions with LSTM deep learning model[C]//2019 3rd International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT). IEEE, 2019: 1-5.
[19]
Wang L, Wang H F, Liu S R, Predicting protein-protein interactions from matrix-based protein sequence using convolution neural network and feature-selective rotation forest[J]. Scientific reports, 2019, 9(1): 1-12.
[20]
Zhang H, Guan R, Zhou F, Deep residual convolutional neural network for protein-protein interaction extraction[J]. Ieee Access, 2019, 7: 89354-89365.
[21]
Liu J, Gong X. Attention mechanism enhanced LSTM with residual architecture and its application for protein-protein interaction residue pairs prediction[J]. BMC bioinformatics, 2019, 20(1): 1-11.
[22]
Alakus T B, Turkoglu I. Prediction of protein-protein interactions with LSTM deep learning model[C]//2019 3rd International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT). IEEE, 2019: 1-5.
[23]
Pan X Y, Zhang Y N, Shen H B. Large-Scale prediction of human protein− protein interactions from amino acid sequence based on latent topic features[J]. Journal of proteome research, 2010, 9(10): 4992-5001.
[24]
Keshava Prasad T S, Goel R, Kandasamy K, Human protein reference database—2009 update[J]. Nucleic acids research, 2009, 37(suppl_1): D767-D772.
[25]
Yanofsky C. Establishing the triplet nature of the genetic code[J]. Cell, 2007, 128(5): 815-818.
[26]
Zhou Z H, Feng J. Deep Forest: Towards An Alternative to Deep Neural Networks[C]//IJCAI. 2017: 3553-3559.

Index Terms

  1. Prediction of Protein Interactions Based on Cnn-Lstm

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCBB '22: Proceedings of the 2022 6th International Conference on Computational Biology and Bioinformatics
    December 2022
    87 pages
    ISBN:9781450397636
    DOI:10.1145/3589437
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 July 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. CNN
    2. CNN-LSTM
    3. LSTM
    4. PPI
    5. protein-protein interaction prediction

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • Higher Education Science Special Fund of the Guangdong Provincial Department of Education and HJAI-LAB

    Conference

    ICCBB 2022

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 67
      Total Downloads
    • Downloads (Last 12 months)47
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 20 Nov 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media