Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3584371.3612950acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

A Sequence-Based Prediction Model of Vesicular Transport Proteins Using Ensemble Deep Learning

Published: 04 October 2023 Publication History

Abstract

This study aims to employ computational methods for the accurate identification of vesicular transport proteins. The identification of these proteins holds great significance in enhancing our understanding of their protein family structure, thereby enabling the design of more effective drug targets for individuals afflicted with endocrine disorders. In recent times, researchers in the field of biology have increasingly sought to leverage deep learning techniques to address this challenge. In order to further enhance the classification performance, we investigated the following models incorporating distinct features: (1) We devised a novel protein feature called AAC_PSSM by amalgamating amino acid composition (AAC) and position-specific scoring matrix (PSSM) features. Subsequently, a gated recurrent unit (GRU) model was employed to learn such features; (2) An ensemble model was constructed by combining the existing GRU model with the model of a neural network featuring the AAC feature; (3) Random forest analysis was conducted using the pseudo-amino acid composition (PseAAC) feature; (4) Furthermore, we explored a natural language processing (NLP) approach by considering the protein sequence as a natural language and applying various neural network architectures. Upon analyzing the results obtained from the different models, it was observed that the ensemble model incorporating PSSM and AAC features exhibited the highest sensitivity of 81.03% and accuracy of 82.43%. Notably, our proposed model surpassed the performance of state-of-the-art models addressing the same problem and datasets, thus establishing its superiority.

References

[1]
Yongjian Liu and Robert H Edwards. The role of vesicular transport proteins in synaptic transmission and neural degeneration. Annual Review of Neuroscience, 20(1):125--156, 1997.
[2]
William E Balch. Small gtp-binding proteins in vesicular transport. Trends in Biochemical Sciences, 15(12):473--477, 1990.
[3]
Nguyen Quoc Khanh Le and Van-Nui Nguyen. Snare-cnn: a 2d convolutional neural network architecture to identify snare proteins from high-throughput sequencing data. PeerJ Computer Science, 5:e177, 2019.
[4]
Juan S Bonifacino and Jennifer Lippincott-Schwartz. Coat proteins: shaping membrane transport. Nature Reviews Molecular Cell Biology, 4(5):409--414, 2003.
[5]
Nobuyuki Tanaka, Masanao Kyuuma, and Kazuo Sugamura. Endosomal sorting complex required for transport proteins in cancer pathogenesis, vesicular transport, and non-endosomal functions. Cancer Science, 99(7):1293--1303, 2008.
[6]
Paul Gissen and Eamonn R Maher. Cargos and genes: insights into vesicular transport from inherited human disease. Journal of Medical Genetics, 44(9):545--555, 2007.
[7]
Nguyen Quoc Khanh Le, Quang Thai Ho, and Yu Yen Ou. Classifying the molecular functions of rab gtpases in membrane trafficking using deep convolutional neural networks. Analytical Biochemistry, 555:33--41, 2018.
[8]
Nguyen Quoc Khanh Le, Tuan-Tu Huynh, Edward Kien Yee Yapp, and Hui-Yuan Yeh. Identification of clathrin proteins by incorporating hyperparameter optimization in deep learning and pssm profiles. Computer Methods and Programs in Biomedicine, 177:81--88, 2019.
[9]
Nguyen Quoc Khanh Le, Edward Kien Yee Yapp, Nagarajan Nagasundaram, Matthew Chin Heng Chua, and Hui-Yuan Yeh. Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture. Computational and Structural Biotechnology Journal, 17:1245--1254, 2019.
[10]
Zhiyu Tao, Yanjuan Li, Zhixia Teng, and Yuming Zhao. A method for identifying vesicle transport proteins based on libsvm and mrmd. Computational and Mathematical Methods in Medicine, 2020:1--9, 2020.
[11]
Yue Gong, Benzhi Dong, Zixiao Zhang, Yixiao Zhai, Bo Gao, Tianjiao Zhang, and Jingyu Zhang. Vtp-identifier: Vesicular transport proteins identification based on pssm profiles and xgboost. Frontiers in Genetics, 12:808856, 2022.
[12]
Rui Fan, Bing Suo, and Yijie Ding. Identification of vesicle transport proteins via hypergraph regularized k-local hyperplane distance nearest neighbour model. Frontiers in Genetics, 13, 2022.
[13]
Rui Fan, Yijie Ding, Quan Zou, and Lei Yuan. Multi-view local hyperplane nearest neighbor model based on independence criterion for identifying vesicular transport proteins. International Journal of Biological Macromolecules, 247:125774, 2023.
[14]
Yu Chen, Lixin Gao, and Tianjiao Zhang. Stack-vtp: prediction of vesicle transport proteins based on stacked ensemble classifier and evolutionary information. BMC Bioinformatics, 24(1):1--18, 2023.
[15]
UniProt Consortium. Uniprot: a worldwide hub of protein knowledge. Nucleic Acids Research, 47(D1):D506--D515, 2019.
[16]
Gene Ontology Consortium. The gene ontology resource: 20 years and still going strong. Nucleic Acids Research, 47(D1):D330--D338, 2019.
[17]
Stephen F Altschul, Thomas L Madden, Alejandro A Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J Lipman. Gapped blast and psi-blast: a new generation of protein database search programs. Nucleic Acids Research, 25(17):3389--3402, 1997.
[18]
David T Jones. Protein secondary structure prediction based on position-specific scoring matrices. Journal of Molecular Biology, 292(2):195--202, 1999.
[19]
Pufeng Du, Shuwang Gu, and Yasen Jiao. Pseaac-general: fast building various modes of general form of chou's pseudo-amino acid composition for large-scale protein datasets. International Journal of Molecular Sciences, 15(3):3495--3506, 2014.
[20]
Nan Xiao, Dong-Sheng Cao, Min-Feng Zhu, and Qing-Song Xu. protr/protrweb: R package and web server for generating various numerical representation schemes of protein sequences. Bioinformatics, 31(11):1857--1859, 2015.
[21]
Zhen Shen, Su-Ping Deng, and De-Shuang Huang. Rna-protein binding sites prediction via multi scale convolutional gated recurrent unit networks. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 17(5):1741--1750, 2019.
[22]
Shuhui Liu, Shunfang Wang, and Haiyan Ding. Protein sub-nuclear location by fusing aac and pssm features based on sequence information. In 2015 IEEE 5th International Conference on Electronics Information and Emergency Communication, pages 236--239. IEEE, 2015.
[23]
Zhen Chen, Pei Zhao, Fuyi Li, André Leier, Tatiana T Marquez-Lago, Yanan Wang, Geoffrey I Webb, A Ian Smith, Roger J Daly, Kuo-Chen Chou, et al. ifeature: a python package and web server for features extraction and selection from protein and peptide sequences. Bioinformatics, 34(14):2499--2502, 2018.
[24]
Nguyen Quoc Khanh Le. Potential of deep representative learning features to interpret the sequence information in proteomics. Proteomics, pages e2100232--e2100232, 2021.
[25]
Ashish Ranjan, Md Shah Fahad, David Fernández-Baca, Akshay Deepak, and Sudhakar Tripathi. Deep robust framework for protein function prediction using variable-length protein sequences. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 17(5):1648--1659, 2019.
[26]
Wen Torng and Russ B Altman. High precision protein functional site detection using 3d convolutional neural networks. Bioinformatics, 35(9):1503--1512, 2019.
[27]
John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, et al. Highly accurate protein structure prediction with alphafold. Nature, 596(7873):583--589, 2021.
[28]
Minkyung Baek, Frank DiMaio, Ivan Anishchenko, Justas Dauparas, Sergey Ovchinnikov, Gyu Rie Lee, Jue Wang, Qian Cong, Lisa N Kinch, R Dustin Schaeffer, et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science, 373(6557):871--876, 2021.
[29]
Roman A Laskowski, James D Watson, and Janet M Thornton. Protein function prediction using local 3d templates. Journal of Molecular Biology, 351(3):614--626, 2005.
[30]
Thanh Hoa Vo, Ngan Thi Kim Nguyen, Quang Hien Kha, and Nguyen Quoc Khanh Le. On the road to explainable ai in drug-drug interactions prediction: A systematic review. Computational and Structural Biotechnology Journal, 2022.
[31]
Xiao Liu, Fanjin Zhang, Zhenyu Hou, Li Mian, Zhaoyu Wang, Jing Zhang, and Jie Tang. Self-supervised learning: Generative or contrastive. IEEE Transactions on Knowledge and Data Engineering, 35(1):857--876, 2021.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
BCB '23: Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
September 2023
626 pages
ISBN:9798400701269
DOI:10.1145/3584371
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 October 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. deep learning
  2. nesemble learning
  3. gate recurrent unit
  4. position-specific scoring matrix
  5. protein sequence
  6. vesicular transport

Qualifiers

  • Research-article

Funding Sources

Conference

BCB '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 885 submissions, 29%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 51
    Total Downloads
  • Downloads (Last 12 months)51
  • Downloads (Last 6 weeks)2
Reflects downloads up to 24 Sep 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media