Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Classification of Protein Structure Classes on Flexible Neutral Tree

Published: 01 September 2017 Publication History

Abstract

Accurate classification on protein structural is playing an important role in Bioinformatics. An increase in evidence demonstrates that a variety of classification methods have been employed in such a field. In this research, the features of amino acids composition, secondary structure's feature, and correlation coefficient of amino acid dimers and amino acid triplets have been used. Flexible neutral tree FNT, a particular tree structure neutral network, has been employed as the classification model in the protein structures’ classification framework. Considering different feature groups owing diverse roles in the model, impact factors of different groups have been put forward in this research. In order to evaluate different impact factors, Impact Factors Scaling IFS algorithm, which aim at reducing redundant information of the selected features in some degree, have been put forward. To examine the performance of such framework, the 640, 1189, and ASTRAL datasets are employed as the low-homology protein structure benchmark datasets. Experimental results demonstrate that the performance of the proposed method is better than the other methods in the low-homology protein tertiary structures.

References

[1]
K. D. Kedarisetti, L. A. Kurgan, and S. Dick, "Classifier ensembles for protein structural class prediction with varying homology," Biochem. Biophys. Res. Commun., vol. 348, pp. 981-988, 2006.
[2]
R. B. Corey and L. Pauling, "Fundamental dimensions of polypeptide chains," in Proc. Roy. Soc. London, 1953, vol. 141, no. 902, pp. 10-20.
[3]
C. Levinthal, "Are there pathways for protein folding," Chimie Physique, vol. 65, pp. 44-45, 1968.
[4]
C. B. Anfinsen, "Principles that govern the folding of protein chains," Science, vol. 181, pp. 223-230, 1973.
[5]
J. Groppe et al., "Structural basis of BMP signalling inhibition by the cystine knot protein Noggin," Nature, vol. 420, pp. 636-642, 2002.
[6]
S. F. Altschul and E. V. Koonin, "Iterated profile searches with PSIBLAST--A tool for discovery in protein databases," Trends Biochem. Sci., vol. 23, pp. 444-447, 1998.
[7]
K. Drew, et al., "The proteome folding project: Proteome-scale prediction of structure and function," Genome Res., vol. 21, pp. 1981-1994, 2011.
[8]
D. L. Halligan, F. Oliver, A. Eyre-Walker, B. Harr, and P. D. Keightley, "Evidence for pervasive adaptive protein evolution in wild mice," PLoS Genet, vol. 6, 2010, Art. no. e1000825.
[9]
W. Li, et al., "Phylogenetic analysis, structural evolution and functional divergence of the 12-oxo-phytodienoate acid reductase gene family in plants," BMC Evol. Biol., vol. 9, 2009, Art. no. 90.
[10]
X. Deng, J. Morris, C. Chaton, G. F. Schroder, W. S. Davidson, and T. B. Thompson, "Small-angle X-ray scattering of apolipoprotein A-IV reveals the importance of its termini for structural stability," J. Biol. Chemistry, vol. 288, pp. 4854-4866, 2013.
[11]
T. B. Thompson, T. F. Lerch, R. W. Cook, and T. K. Woodruff, "The structure of the follistatin: Activin complex reveals antagonism of both type I and type II receptor binding," Devel. Cell, vol. 9, pp. 535-543, 2005.
[12]
Z. Ji, D. Wu, W. Zhao, et al., "Systemic modeling myelomaosteoclast interactions under normoxic/hypoxic condition using a novel computational approach," Sci. Rep., vol. 5, 2015, Art. no. 13291.
[13]
A. D. Ullah and K. Steinhöfel, "A hybrid approach to protein folding problem integrating constraint programming with local search," BMC Bioinf., vol. 11, 2010, Art. no. S39.
[14]
S. Shatabda, M. H. Newton, and A. Sattar, "Mixed heuristic local search for protein structure prediction," in Proc. 27th AAAI Conf. Artif. Intell., 2013, pp. 876-882.
[15]
M. A. Rashid, M. T. Hoque, M. H. Newton, D. N. Pham, and A. Sattar, "A new genetic algorithm for simplified protein structure prediction," in Advances in Artificial Intelligence. Berlin, Germany: Springer, pp. 107-119.
[16]
S. Shatabda, M. A. H. Newton, M. A. Rashid, D. N. Pham, and A. Sattar, "How good are simplified models for protein structure prediction," Adv. Bioinf., vol. 2014, 1-9, 2014.
[17]
T. A. Cross, M. Sharma, M. Yi, and H. X. Zhou, "Influence of solubilizing environments on membrane protein structures," Trends Biochem Sci., vol. 36, pp. 117-125, 2011.
[18]
H. W. Choe, et al., "Crystal structure of metarhodopsin II," Nature, vol. 471, pp. 651-655, 2011.
[19]
T. H. Bayburt and S. G. Sligar, "Membrane protein assembly into Nanodiscs," FEBS Lett., vol. 584, pp. 1721-1727, 2010.
[20]
D. P. Claxton, et al., "Ion/substrate dependent conformational dynamics of a bacterial homolog of neurotransmitter: Sodium symporters," Nat. Struct. Mol. Biol., vol. 17, pp. 822-829, 2010.
[21]
L. Zhu, D-S. Ping, and D. S. Huang, "A two stage geometric method for pruning unreliable links in protein-protein networks," IEEE Trans. Nano-Biosci., vol. 14, no. 5, pp. 528-534, Jul. 2015.
[22]
S.-P. Deng, L. Zhu, and D. S. Huang, "Mining the bladder cancer-associated genes by an integrated strategy for the construction and analysis of differential co-expression networks," BMC Genomics, 2015, vol. 16, no. Suppl 3, Art. no. S4.
[23]
C. H. Q. Ding and I. Dubchak, "Multi-class protein fold recognition using support vector machines and neural networks," Bioinf., vol. 17, no. 4, pp. 349-358, 2001.
[24]
H. Ding, et al., "iCTX-Type: A sequence-based predictor for identifying the types of conotoxins in targeting ion channels," BioMed Res. Int., vol. 2014, 2014, Art. no. 286419.
[25]
X. Guo and X. Gao, "A novel hierarchical ensemble classifier for protein fold recognition," Protein Eng. Des., vol. 21, no. 11, pp. 659-664, 2008.
[26]
K. Kavousi, et al., "Evidence theoretic protein fold classification based on the concept of hyperfold," Math. Biosci., vol. 240, no. 2, pp. 148-160, 2012.
[27]
R. Aram and N. M. Charkari, "A two-layer classification framework for protein fold recognition," J. Theoretical Biol., vol. 365, pp. 32-39, 2015.
[28]
M. Q. Yang, Y. Chen, F. Chen, and J. K. Yang, "Ensemble voting system for multiclass protein fold recognition," Int. J. Pattern Recognit. Artif. Intell., vol. 22, no. 4, pp. 747-763, 2008.
[29]
W. Bao, Y. Chen, and D. Wang, "Prediction of protein structure classes with flexible neural tree," Bio-Med. Mater. Eng., vol. 24, pp. 3797-3806, 2014.
[30]
L. Kurgan and L. Homaeian, "Prediction of structural classes for protein sequences and domains--Impact of prediction algorithms, sequence representation and homology, and test procedures on accuracy," Pattern Recognit., vol. 39, pp. 2323-2343, 2006.
[31]
T. L. Zhang, Y. S. Ding, and K. C. Chou, "Prediction protein structural classes with pseudo amino acid composition: Approximate entropy and hydrophobicity pattern," J. Theoretical. Biol., vol. 250, pp. 186-193, 2008.
[32]
H.-J. Yu and D. S. Huang, "Graphical representation for DNA sequences via joint diagonalization of matrix pencil," IEEE J. Biomed. Health Inf., vol. 17, no. 3, pp. 503-511, May 2013.
[33]
X.-M. Zhao, Y.-M. Cheung, and D. S. Huang, "Analysis of gene expression data using RPEM algorithm in normal mixture model with dynamic adjustment of learning rate," Int. J. Pattern Recognit. Artif. Intell., vol. 24, no. 4, pp. 651-666, 2010.
[34]
A. Andreeva, et al., "Data growth and its impact on the SCOP database: New development," Nucleic Acids Res., vol. 36, pp. D419-D425, 2007.
[35]
D. S. Huang, L. Zhang, K. Han, S. Deng, K. Yang, and H. Zhang, "Prediction of protein-protein interactions based on protein-protein correlation using least squares regression," Current Protein Peptide Sci.," vol. 15, no. 6, pp. 553-560, 2014.
[36]
D. S. Huang and H.-J. Yu, "Normalized feature vectors: A novel alignment-free sequence comparison method based on the numbers of adjacent amino acids," IEEE/ACM Trans. Comput. Biol. Bioinf., vol. 10, no. 2, pp. 457-467, Mar. 2013.
[37]
H. Poor, "A hypertext history of multiuser dimensions," MUD History, 1986. [Online]. Available: http://www.ccs.neu.edu/home/pb/mud-history.html
[38]
C. H. Q. Ding and I. Dubchak, "Multi-class protein fold recognition using support vector machines and neural networks," Bioinf., vol. 17, no. 4, pp. 349-358, 2001.
[39]
K. Chen, L. A. Kurgan, and J. S. Ruan, "Prediction of protein structural class using novel evolutionary collocation-based sequence representation," J. Comput. Chemistry, vol. 29, no. 2008, pp. 1596- 1604.
[40]
S.-L. Wang, Y. Zhu, W. Jia, and D. S. Huang, "Robust classification method of tumor subtype by using correlation filters," IEEE/ACM Trans. Comput. Biol. Bioinf., vol. 9, no. 2, pp. 580-591, Mar.-Apr. 2012.
[41]
S. F. Altschul, et al., "Gapped BLAST and PSI-BLAST: A new generation of protein database search programs," Nucleic Acids Res., vol. 25, pp. 3389-3402, 1997.
[42]
L. A. Kurgan, T. Zhang, H. Zhang, S. Shen, and J. Ruan, "Secondary structure-based assignment of the protein structural classes," Amino Acids, vol. 35, pp. 551-564, 2008.
[43]
L. Kurgan, K. Cios, and K. Chen, "SCPRED: Accurate prediction of protein structural class for sequences of twilight-zone similarity with predicting sequences," BMC Bioinf., vol. 9, 2008, Art. no. 226.
[44]
T. Liu and C. Jia, "A high-accuracy protein structural class prediction algorithm using predicted secondary structural information," J. Theoretical Biol., vol. 267, pp. 272-275, 2010.
[45]
L. Zhu, Z.-H. You, D. S. Huang, and B. Wang, "t-LSE: A novel robust geometric approach for modeling protein-protein interaction networks," PLoS One, vol. 8, no. 4, Art. no. e58368.
[46]
S. Ding, S. Zhang, Y. Li, and T. Wang, "A novel protein structural classes prediction method based on predicted secondary structure," Biochimie, vol. 94, pp. 1166-1171, 2012.
[47]
Z. R. Li, H. H. Lin, L. Y. Han, L. Jiang, X. Chen, and Y. Z. Chen, "PROFEAT: A web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence," Nucleic Acids Res., vol. 34, pp. W32-W37, 2006.
[48]
H. B. Rao, F. Zhu, G. B. Yang, Z. R. Li, and Y. Z. Chen, "Update of PROFEAT: A web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence," Nucleic Acids Res., vol. 39, pp. W385-W39, 2011.
[49]
P. Chatterjee, S. Basu, and M. Nasipuri, "Improving prediction of protein secondary structure using physicochemical properties of amino acids," in Proc. Int. Symp. Biocomputing, 2010, Art. no. 10.
[50]
B. Yang, Y. H. Chen, and M. Y. Jiang, "Reverse engineering of gene regulatory networks using flexible neural tree models," Neurocomputing, vol. 99, pp. 458-466, 2013.
[51]
Y. Chen, B. Yang, J. Dong, and A. Abraham, "Time-series forecasting using flexible neural tree model," Inf. Sci., vol. 174, pp. 219-235, 2005.
[52]
K. C. Chou and C. T. Zhang, "Predicting of protein structural class," Critical Rev. Biochemistry Mol. Biol., vol. 30, pp. 275-349, 1995.
[53]
K. C. Chou and H. B. Shen, "Recent progress in protein subcellular location prediction," Anal. Biochemistry, vol. 370, pp. 1-16, 2007.
[54]
H. B. Shen and K. C. Chou, "Predicting protein subnuclear location with optimized evidence-theoretic K-nearest classifier and pseudo amino acid composition," Biochemical Biophys. Res. Commun., vol. 337, pp. 752-756, 2005.
[55]
Y. S. Ding, T. L. Zhang, and K. C. Chou, "Prediction of protein structure classes with pseudo amino acid composition and fuzzy support vector machine network," Protein Peptide Lett., vol. 14, pp. 811-815, 2007.
[56]
D. S. Huang, Systematic Theory of Neural Networks for Pattern Recognition (in Chinese), Publishing House of Electronic Industry of China, May 1996.
[57]
D. S. Huang, "Radial basis probabilistic neural networks: Model and application," Int. J. Pattern Recog. Artif. Intell., vol. 13, no. 7, pp. 1083-1101, 1999.
[58]
D. S. Huang and C.-H. Zheng, "Independent component analysis based penalized discriminant method for tumor classification using gene expression data," Bioinf., vol. 22, no. 15, pp. 1855-1862, 2006.
[59]
D. S. Huang and J.-X. Du, "A constructive hybrid structure optimization methodology for radial basis probabilistic neural networks," IEEE Trans. Neural Netw., vol. 19, no. 12, pp. 2099- 2115, 2008.

Cited By

View all
  • (2024)Protein acetylation sites with complex-valued polynomial modelFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-023-2640-918:3Online publication date: 1-Jun-2024
  • (2021)The Value and Clinical Significance of Tumor Marker Detection in Cervical CancerScientific Programming10.1155/2021/66437822021Online publication date: 1-Jan-2021
  • (2021)LncRNA NKILA Promotes Cardiomyocytes Apoptosis by Targeting miR22-3p-TXNIP Signal Axis to Inhibit Proliferation, Migration, and Invasion of Cardiomyocytes under High Glucose-Induced ConditionScientific Programming10.1155/2021/66268452021Online publication date: 1-Jan-2021
  • Show More Cited By
  1. Classification of Protein Structure Classes on Flexible Neutral Tree

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image IEEE/ACM Transactions on Computational Biology and Bioinformatics
      IEEE/ACM Transactions on Computational Biology and Bioinformatics  Volume 14, Issue 5
      September 2017
      202 pages

      Publisher

      IEEE Computer Society Press

      Washington, DC, United States

      Publication History

      Published: 01 September 2017
      Published in TCBB Volume 14, Issue 5

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 12 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Protein acetylation sites with complex-valued polynomial modelFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-023-2640-918:3Online publication date: 1-Jun-2024
      • (2021)The Value and Clinical Significance of Tumor Marker Detection in Cervical CancerScientific Programming10.1155/2021/66437822021Online publication date: 1-Jan-2021
      • (2021)LncRNA NKILA Promotes Cardiomyocytes Apoptosis by Targeting miR22-3p-TXNIP Signal Axis to Inhibit Proliferation, Migration, and Invasion of Cardiomyocytes under High Glucose-Induced ConditionScientific Programming10.1155/2021/66268452021Online publication date: 1-Jan-2021
      • (2021)Automatic 3D Pollen Recognition Based on Convolutional Neural NetworkScientific Programming10.1155/2021/55773072021Online publication date: 1-Jan-2021
      • (2021)Deep Neural Network for Somatic Mutation ClassificationScientific Programming10.1155/2021/55292022021Online publication date: 1-Jan-2021
      • (2021)The Role of Endothelial Mesenchymal Transformation on Infantile HemangiomaScientific Programming10.1155/2021/55110952021Online publication date: 1-Jan-2021
      • (2020)CirBiTreeScientific Programming10.1155/2020/88476942020Online publication date: 28-Nov-2020
      • (2020)A Robust Authentication Algorithm for Medical Images Based on Fractal Brownian Model and Visual CryptographyScientific Programming10.1155/2020/66425862020Online publication date: 4-Dec-2020
      • (2020)Classification of Protein Modification Sites with Machine LearningIntelligent Computing Theories and Application10.1007/978-3-030-60802-6_38(432-445)Online publication date: 2-Oct-2020
      • (2019)Predicting subcellular localization of multisite proteins using differently weighted multi-label k-nearest neighbors setsTechnology and Health Care10.3233/THC-19901827:S1(185-193)Online publication date: 1-Jan-2019
      • Show More Cited By

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media