Protein Sequence Classification Using Probabilistic Motifs and Neural Networks

Konstantinos Blekas^7,8,
Dimitrios I. Fotiadis^7,8 &
Aristidis Likas^7,8

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2714))

Included in the following conference series:

1279 Accesses
2 Citations

Abstract

The basic issue concerning the construction of neural network systems for protein classification is the sequence encoding scheme that must be used in order to feed the network. To deal with this problem we propose a method that maps a protein sequence into a numerical feature space using the matching local scores of the sequence to groups of conserved patterns (called motifs). We consider two alternative schemes for discovering a group of D motifs within a set of K-class sequences. We also evaluate the impact of the background features (2-grams) to the performance of the neural system. Experimental results on real datasets indicate that the proposed method is superior to other known protein classification approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Protein Sequence Classification Based on N-Gram and K-Nearest Neighbor Algorithm

Protein Classification Workflow

Binary Classification of Proteins by a Machine Learning Approach

References

Hughey R. and Krogh A. Hidden Markov models for sequence analysis: Extension and analysis of the basic method. CABIOS, 12(2):95–107, 1996.
Google Scholar
Wang J.T.L., Ma Q., Shasha D., and Wu C.H. New techniques for extracting features from protein sequences. IBM: Systems Journal, 40(2):426–441, 2001.
Article Google Scholar
Bréjova B., DiMarco C., Vinař T., Hidalgo S.R., Holguin G., and Patten C. Finding patterns in biological sequences. Project Report for CS798g, University of Waterloo, 2000.
Google Scholar
Ma Q. and Wang J.T.L. Application of Bayesian neural networks to protein sequence classification. In ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pages 305–309, Boston, MA, USA, Aug 2000.
Google Scholar
Bailey T.L. and Gribskov M. Combining evidence using p-values: application to sequence homology searches. Bioinformatics, 14:48–54, 1998.
Article Google Scholar
Bailey T.L. and Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Second International Conference on Intelligent Systems for Molecular Biology, pages 28–36, Menlo Park, California, 1994. AAAI Press.
Google Scholar
MacKay D.J.C. Bayesian interpolation. Neural Computation, 4:415–447, 1992.
Article Google Scholar
Foresse F.D. and Hagan M.T. Gauss-Newton approximation to Bayesian regularization. In Proceedings of the 1997 International Joint Conference on Neural Network, pages 1930–1935, 1997.
Google Scholar
Bishop C.M. Neural Networks for Pattern Recognition. Oxford Univ. Press Inc., New York, 1995.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Ioannina, 45110, Ioannina, Greece
Konstantinos Blekas, Dimitrios I. Fotiadis & Aristidis Likas
Biomedical Research Institute, FORTH — Hellas, 45110, Ioannina, Greece
Konstantinos Blekas, Dimitrios I. Fotiadis & Aristidis Likas

Authors

Konstantinos Blekas
View author publications
You can also search for this author in PubMed Google Scholar
Dimitrios I. Fotiadis
View author publications
You can also search for this author in PubMed Google Scholar
Aristidis Likas
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Bogazici University, Bebek, 34342, Istanbul, Turkey
Okyay Kaynak & Ethem Alpaydin &
Laboratory of Computer and Information Science, Helsinki University of Technology, P.O.B. 5400, 02015, Finland
Erkki Oja
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong
Lei Xu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Blekas, K., Fotiadis, D.I., Likas, A. (2003). Protein Sequence Classification Using Probabilistic Motifs and Neural Networks. In: Kaynak, O., Alpaydin, E., Oja, E., Xu, L. (eds) Artificial Neural Networks and Neural Information Processing — ICANN/ICONIP 2003. ICANN ICONIP 2003 2003. Lecture Notes in Computer Science, vol 2714. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44989-2_84

Download citation

DOI: https://doi.org/10.1007/3-540-44989-2_84
Published: 18 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40408-8
Online ISBN: 978-3-540-44989-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Protein Sequence Classification Using Probabilistic Motifs and Neural Networks

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Protein Sequence Classification Based on N-Gram and K-Nearest Neighbor Algorithm

Protein Classification Workflow

Binary Classification of Proteins by a Machine Learning Approach

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Protein Sequence Classification Using Probabilistic Motifs and Neural Networks

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Protein Sequence Classification Based on N-Gram and K-Nearest Neighbor Algorithm

Protein Classification Workflow

Binary Classification of Proteins by a Machine Learning Approach

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation