Abstract
We consider an integrated approach to designing the classification rule, one that merges the strengths of statistical and neural-network methods. Instead of using multivariate models and statistical methods directly to design the classifier, we use them to whiten the data and then train a perceptron. Special attention is paid to the magnitudes of the weights and to the optimization of the training procedure. We study how the result is influenced by all characteristics of the cost function (target values, conventional regularization parameters) and by the parameters of the optimization method (learning step, starting weights, and noise injected into the training vectors, the targets, and the weights). Some of the complexity-control methods considered here have received little attention in the literature so far.
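To make the two stages of the approach concrete, the following is a minimal sketch (not the paper's own code; all parameter values, variable names, and the toy data are illustrative assumptions): the inputs are whitened using a statistical estimate of the covariance matrix, and a single-layer perceptron is then trained by gradient descent, with complexity controlled through small starting weights, a weight-decay term, and noise injection into the training vectors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy two-class Gaussian data (illustrative, not from the paper).
n, d = 200, 5
X = np.vstack([rng.normal(0.0, 1.0, (n, d)),
               rng.normal(1.0, 1.0, (n, d))])
y = np.hstack([-np.ones(n), np.ones(n)])  # targets in {-1, +1}

# Stage 1 -- whitening: centre the data and decorrelate it with the
# inverse square root of the pooled sample covariance matrix.
mean = X.mean(axis=0)
cov = np.cov(X - mean, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)
W_white = eigvecs @ np.diag(eigvals ** -0.5) @ eigvecs.T
Z = (X - mean) @ W_white

# Stage 2 -- single-layer perceptron trained by gradient descent on a
# sum-of-squares cost with weight decay.
w = rng.normal(0.0, 0.01, d)          # small starting weights
b = 0.0
eta, decay, sigma = 0.05, 1e-3, 0.1   # learning step, weight decay, jitter

for epoch in range(100):
    # Noise injection into the training vectors ("jitter").
    Zn = Z + rng.normal(0.0, sigma, Z.shape)
    out = np.tanh(Zn @ w + b)
    err = out - y
    # Gradient of the mean squared error through the tanh activation,
    # plus the weight-decay term.
    grad_w = (err * (1 - out ** 2)) @ Zn / len(y) + decay * w
    grad_b = (err * (1 - out ** 2)).mean()
    w -= eta * grad_w
    b -= eta * grad_b

print(f"training accuracy: {np.mean(np.sign(Z @ w + b) == y):.3f}")
```

Each of the hyperparameters above (eta, decay, sigma, the starting-weight scale, and the number of epochs) corresponds to one of the complexity-control knobs the abstract enumerates; their values here are arbitrary choices for the sketch.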
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
Cite this paper
Raudys, Š. (2000). Classifier’s Complexity Control while Training Multilayer Perceptrons. In: Ferri, F.J., Iñesta, J.M., Amin, A., Pudil, P. (eds) Advances in Pattern Recognition. SSPR/SPR 2000. Lecture Notes in Computer Science, vol 1876. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44522-6_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67946-2
Online ISBN: 978-3-540-44522-7