Abstract
Cross-validation is a popular technique for model selection and evaluation. Its purpose is to estimate the generalization error as the mean error over the test folds. The typical recommendation for classification problems is to use ten-fold stratified cross-validation. In this paper, we perform a set of experiments to explore the characteristics of cross-validation when evaluating a Multilayer Perceptron (MLP) neural network. We test two variants of stratification, where the nonstandard one takes classwise data density into account in addition to pure class frequency. Based on the computational experiments, many common beliefs are challenged and some interesting conclusions are drawn.
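To make the standard procedure concrete, the following is a minimal sketch of plain class-frequency stratification, i.e. the fold assignment underlying ten-fold stratified cross-validation. It is an illustrative implementation, not the paper's actual experimental code, and the function name `stratified_folds` is our own; the paper's density-aware variant would additionally order samples within each class (e.g. by some density estimate) before distributing them, which is not shown here.

```python
import random
from collections import defaultdict

def stratified_folds(labels, k=10, seed=0):
    """Assign each sample index to one of k folds so that every fold
    approximately preserves the class proportions of the full dataset
    (plain class-frequency stratification)."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)            # randomize order within the class
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)  # deal samples round-robin over folds
    return folds

# With 50 samples per class and k=10, each fold gets 5 samples of each class.
folds = stratified_folds([0] * 50 + [1] * 50, k=10)
```

Each fold then serves once as the test set while the MLP is trained on the remaining k-1 folds; the cross-validation estimate is the mean test error over the k folds.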
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kärkkäinen, T. (2014). On Cross-Validation for MLP Model Evaluation. In: Fränti, P., Brown, G., Loog, M., Escolano, F., Pelillo, M. (eds) Structural, Syntactic, and Statistical Pattern Recognition. S+SSPR 2014. Lecture Notes in Computer Science, vol 8621. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44415-3_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44414-6
Online ISBN: 978-3-662-44415-3