Abstract
We present a method for learning and selecting a succinct multi-layer perceptron with shared weights. Under weight sharing, each weight is constrained to take one of a small set of common values; a common value near zero can be eliminated, which amounts to weight pruning. Our method iteratively merges and splits common weights based on second-order criteria, escaping local optima through this bidirectional clustering. It also selects the optimal number of hidden units by cross-validation. Experiments showed that the proposed method perfectly restores the original sharing structure on an artificial data set and finds a small number of common weights on a real data set.
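To make the weight-sharing idea concrete, here is a minimal sketch that clusters an MLP's weights into a few common values with plain 1-D k-means and snaps near-zero common values to zero (weight pruning). This is an illustrative assumption, not the paper's algorithm: the authors merge and split common weights using second-order criteria, whereas the function name `share_weights`, the quantile initialization, and the `prune_tol` threshold below are all hypothetical choices.

```python
import numpy as np

def share_weights(weights, n_common=3, n_iter=50, prune_tol=0.05):
    """Cluster weights into a few shared values (plain 1-D k-means).

    Illustrative only: the paper merges/splits common weights with
    second-order criteria; here we simply assign each weight to the
    nearest of `n_common` shared values and prune near-zero ones.
    """
    w = np.asarray(weights, dtype=float)
    # Initialize the common values at evenly spaced quantiles of w.
    centers = np.quantile(w, np.linspace(0.0, 1.0, n_common))
    labels = np.zeros(len(w), dtype=int)
    for _ in range(n_iter):
        # Assignment step: nearest common value for every weight.
        labels = np.argmin(np.abs(w[:, None] - centers[None, :]), axis=1)
        # Update step: each common value becomes the mean of its members.
        for k in range(n_common):
            if np.any(labels == k):
                centers[k] = w[labels == k].mean()
    # Weight pruning: a common value near zero is eliminated (set to 0).
    centers[np.abs(centers) < prune_tol] = 0.0
    return centers[labels], centers

# Toy usage: 12 weights drawn around three underlying common values.
rng = np.random.default_rng(0)
raw = np.concatenate([rng.normal(-0.8, 0.05, 4),
                      rng.normal(0.0, 0.01, 4),
                      rng.normal(1.2, 0.05, 4)])
shared, centers = share_weights(raw)
print("recovered common values:", np.round(centers, 3))
```

On data like this toy example, the middle cluster collapses onto a value below `prune_tol` and is zeroed out, leaving two effective common weights, which mirrors the pruning behavior the abstract describes.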
Cite this paper
Tanahashi, Y., Saito, K., Nakano, R. (2005). Model Selection and Weight Sharing of Multi-layer Perceptrons. In: Khosla, R., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2005. Lecture Notes in Computer Science, vol 3684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11554028_100
Print ISBN: 978-3-540-28897-8
Online ISBN: 978-3-540-31997-9