Nothing Special   »   [go: up one dir, main page]

Information and Media Technologies
Online ISSN : 1881-0896
ISSN-L : 1881-0896
Computing
Statistical Mechanics of On-line Node-perturbation Learning
Kazuyuki Hara Brain Science Institute, RIKEN
Graduate School of Frontier Science, The University of Tokyo">Kentaro Katahira
Brain Science Institute, RIKEN">Kazuo Okanoya Brain Science Institute, RIKEN
Graduate School of Frontier Science, The University of Tokyo">Masato Okada
Author information
JOURNAL FREE ACCESS

2011 Volume 6 Issue 2 Pages 352-361

Details
Abstract

Node-perturbation learning (NP-learning) is a kind of statistical gradient descent algorithm that estimates the gradient of an objective function through application of a small perturbation to the outputs of the network. It can be applied to problems where the objective function is not explicitly formulated, including reinforcement learning. In this paper, we show that node-perturbation learning can be formulated as on-line learning in a linear perceptron with noise, and we can derive the differential equations of order parameters and the generalization error in the same way as for the analysis of learning in a linear perceptron through statistical mechanical methods. From analytical results, we show that cross-talk noise, which originates in the error of the other outputs, increases the generalization error as the output number increases.

Content from these authors
© 2011 Information Processing Society of Japan
Previous article Next article
feedback
Top