Abstract
In this paper, a batch gradient algorithm with adaptive momentum is considered and a convergence theorem is presented for its use in training two-layer feedforward neural networks. Simple but sufficient conditions are offered to guarantee both weak and strong convergence. Compared with existing requirements, we do not restrict the error function to be quadratic or uniformly convex. A numerical example is supplied to illustrate the performance of the algorithm and to support our theoretical findings.
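To make the setting concrete, the sketch below shows one plausible realization in Python/NumPy: a two-layer feedforward network trained by full-batch gradient descent, with the momentum coefficient adapted each epoch from the alignment of successive gradients. The network sizes, the learning rate eta, the cap mu_max, and the alignment-based adaptation rule are illustrative assumptions, not the paper's exact scheme.

```python
import numpy as np

# Hypothetical sketch of batch gradient training with adaptive momentum for a
# two-layer network. The adaptation rule below (scaling the momentum term by
# how well successive gradients agree) is an illustrative assumption, not the
# exact scheme analyzed in the paper.

rng = np.random.default_rng(0)

# Toy regression data: n samples, d inputs, scalar target.
X = rng.normal(size=(100, 3))
y = np.tanh(X @ np.array([1.0, -2.0, 0.5]))

def forward(W, v, X):
    """Two-layer network: tanh hidden layer, linear output."""
    H = np.tanh(X @ W)
    return H, H @ v

def loss_and_grads(W, v, X, y):
    """Mean squared error over the whole batch and its gradients."""
    H, out = forward(W, v, X)
    err = out - y
    loss = 0.5 * np.mean(err ** 2)
    gv = H.T @ err / len(y)
    gW = X.T @ ((err[:, None] * v) * (1.0 - H ** 2)) / len(y)
    return loss, gW, gv

W = rng.normal(scale=0.1, size=(3, 5))
v = rng.normal(scale=0.1, size=5)
eta, mu_max = 0.1, 0.9          # learning rate and momentum cap (assumed)
dW = np.zeros_like(W); dv = np.zeros_like(v)
prev_gW = np.zeros_like(W); prev_gv = np.zeros_like(v)

for epoch in range(200):
    loss, gW, gv = loss_and_grads(W, v, X, y)
    # Adaptive momentum: large when the current gradient aligns with the
    # previous one, zero when they oppose (assumed rule for illustration).
    num = np.sum(gW * prev_gW) + np.sum(gv * prev_gv)
    den = np.sum(prev_gW ** 2) + np.sum(prev_gv ** 2) + 1e-12
    mu = float(np.clip(num / den, 0.0, mu_max))
    # Batch update with momentum term.
    dW = -eta * gW + mu * dW
    dv = -eta * gv + mu * dv
    W += dW; v += dv
    prev_gW, prev_gv = gW, gv
```

In this sketch the momentum coefficient mu is recomputed once per epoch from the full batch, which matches the batch (as opposed to online) character of the algorithm studied; the clipping to [0, mu_max] is a simple safeguard, not a condition from the paper.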
Cite this article
Shao, H., Xu, D. & Zheng, G. Convergence of a Batch Gradient Algorithm with Adaptive Momentum for Neural Networks. Neural Process Lett 34, 221–228 (2011). https://doi.org/10.1007/s11063-011-9193-x