Wang et al., 2018 - Google Patents

A boosting-based deep neural networks algorithm for reinforcement learning

Wang et al., 2018

Document ID: 8477076843640674919
Author: Wang Y; Jin H
Publication year: 2018
Publication venue: 2018 Annual American Control Conference (ACC)

External Links

Cited by

Snippet

In this paper, a new boosting-based deep neural networks algorithm is designed for improving the performance of model-free reinforcement learning structures. Based on theoretical proof and performance analysis, it is going to demonstrate that the new approach …

Continue reading at campuspress.yale.edu (PDF) (other versions)

230000001537 neural 0 title abstract description 79

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/02—Computer systems based on specific mathematical models using fuzzy logic
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design

Similar Documents

Publication	Publication Date	Title
Wang et al.	2018	A boosting-based deep neural networks algorithm for reinforcement learning
Duan et al.	2021	Distributional soft actor-critic: Off-policy reinforcement learning for addressing value estimation errors
Wang	2017	A new concept using LSTM Neural Networks for dynamic system identification
Pérez-Sánchez et al.	2018	A review of adaptive online learning for artificial neural networks
Jiang et al.	2017	A brief review of neural networks based learning and control and their applications for robots
Er et al.	2004	Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning
Juang et al.	2010	A locally recurrent fuzzy neural network with support vector regression for dynamic-system modeling
Tavoosi et al.	2016	Nonlinear system identification based on a self-organizing type-2 fuzzy RBFN
Yang et al.	2013	A novel self-constructing radial basis function neural-fuzzy system
Li et al.	2016	A new approach for chaotic time series prediction using recurrent neural network
Tavoosi et al.	2016	Stable ANFIS2 for nonlinear system identification
Qiao et al.	2019	A self-organizing RBF neural network based on distance concentration immune algorithm
Pillai et al.	2014	Extreme learning ANFIS for control applications
Millidge et al.	2022	A theoretical framework for inference and learning in predictive coding networks
Abiyev et al.	2008	Identification and control of dynamic plants using fuzzy wavelet neural networks
Wu et al.	2011	A functional neural fuzzy network for classification applications
Abiyev et al.	2012	Differential evaluation learning of fuzzy wavelet neural networks for stock price prediction
Ikemoto et al.	2021	Continuous deep Q-learning with a simulator for stabilization of uncertain discrete-time systems
Nikovski et al.	2022	Comparison of two learning networks for time series prediction
Du et al.	2012	A novel locally regularized automatic construction method for RBF neural models
Fischer	2015	Neural networks: a class of flexible non-linear models for regression and classification
Lindsay et al.	2020	A novel way of training a neural network with reinforcement learning and without back propagation
Wang et al.	1999	Fuzzy system modeling using linear distance rules
Lin et al.	2011	Design of a recurrent functional neural fuzzy network using modified differential evolution
Otadi	2019	Simulation and evaluation of second-order fuzzy boundary value problems