Wang et al., 2018 - Google Patents

A boosting-based deep neural networks algorithm for reinforcement learning

Document ID
8477076843640674919
Author
Wang Y
Jin H
Publication year
2018
Publication venue
2018 Annual American Control Conference (ACC)

Snippet

In this paper, a new boosting-based deep neural networks algorithm is designed for improving the performance of model-free reinforcement learning structures. Based on theoretical proof and performance analysis, it is going to demonstrate that the new approach …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/08 Learning methods
    • G06N3/082 Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/04 Architectures, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/06 Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00 Subject matter not provided for in other groups of this subclass
    • G06N99/005 Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/12 Computer systems based on biological models using genetic models
    • G06N3/126 Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computer systems utilising knowledge based models
    • G06N5/04 Inference methods or devices
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00 Computer systems based on specific mathematical models
    • G06N7/02 Computer systems based on specific mathematical models using fuzzy logic
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00 Computer systems based on specific mathematical models
    • G06N7/005 Probabilistic networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computer systems utilising knowledge based models
    • G06N5/02 Knowledge representation
    • G06N5/022 Knowledge engineering, knowledge acquisition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50 Computer-aided design

Similar Documents

Publication Publication Date Title
Wang et al. A boosting-based deep neural networks algorithm for reinforcement learning
Duan et al. Distributional soft actor-critic: Off-policy reinforcement learning for addressing value estimation errors
Wang A new concept using LSTM Neural Networks for dynamic system identification
Pérez-Sánchez et al. A review of adaptive online learning for artificial neural networks
Jiang et al. A brief review of neural networks based learning and control and their applications for robots
Er et al. Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning
Juang et al. A locally recurrent fuzzy neural network with support vector regression for dynamic-system modeling
Tavoosi et al. Nonlinear system identification based on a self-organizing type-2 fuzzy RBFN
Yang et al. A novel self-constructing radial basis function neural-fuzzy system
Li et al. A new approach for chaotic time series prediction using recurrent neural network
Tavoosi et al. Stable ANFIS2 for nonlinear system identification
Qiao et al. A self-organizing RBF neural network based on distance concentration immune algorithm
Pillai et al. Extreme learning ANFIS for control applications
Millidge et al. A theoretical framework for inference and learning in predictive coding networks
Abiyev et al. Identification and control of dynamic plants using fuzzy wavelet neural networks
Wu et al. A functional neural fuzzy network for classification applications
Abiyev et al. Differential evaluation learning of fuzzy wavelet neural networks for stock price prediction
Ikemoto et al. Continuous deep Q-learning with a simulator for stabilization of uncertain discrete-time systems
Nikovski et al. Comparison of two learning networks for time series prediction
Du et al. A novel locally regularized automatic construction method for RBF neural models
Fischer Neural networks: a class of flexible non-linear models for regression and classification
Lindsay et al. A novel way of training a neural network with reinforcement learning and without back propagation
Wang et al. Fuzzy system modeling using linear distance rules
Lin et al. Design of a recurrent functional neural fuzzy network using modified differential evolution
Otadi Simulation and evaluation of second-order fuzzy boundary value problems