De Lellis et al., 2023 - Google Patents
CT-DQN: Control-tutored deep reinforcement learning
- Document ID
- 14781752081995401367
- Author
- De Lellis F
- Coraggio M
- Russo G
- Musolesi M
- di Bernardo M
- Publication year
- 2023
- Publication venue
- Learning for Dynamics and Control Conference
Snippet
One of the major challenges in Deep Reinforcement Learning for control is the need for extensive training to learn the policy. Motivated by this, we present the design of the Control-Tutored Deep Q-Networks (CT-DQN) algorithm, a Deep Reinforcement Learning algorithm …
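The snippet describes an algorithm in which a model-based "control tutor" assists a Deep Q-Network to cut down training time. A minimal sketch of what such tutored action selection could look like is given below; the function names (`tutor_policy`, `ct_dqn_action`), the proportional feedback law, and the `tutor_prob` parameter are all illustrative assumptions, not the paper's actual implementation.

```python
import random

import numpy as np


def tutor_policy(state):
    # Hypothetical model-based tutor: a simple feedback law mapping the
    # sign of a 1-D state error to one of two discrete actions.
    # Any real control tutor would be problem-specific.
    return 0 if state[0] > 0.0 else 1


def greedy_q_action(q_values):
    # Exploit: pick the action with the highest estimated Q-value.
    return int(np.argmax(q_values))


def ct_dqn_action(state, q_values, epsilon=0.1, tutor_prob=0.5):
    # Sketched control-tutored epsilon-greedy selection: when exploring
    # (probability epsilon), defer to the tutor with probability
    # tutor_prob instead of acting uniformly at random; otherwise
    # exploit the learned Q-network.
    if random.random() < epsilon:
        if random.random() < tutor_prob:
            return tutor_policy(state)          # tutor-suggested action
        return random.randrange(len(q_values))  # plain random exploration
    return greedy_q_action(q_values)            # learned greedy action
```

The intuition is that tutor-guided exploration visits higher-reward regions of the state space earlier than uniform random exploration would, which is consistent with the snippet's stated goal of reducing training effort.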
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
Similar Documents
Publication | Title
---|---
Wabersich et al. | Linear model predictive safety certification for learning-based control
Yu et al. | Mopo: Model-based offline policy optimization
Dai et al. | Lyapunov-stable neural-network control
Ni et al. | Adaptive learning in tracking control based on the dual critic network design
Liu et al. | Meta-reward-net: Implicitly differentiable reward learning for preference-based reinforcement learning
Pan et al. | Model reference composite learning control without persistency of excitation
Xu et al. | Knowledge transfer in multi-task deep reinforcement learning for continuous control
CN111008449A | Acceleration method for deep reinforcement learning deduction decision training in battlefield simulation environment
Dumlu | Design of a fractional-order adaptive integral sliding mode controller for the trajectory tracking control of robot manipulators
Joshi et al. | Design and flight evaluation of deep model reference adaptive controller
Ure et al. | Health aware planning under uncertainty for uav missions with heterogeneous teams
Behzadan et al. | The faults in our pi stars: Security issues and open challenges in deep reinforcement learning
De Lellis et al. | CT-DQN: Control-tutored deep reinforcement learning
Zhang et al. | Model-Free Attitude Control of Spacecraft Based on PID-Guide TD3 Algorithm
Krishnakumar et al. | Hybrid fuzzy logic flight controller synthesis via pilot modeling
Zheng et al. | Adaptive policy learning for offline-to-online reinforcement learning
Alsalti et al. | Data-driven nonlinear predictive control for feedback linearizable systems
Gu et al. | A human-centered safe robot reinforcement learning framework with interactive behaviors
Liu et al. | How to guide your learner: Imitation learning with active adaptive expert involvement
Long et al. | Online Optimal Control of Robotic Systems with Single Critic NN-Based Reinforcement Learning
Zhang et al. | Research on adaptive non-singular fast terminal sliding mode control based on variable exponential power reaching law in manipulators
Pacelli et al. | Robust control under uncertainty via bounded rationality and differential privacy
Xian et al. | Control of quadrotor robot via optimized nonlinear type-2 fuzzy fractional PID with fractional filter: Theory and experiment
Liu et al. | Design of an interval type-2 fuzzy neural network sliding mode robust controller for higher stability of magnetic spacecraft attitude control
De Lellis et al. | Control-tutored reinforcement learning: Towards the integration of data-driven and model-based control