von Rohr et al., 2021 - Google Patents

Probabilistic robust linear quadratic regulators with Gaussian processes

von Rohr et al., 2021

Document ID: 3466008142719384134
Author: von Rohr A; Neumann-Brosig M; Trimpe S
Publication year: 2021
Publication venue: Learning for Dynamics and Control

External Links

Cited by

Snippet

Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design. While learning-based control has the potential to yield superior performance in demanding applications …

Continue reading at proceedings.mlr.press (PDF) (other versions)

238000000034 method 0 title abstract description 24

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0205—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric not using a model or a simulator of the controlled system
- G05B13/024—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric not using a model or a simulator of the controlled system in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B23/00—Testing or monitoring of control systems or parts thereof
- G05B23/02—Electric testing or monitoring
- G05B23/0205—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
- G05B23/0218—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes

Similar Documents

Publication	Publication Date	Title
Buisson-Fenet et al.	2020	Actively learning gaussian process dynamics
Wabersich et al.	2018	Linear model predictive safety certification for learning-based control
von Rohr et al.	2021	Probabilistic robust linear quadratic regulators with Gaussian processes
Fujimoto et al.	2019	Off-policy deep reinforcement learning without exploration
Matni et al.	2019	From self-tuning regulators to reinforcement learning and back again
Curi et al.	2020	Efficient model-based reinforcement learning through optimistic policy search and planning
Lale et al.	2020	Logarithmic regret bound in partially observable linear dynamical systems
Chen et al.	2021	Generalization bounds for meta-learning: An information-theoretic analysis
Kallus et al.	2020	Statistically efficient off-policy policy gradients
Mern et al.	2021	Bayesian optimized monte carlo planning
Sinha et al.	2022	Adaptive robust model predictive control with matched and unmatched uncertainty
Lale et al.	2021	Model learning predictive control in nonlinear dynamical systems
Bonzanini et al.	2020	Safe learning-based model predictive control under state-and input-dependent uncertainty using scenario trees
Paulson et al.	2023	A tutorial on derivative-free policy learning methods for interpretable controller representations
Guzman et al.	2020	Heteroscedastic bayesian optimisation for stochastic model predictive control
Yang et al.	2022	A behavior regularized implicit policy for offline reinforcement learning
Jiang et al.	2022	Safe learning for uncertainty-aware planning via interval MDP abstraction
Xin et al.	2023	Learning dynamical systems by leveraging data from similar systems
Kudva et al.	2022	Efficient robust global optimization for simulation-based problems using decomposed Gaussian processes: Application to MPC calibration
Nguyen et al.	2021	Robust control theory based stability certificates for neural network approximated nonlinear model predictive control
Keivan et al.	2022	Model-free μ synthesis via adversarial reinforcement learning
Usmanova et al.	2020	Safe non-smooth black-box optimization with application to policy search
Baggio et al.	2022	Finite-sample guarantees for state-space system identification under full state measurements
Karg et al.	2022	Guaranteed safe control of systems with parametric uncertainties via neural network controllers
Gros et al.	2019	Towards safe reinforcement learning using nmpc and policy gradients: Part i-stochastic case