Probabilistic robust linear quadratic regulators with Gaussian processes
von Rohr et al., 2021
- Document ID
- 3466008142719384134
- Author
- von Rohr A
- Neumann-Brosig M
- Trimpe S
- Publication year
- 2021
- Publication venue
- Learning for Dynamics and Control
Snippet
Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design. While learning-based control has the potential to yield superior performance in demanding applications …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0205—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric not using a model or a simulator of the controlled system
- G05B13/024—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric not using a model or a simulator of the controlled system in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B23/00—Testing or monitoring of control systems or parts thereof
- G05B23/02—Electric testing or monitoring
- G05B23/0205—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
- G05B23/0218—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Buisson-Fenet et al. | Actively learning Gaussian process dynamics | |
Wabersich et al. | Linear model predictive safety certification for learning-based control | |
von Rohr et al. | Probabilistic robust linear quadratic regulators with Gaussian processes | |
Fujimoto et al. | Off-policy deep reinforcement learning without exploration | |
Matni et al. | From self-tuning regulators to reinforcement learning and back again | |
Curi et al. | Efficient model-based reinforcement learning through optimistic policy search and planning | |
Lale et al. | Logarithmic regret bound in partially observable linear dynamical systems | |
Chen et al. | Generalization bounds for meta-learning: An information-theoretic analysis | |
Kallus et al. | Statistically efficient off-policy policy gradients | |
Mern et al. | Bayesian optimized Monte Carlo planning | |
Sinha et al. | Adaptive robust model predictive control with matched and unmatched uncertainty | |
Lale et al. | Model learning predictive control in nonlinear dynamical systems | |
Bonzanini et al. | Safe learning-based model predictive control under state- and input-dependent uncertainty using scenario trees | |
Paulson et al. | A tutorial on derivative-free policy learning methods for interpretable controller representations | |
Guzman et al. | Heteroscedastic Bayesian optimisation for stochastic model predictive control | |
Yang et al. | A behavior regularized implicit policy for offline reinforcement learning | |
Jiang et al. | Safe learning for uncertainty-aware planning via interval MDP abstraction | |
Xin et al. | Learning dynamical systems by leveraging data from similar systems | |
Kudva et al. | Efficient robust global optimization for simulation-based problems using decomposed Gaussian processes: Application to MPC calibration | |
Nguyen et al. | Robust control theory based stability certificates for neural network approximated nonlinear model predictive control | |
Keivan et al. | Model-free μ synthesis via adversarial reinforcement learning | |
Usmanova et al. | Safe non-smooth black-box optimization with application to policy search | |
Baggio et al. | Finite-sample guarantees for state-space system identification under full state measurements | |
Karg et al. | Guaranteed safe control of systems with parametric uncertainties via neural network controllers | |
Gros et al. | Towards safe reinforcement learning using NMPC and policy gradients: Part I - stochastic case |