Levine et al., 2016

End-to-end training of deep visuomotor policies

Document ID: 4721631804326015979
Author: Levine S; Finn C; Darrell T; Abbeel P
Publication year: 2016
Publication venue: Journal of Machine Learning Research

Snippet

For spline regressions, it is well known that the choice of knots is crucial for the performance of the estimator. As a general learning framework covering the smoothing splines, learning in a Reproducing Kernel Hilbert Space (RKHS) has a similar issue. However, the selection …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6288—Fusion techniques, i.e. combining data from various sources, e.g. sensor fusion
- G06K9/629—Fusion techniques, i.e. combining data from various sources, e.g. sensor fusion of extracted features
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships

Similar Documents

Publication	Publication Date	Title
Levine et al.	2016	End-to-end training of deep visuomotor policies
Finn et al.	2017	Deep visual foresight for planning robot motion
Smith et al.	2019	AVID: Learning multi-stage tasks via pixel-level translation of human videos
Das et al.	2021	Model-based inverse reinforcement learning from visual demonstrations
Sivakumar et al.	2022	Robotic telekinesis: Learning a robotic hand imitator by watching humans on YouTube
Finn et al.	2016	Deep spatial autoencoders for visuomotor learning
Arduengo et al.	2021	Robust and adaptive door operation with a mobile robot
Chebotar et al.	2017	Path integral guided policy search
Finn et al.	2015	Learning visual feature spaces for robotic manipulation with deep spatial autoencoders
Kumar et al.	2016	Learning dexterous manipulation policies from experience and imitation
Neumann et al.	2013	Neural learning of stable dynamical systems based on data-driven Lyapunov candidates
Akgun et al.	2016	Simultaneously learning actions and goals from demonstration
Rahmatizadeh et al.	2016	Learning real manipulation tasks from virtual demonstrations using LSTM
Bischoff et al.	2014	Policy search for learning robot control using sparse data
Droniou et al.	2014	Learning a repertoire of actions with deep neural networks
Nematollahi et al.	2022	Robot skill adaptation via soft actor-critic Gaussian mixture models
Chen et al.	2018	A probabilistic framework for uncertainty-aware high-accuracy precision grasping of unknown objects
CN117798919A (en)	2024-04-02	A dexterous manipulator grasping method based on dynamic interaction representation
Hu et al.	2023	REBOOT: Reuse data for bootstrapping efficient real-world dexterous manipulation
Tee et al.	2022	A framework for tool cognition in robots without prior tool learning or observation
Oliva et al.	2022	Graph neural networks for relational inductive bias in vision-based deep reinforcement learning of robot control
Liu et al.	2019	Learning articulated constraints from a one-shot demonstration for robot manipulation planning
Tobin	2019	Real-world robotic perception and control using synthetic data
Mosbach et al.	2022	Efficient representations of object geometry for reinforcement learning of interactive grasping policies
Paul et al.	2017	Deterministic policy gradient based robotic path planning with continuous action spaces