Cruz et al., 2017 - Google Patents

Path planning of multi-agent systems in unknown environment with neural kernel smoothing and reinforcement learning

Cruz et al., 2017

Document ID: 1095973740907034885
Author: Cruz D; Yu W
Publication year: 2017
Publication venue: Neurocomputing

External Links

Cited by

Snippet

Path planning is a basic task of robot navigation, especially for autonomous robots. It is more complex and difficult for multi-agent systems. The popular reinforcement learning method cannot solve the path planning problem directly in unknown environment. In this paper, the …

Continue reading at www.sciencedirect.com (other versions)

230000001537 neural 0 title abstract description 36

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N3/006—Artificial life, i.e. computers simulating life based on simulated virtual individual or collective life forms, e.g. single "avatar", social simulations, virtual worlds
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/002—Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models

Similar Documents

Publication	Publication Date	Title
Cruz et al.	2017	Path planning of multi-agent systems in unknown environment with neural kernel smoothing and reinforcement learning
Zhu et al.	2021	A survey of deep RL and IL for autonomous driving policy learning
Yao et al.	2020	Path planning method with improved artificial potential field—a reinforcement learning perspective
Hüttenrauch et al.	2019	Deep reinforcement learning for swarm systems
Jiang et al.	2017	A brief review of neural networks based learning and control and their applications for robots
Couceiro et al.	2012	A fuzzified systematic adjustment of the robotic Darwinian PSO
Luviano et al.	2017	Continuous-time path planning for multi-agents with fuzzy reinforcement learning
Guo et al.	2021	A fusion method of local path planning for mobile robots based on LSTM neural network and reinforcement learning
Azimirad et al.	2022	A consecutive hybrid spiking-convolutional (CHSC) neural controller for sequential decision making in robots
Yu et al.	2023	Hybrid attention-oriented experience replay for deep reinforcement learning and its application to a multi-robot cooperative hunting problem
Feng et al.	2011	Towards human-like social multi-agents with memetic automaton
Vatankhah et al.	2012	Adaptive critic-based neuro-fuzzy controller in multi-agents: Distributed behavioral control and path tracking
Park et al.	2023	Quantum multi-agent reinforcement learning for autonomous mobility cooperation
Chen et al.	2014	Autonomous intelligent decision-making system based on Bayesian SOM neural network for robot soccer
Iwasa et al.	2021	Multi-scale batch-learning growing neural gas for topological feature extraction in navigation of mobility support robots
Akbari et al.	2023	Role Engine Implementation for a Continuous and Collaborative Multirobot System
Vatankhah et al.	2012	Active leading through obstacles using ant-colony algorithm
Etemadi et al.	2012	Leader connectivity management and flocking velocity optimization using the particle swarm optimization method
Fernandez-Gauna et al.	2013	Undesired state-action prediction in multi-agent reinforcement learning for linked multi-component robotic system control
Daglarli et al.	2009	Behavioral task processing for cognitive robots using artificial emotions
Yang et al.	2022	Automatic synthesizing multi-robot cooperation strategies based on Brain Storm Robotics
Baxter et al.	2009	Shared Potential Fields and their place in a multi-robot co-ordination taxonomy
Song et al.	2023	Learning disentangled skills for hierarchical reinforcement learning through trajectory autoencoder with weak labels
Cruz et al.	2014	Multi-agent path planning in unknown environment with reinforcement learning and neural network
Kim et al.	2015	Inference of other’s internal neural models from active observation