Guo et al., 2023 - Google Patents

Optimal navigation for AGVs: A soft actor–critic-based reinforcement learning approach with composite auxiliary rewards

Guo et al., 2023

Document ID: 9579366893999816471
Author: Guo H; Ren Z; Lai J; Wu Z; Xie S
Publication year: 2023
Publication venue: Engineering Applications of Artificial Intelligence

External Links

Cited by

Snippet

In this paper, we address the problem of real-time navigation and obstacle avoidance for automated guided vehicles (AGVs) in dynamic environments, which is a primary research area in collaborative control systems for AGVs. To overcome the computational inefficiency …

Continue reading at www.sciencedirect.com (other versions)

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0268—Control of position or course in two dimensions specially adapted to land vehicles using internal positioning means
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0287—Control of position or course in two dimensions specially adapted to land vehicles involving a plurality of land vehicles, e.g. fleet or convoy travelling
- G05D1/0291—Fleet control
- G05D1/0295—Fleet control by at least one leading vehicle of the fleet
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0255—Control of position or course in two dimensions specially adapted to land vehicles using acoustic signals, e.g. ultra-sonic singals
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0212—Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0231—Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric

Similar Documents

Publication	Publication Date	Title
Chai et al.	2022	Design and experimental validation of deep reinforcement learning-based fast trajectory planning and control for mobile robot in unknown environment
Zhou et al.	2022	A review of motion planning algorithms for intelligent robots
Chiang et al.	2019	RL-RRT: Kinodynamic motion planning via learning reachability estimators from RL policies
Bansal et al.	2021	Deepreach: A deep learning approach to high-dimensional reachability
Mohanan et al.	2018	A survey of robotic motion planning in dynamic environments
Zhao et al.	2022	A novel direct trajectory planning approach based on generative adversarial networks and rapidly-exploring random tree
Zhuang et al.	2016	Efficient collision-free path planning for autonomous underwater vehicles in dynamic environments with a hybrid optimization algorithm
Grigorescu et al.	2019	Neurotrajectory: A neuroevolutionary approach to local state trajectory learning for autonomous vehicles
Guo et al.	2023	Optimal navigation for AGVs: A soft actor–critic-based reinforcement learning approach with composite auxiliary rewards
CN109597425B (en)	2021-10-26	Unmanned aerial vehicle navigation and obstacle avoidance method based on reinforcement learning
Sonny et al.	2023	Q-learning-based unmanned aerial vehicle path planning with dynamic obstacle avoidance
Ramezani et al.	2023	UAV path planning employing MPC-reinforcement learning method considering collision avoidance
Ma et al.	2022	Path planning of UUV based on HQPSO algorithm with considering the navigation error
Vallon et al.	2020	Data-driven hierarchical predictive learning in unknown environments
CN113485323A (en)	2021-10-08	Flexible formation method for cascaded multiple mobile robots
Xing et al.	2022	Robot path planner based on deep reinforcement learning and the seeker optimization algorithm
Cheng et al.	2024	A cross-platform deep reinforcement learning model for autonomous navigation without global information in different scenes
Claviere et al.	2019	Trajectory tracking control for robotic vehicles using counterexample guided training of neural networks
Lakhal et al.	2022	Safe and adaptive autonomous navigation under uncertainty based on sequential waypoints and reachability analysis
Löppenberg et al.	2024	Dynamic robot routing optimization: State–space decomposition for operations research-informed reinforcement learning
Qiu	2020	Multi-agent navigation based on deep reinforcement learning and traditional pathfinding algorithm
Kollar et al.	2006	Using reinforcement learning to improve exploration trajectories for error minimization
Xin et al.	2024	Long Short‐Term Memory‐Based Multi‐Robot Trajectory Planning: Learn from MPCC and Make It Better
Belker et al.	2002	Learning action models for the improved execution of navigation plans
Kashyap et al.	2023	Modified type-2 fuzzy controller for intercollision avoidance of single and multi-humanoid robots in complex terrains