Park et al., 2023 - Google Patents

Quantum multi-agent reinforcement learning for autonomous mobility cooperation

Park et al., 2023

Document ID: 7466057673876270897
Author: Park S; Kim J; Park C; Jung S; Kim J
Publication year: 2023
Publication venue: IEEE Communications Magazine

External Links

Cited by

Snippet

For Industry 4.0 Revolution, cooperative autonomous mobility systems are widely used based on multi-agent reinforcement learning (MARL). However, the MARL-based algorithms suffer from huge parameter utilization and convergence difficulties with many agents. To …

Continue reading at arxiv.org (PDF) (other versions)

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/002—Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric

Similar Documents

Publication	Publication Date	Title
Park et al.	2023	Quantum multi-agent reinforcement learning for autonomous mobility cooperation
Cruz et al.	2017	Path planning of multi-agent systems in unknown environment with neural kernel smoothing and reinforcement learning
Kiumarsi et al.	2017	Optimal and autonomous control using reinforcement learning: A survey
Yan et al.	2017	Cloud robotics in smart manufacturing environments: Challenges and countermeasures
Yan et al.	2023	Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
Luviano et al.	2017	Continuous-time path planning for multi-agents with fuzzy reinforcement learning
Guo et al.	2021	A fusion method of local path planning for mobile robots based on LSTM neural network and reinforcement learning
He et al.	2022	Multiagent soft actor-critic based hybrid motion planner for mobile robots
Liu et al.	2023	Roboec2: A novel cloud robotic system with dynamic network offloading assisted by amazon ec2
Aguzzi et al.	2023	Field-informed reinforcement learning of collective tasks with graph neural networks
Atchade-Adelomou	2022	Quantum algorithms for solving hard constrained optimisation problems
Skulimowski	2016	Anticipatory control of vehicle swarms with virtual supervision
Chen et al.	2021	When shall i be empathetic? the utility of empathetic parameter estimation in multi-agent interactions
CN116009542A (en)	2023-04-25	Dynamic multi-agent coverage path planning method, device, equipment and storage medium
Li et al.	2019	Improving fast adaptation for newcomers in multi-robot reinforcement learning system
Meng et al.	2023	Learning-Based Risk-Bounded Path Planning Under Environmental Uncertainty
Nasir et al.	2016	Multi‐level decision making in hierarchical multi‐agent robotic search teams
Han et al.	2019	Robot path planning in dynamic environments based on deep reinforcement learning
Ferguson et al.	2023	Collaborative Decision-Making and the k-Strong Price of Anarchy in Common Interest Games
Zhang et al.	2021	[Retracted] A Novel Model‐Based Reinforcement Learning Attitude Control Method for Virtual Reality Satellite
Zhao et al.	2022	Path Planning Algorithm Based on A_star Algorithm and Q-Learning Algorithm
Dutta et al.	2024	Design and Simulation of Time-energy Optimal Anti-swing Trajectory Planner for Autonomous Tower Cranes
Lu et al.	2023	Disturbance-aware reinforcement learning for rejecting excessive disturbances
Cuenca Macas et al.	2022	Collision Avoidance Simulation Using Voronoi Diagrams in a Centralized System of Holonomic Multi-agents
Ding et al.	2022	Multiagent reinforcement learning for strictly constrained tasks based on Reward Recorder