Park et al., 2023 - Google Patents
Quantum multi-agent reinforcement learning for autonomous mobility cooperationPark et al., 2023
View PDF- Document ID
- 7466057673876270897
- Author
- Park S
- Kim J
- Park C
- Jung S
- Kim J
- Publication year
- Publication venue
- IEEE Communications Magazine
External Links
Snippet
For Industry 4.0 Revolution, cooperative autonomous mobility systems are widely used based on multi-agent reinforcement learning (MARL). However, the MARL-based algorithms suffer from huge parameter utilization and convergence difficulties with many agents. To …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/002—Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Park et al. | Quantum multi-agent reinforcement learning for autonomous mobility cooperation | |
Cruz et al. | Path planning of multi-agent systems in unknown environment with neural kernel smoothing and reinforcement learning | |
Kiumarsi et al. | Optimal and autonomous control using reinforcement learning: A survey | |
Yan et al. | Cloud robotics in smart manufacturing environments: Challenges and countermeasures | |
Yan et al. | Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach | |
Luviano et al. | Continuous-time path planning for multi-agents with fuzzy reinforcement learning | |
Guo et al. | A fusion method of local path planning for mobile robots based on LSTM neural network and reinforcement learning | |
He et al. | Multiagent soft actor-critic based hybrid motion planner for mobile robots | |
Liu et al. | Roboec2: A novel cloud robotic system with dynamic network offloading assisted by amazon ec2 | |
Aguzzi et al. | Field-informed reinforcement learning of collective tasks with graph neural networks | |
Atchade-Adelomou | Quantum algorithms for solving hard constrained optimisation problems | |
Skulimowski | Anticipatory control of vehicle swarms with virtual supervision | |
Chen et al. | When shall i be empathetic? the utility of empathetic parameter estimation in multi-agent interactions | |
CN116009542A (en) | Dynamic multi-agent coverage path planning method, device, equipment and storage medium | |
Li et al. | Improving fast adaptation for newcomers in multi-robot reinforcement learning system | |
Meng et al. | Learning-Based Risk-Bounded Path Planning Under Environmental Uncertainty | |
Nasir et al. | Multi‐level decision making in hierarchical multi‐agent robotic search teams | |
Han et al. | Robot path planning in dynamic environments based on deep reinforcement learning | |
Ferguson et al. | Collaborative Decision-Making and the k-Strong Price of Anarchy in Common Interest Games | |
Zhang et al. | [Retracted] A Novel Model‐Based Reinforcement Learning Attitude Control Method for Virtual Reality Satellite | |
Zhao et al. | Path Planning Algorithm Based on A_star Algorithm and Q-Learning Algorithm | |
Dutta et al. | Design and Simulation of Time-energy Optimal Anti-swing Trajectory Planner for Autonomous Tower Cranes | |
Lu et al. | Disturbance-aware reinforcement learning for rejecting excessive disturbances | |
Cuenca Macas et al. | Collision Avoidance Simulation Using Voronoi Diagrams in a Centralized System of Holonomic Multi-agents | |
Ding et al. | Multiagent reinforcement learning for strictly constrained tasks based on Reward Recorder |