Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
Can Cooperative Multi-Agent Reinforcement Learning Boost Automatic Web Testing? An Exploratory Study
ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software EngineeringPages 14–26https://doi.org/10.1145/3691620.3694983Reinforcement learning (RL)-based web GUI testing techniques have attracted significant attention in both academia and industry due to their ability to facilitate automatic and intelligent exploration of websites under test. Yet, the existing approaches ...
- ArticleSeptember 2024
Demand-Responsive Transport Dynamic Scheduling Optimization Based on Multi-agent Reinforcement Learning Under Mixed Demand
Artificial Neural Networks and Machine Learning – ICANN 2024Pages 356–368https://doi.org/10.1007/978-3-031-72341-4_24AbstractDemand-Responsive Transport (DRT) is an innovative mode of public transportation that focuses on individual passenger needs by offering customized transportation solutions. Most prior researches rely on historical passenger flow to generate static ...
- ArticleSeptember 2024
Reinforcement Learning-Based Cooperative Traffic Control System
AbstractUrban traffic congestion is an increasingly pressing issue and advanced solutions like intelligent traffic control systems are becoming unavoidable. This paper explores the application of reinforcement learning to enhance traffic flow and reduce ...
- research-articleAugust 2024
DyPS: Dynamic Parameter Sharing in Multi-Agent Reinforcement Learning for Spatio-Temporal Resource Allocation
KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 3128–3139https://doi.org/10.1145/3637528.3672052In large-scale metropolis, it is critical to efficiently allocate various resources such as electricity, medical care, and transportation to meet the living demands of citizens, according to the spatio-temporal distributions of resources and demands. ...
- research-articleAugust 2024
Rethinking Order Dispatching in Online Ride-Hailing Platforms
- Zhaoxing Yang,
- Haiming Jin,
- Guiyun Fan,
- Min Lu,
- Yiran Liu,
- Xinlang Yue,
- Hao Pan,
- Zhe Xu,
- Guobin Wu,
- Qun Li,
- Xiaotong Wang,
- Jiecheng Guo
KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 3863–3873https://doi.org/10.1145/3637528.3672028Achieving optimal order dispatching has been a long-standing challenge for online ride-hailing platforms. Early methods would make shortsighted matchings as they only consider order prices alone as the edge weights in the driver-order bipartite graph, ...
-
- research-articleAugust 2024
Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks
KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 3598–3609https://doi.org/10.1145/3637528.3671790Active Voltage Control (AVC) on the Power Distribution Networks (PDNs) aims to stabilize the voltage levels to ensure efficient and reliable operation of power systems. With the increasing integration of distributed energy resources, recent efforts have ...
- short-paperAugust 2024
Learning to Communicate Strategically for Efficient Collective Intelligence
NAIC '24: Proceedings of the 2024 SIGCOMM Workshop on Networks for AI ComputingPages 4–6https://doi.org/10.1145/3672198.3673795Learning to communicate (L2C) involves learning how, when, and with whom to communicate to enhance cooperation among agents under limited bandwidth. However, introducing L2C impedes the original learning tasks, resulting in slower learning and poor ...
- research-articleNovember 2024
EVDMARL: Efficient Value Decomposition-based Multi-Agent Reinforcement Learning with Domain-Randomization for Complex Analog Circuit Design Migration
- Handa Sun,
- Zhaori Bi,
- Wenning Jiang,
- Ye Lu,
- Changhao Yan,
- Fan Yang,
- Wenchuang Hu,
- Sheng-Guo Wang,
- Dian Zhou,
- Xuan Zeng
DAC '24: Proceedings of the 61st ACM/IEEE Design Automation ConferenceArticle No.: 284, Pages 1–6https://doi.org/10.1145/3649329.3656523Automated analog circuit design migration significantly alleviates the burden on designers in circuit sizing under various operating conditions. Conventional methods model the migration problem as black-box optimization, requiring excessive iterations of ...
- research-articleJune 2024
Optimizing Profitability of E-Scooter Sharing System via Battery-aware Recommendation
MOBISYS '24: Proceedings of the 22nd Annual International Conference on Mobile Systems, Applications and ServicesPages 575–587https://doi.org/10.1145/3643832.3661859In e-scooter sharing systems, users randomly select and use e-scooters based on inaccurate battery information. This simple rental policy leads to low profitability on two fronts. First, inaccurate battery information causes unexpected device shutdowns, ...
- research-articleJune 2024
A simulation and experimentation architecture for resilient cooperative multiagent reinforcement learning models operating in contested and dynamic environments
Cooperative multiagent reinforcement learning approaches are increasingly being used to make decisions in contested and dynamic environments, which tend to be wildly different from the environments used to train them. As such, there is a need for a more ...
- research-articleMay 2024
A Survey of Multi-Agent Deep Reinforcement Learning with Communication
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2845–2847Communication is an effective mechanism for coordinating the behaviors of multiple agents, broadening their views of the environment, and to support their collaborations. In the field of multi-agent deep reinforcement learning (MADRL), agents can improve ...
- research-articleMay 2024
Naphtha Cracking Center Scheduling Optimization using Multi-Agent Reinforcement Learning
- Sunghoon Hong,
- Deunsol Yoon,
- Whiyoung Jung,
- Jinsang Lee,
- Hyundam Yoo,
- Jiwon Ham,
- Suhyun Jung,
- Chanwoo Moon,
- Yeontae Jung,
- Kanghoon Lee,
- Woohyung Lim,
- Somin Jeon,
- Myounggu Lee,
- Sohui Hong,
- Jaesang Lee,
- Hangyoul Jang,
- Changhyun Kwak,
- Jeonghyeon Park,
- Changhoon Kang,
- Jungki Kim
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2806–2808The Naphtha Cracking Center (NCC) is central to petrochemical feedstock production through the intricate process. It consists of receipt stage for unloading naphtha, blending stage for mixing naphtha, and furnace stage for producing marketable products. ...
- research-articleMay 2024
Advancing Sample Efficiency and Explainability in Multi-Agent Reinforcement Learning
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2791–2793Multi-Agent Reinforcement Learning (MARL) holds promise for complex real-world applications but faces challenges in sample efficiency and policy explainability. My dissertation aims to address these critical barriers, advancing MARL towards more ...
- research-articleMay 2024
Cooperative Multi-Agent Reinforcement Learning in Convention Reliant Environments
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2773–2775There has been a substantial increase in interest in the field of Reinforcement Learning (RL), particularly that of using it to solve problems involving cooperation between many different agents, examples include self driving cars, robot assistants and ...
- research-articleMay 2024
Scaling up Cooperative Multi-agent Reinforcement Learning Systems
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2737–2739Cooperative multi-agent reinforcement learning methods aim to learn effective collaborative behaviours of multiple agents performing complex tasks. However, existing MARL methods are commonly proposed for fairly small-scale multi-agent benchmark problems,...
- extended-abstractMay 2024
Decentralized Competing Bandits in Many-to-One Matching Markets
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2603–2605Two-sided matching is a classic and well-studied problem. As the participants are usually not aware of the accurate preferences towards the other side, the model of competing bandits characterizes the process of learning uncertainty through interactions ...
- extended-abstractMay 2024
MATLight: Traffic Signal Coordinated Control Algorithm based on Heterogeneous-Agent Mirror Learning with Transformer
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2582–2584In order to better handle the issue of real-time multi-intersection traffic signal coordinated control, we expect that multi-agent decision-making can benefit from the advantages of large sequence models. In this paper, we propose a method for multi-...
- extended-abstractMay 2024
Fairness and Cooperation between Independent Reinforcement Learners through Indirect Reciprocity
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2468–2470In a multi-agent setting, altruistic cooperation is costly yet socially desirable. As such, agents adapting through independent reinforcement learning struggle to converge to efficient, cooperative policies. Indirect reciprocity (IR) constitutes a ...
- extended-abstractMay 2024
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
- Pedro P. Santos,
- Diogo S. Carvalho,
- Miguel Vasco,
- Alberto Sardinha,
- Pedro A. Santos,
- Ana Paiva,
- Francisco S. Melo
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2453–2455We introduce hybrid execution in multi-agent reinforcement learning (MARL), a new paradigm in which agents aim to successfully complete cooperative tasks with arbitrary communication levels at execution time by taking advantage of information-sharing ...
- extended-abstractMay 2024
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
- Alexander Rutherford,
- Benjamin Ellis,
- Matteo Gallici,
- Jonathan Cook,
- Andrei Lupu,
- Garðar Ingvarsson,
- Timon Willi,
- Akbir Khan,
- Christian Schroeder de Witt,
- Alexandra Souly,
- Saptarashmi Bandyopadhyay,
- Mikayel Samvelyan,
- Minqi Jiang,
- Robert Lange,
- Shimon Whiteson,
- Bruno Lacerda,
- Nick Hawes,
- Tim Rocktäschel,
- Chris Lu,
- Jakob Foerster
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2444–2446Benchmarks play an important role in the development of machine learning algorithms, with reinforcement learning (RL) research having been heavily influenced by the available environments. However, RL environments are traditionally run on the CPU, ...