Keyword: multi-agent reinforcement learning : Search

research-article

Free

Can Cooperative Multi-Agent Reinforcement Learning Boost Automatic Web Testing? An Exploratory Study

ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software EngineeringPages 14–26https://doi.org/10.1145/3691620.3694983

Reinforcement learning (RL)-based web GUI testing techniques have attracted significant attention in both academia and industry due to their ability to facilitate automatic and intelligent exploration of websites under test. Yet, the existing approaches ...

Article

Demand-Responsive Transport Dynamic Scheduling Optimization Based on Multi-agent Reinforcement Learning Under Mixed Demand

Artificial Neural Networks and Machine Learning – ICANN 2024Pages 356–368https://doi.org/10.1007/978-3-031-72341-4_24

Abstract

Demand-Responsive Transport (DRT) is an innovative mode of public transportation that focuses on individual passenger needs by offering customized transportation solutions. Most prior researches rely on historical passenger flow to generate static ...

Article

Reinforcement Learning-Based Cooperative Traffic Control System

Computational Collective IntelligencePages 176–188https://doi.org/10.1007/978-3-031-70819-0_14

Abstract

Urban traffic congestion is an increasingly pressing issue and advanced solutions like intelligent traffic control systems are becoming unavoidable. This paper explores the application of reinforcement learning to enhance traffic flow and reduce ...

research-article

Open Access

DyPS: Dynamic Parameter Sharing in Multi-Agent Reinforcement Learning for Spatio-Temporal Resource Allocation

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 3128–3139https://doi.org/10.1145/3637528.3672052

In large-scale metropolis, it is critical to efficiently allocate various resources such as electricity, medical care, and transportation to meet the living demands of citizens, according to the spatio-temporal distributions of resources and demands. ...

research-article

Rethinking Order Dispatching in Online Ride-Hailing Platforms

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 3863–3873https://doi.org/10.1145/3637528.3672028

Achieving optimal order dispatching has been a long-standing challenge for online ride-hailing platforms. Early methods would make shortsighted matchings as they only consider order prices alone as the edge weights in the driver-order bipartite graph, ...

research-article

Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 3598–3609https://doi.org/10.1145/3637528.3671790

Active Voltage Control (AVC) on the Power Distribution Networks (PDNs) aims to stabilize the voltage levels to ensure efficient and reliable operation of power systems. With the increasing integration of distributed energy resources, recent efforts have ...

short-paper

Open Access

Learning to Communicate Strategically for Efficient Collective Intelligence

NAIC '24: Proceedings of the 2024 SIGCOMM Workshop on Networks for AI ComputingPages 4–6https://doi.org/10.1145/3672198.3673795

Learning to communicate (L2C) involves learning how, when, and with whom to communicate to enhance cooperation among agents under limited bandwidth. However, introducing L2C impedes the original learning tasks, resulting in slower learning and poor ...

research-article

Free

EVDMARL: Efficient Value Decomposition-based Multi-Agent Reinforcement Learning with Domain-Randomization for Complex Analog Circuit Design Migration

DAC '24: Proceedings of the 61st ACM/IEEE Design Automation ConferenceArticle No.: 284, Pages 1–6https://doi.org/10.1145/3649329.3656523

Automated analog circuit design migration significantly alleviates the burden on designers in circuit sizing under various operating conditions. Conventional methods model the migration problem as black-box optimization, requiring excessive iterations of ...

research-article

Open Access

Optimizing Profitability of E-Scooter Sharing System via Battery-aware Recommendation

MOBISYS '24: Proceedings of the 22nd Annual International Conference on Mobile Systems, Applications and ServicesPages 575–587https://doi.org/10.1145/3643832.3661859

In e-scooter sharing systems, users randomly select and use e-scooters based on inaccurate battery information. This simple rental policy leads to low profitability on two fronts. First, inaccurate battery information causes unexpected device shutdowns, ...

research-article

Open Access

A simulation and experimentation architecture for resilient cooperative multiagent reinforcement learning models operating in contested and dynamic environments

Simulation (SIMU), Volume 100, Issue 6Pages 563–579https://doi.org/10.1177/00375497241232432

Cooperative multiagent reinforcement learning approaches are increasingly being used to make decisions in contested and dynamic environments, which tend to be wildly different from the environments used to train them. As such, there is a need for a more ...

research-article

A Survey of Multi-Agent Deep Reinforcement Learning with Communication

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2845–2847

Communication is an effective mechanism for coordinating the behaviors of multiple agents, broadening their views of the environment, and to support their collaborations. In the field of multi-agent deep reinforcement learning (MADRL), agents can improve ...

research-article

Naphtha Cracking Center Scheduling Optimization using Multi-Agent Reinforcement Learning

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2806–2808

The Naphtha Cracking Center (NCC) is central to petrochemical feedstock production through the intricate process. It consists of receipt stage for unloading naphtha, blending stage for mixing naphtha, and furnace stage for producing marketable products. ...

research-article

Advancing Sample Efficiency and Explainability in Multi-Agent Reinforcement Learning

Zhicheng Zhang

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2791–2793

Multi-Agent Reinforcement Learning (MARL) holds promise for complex real-world applications but faces challenges in sample efficiency and policy explainability. My dissertation aims to address these critical barriers, advancing MARL towards more ...

research-article

Cooperative Multi-Agent Reinforcement Learning in Convention Reliant Environments

Jarrod Shipton

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2773–2775

There has been a substantial increase in interest in the field of Reinforcement Learning (RL), particularly that of using it to solve problems involving cooperation between many different agents, examples include self driving cars, robot assistants and ...

research-article

Scaling up Cooperative Multi-agent Reinforcement Learning Systems

Minghong Geng

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2737–2739

Cooperative multi-agent reinforcement learning methods aim to learn effective collaborative behaviours of multiple agents performing complex tasks. However, existing MARL methods are commonly proposed for fairly small-scale multi-agent benchmark problems,...

extended-abstract

Decentralized Competing Bandits in Many-to-One Matching Markets

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2603–2605

Two-sided matching is a classic and well-studied problem. As the participants are usually not aware of the accurate preferences towards the other side, the model of competing bandits characterizes the process of learning uncertainty through interactions ...

extended-abstract

MATLight: Traffic Signal Coordinated Control Algorithm based on Heterogeneous-Agent Mirror Learning with Transformer

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2582–2584

In order to better handle the issue of real-time multi-intersection traffic signal coordinated control, we expect that multi-agent decision-making can benefit from the advantages of large sequence models. In this paper, we propose a method for multi-...

extended-abstract

Fairness and Cooperation between Independent Reinforcement Learners through Indirect Reciprocity

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2468–2470

In a multi-agent setting, altruistic cooperation is costly yet socially desirable. As such, agents adapting through independent reinforcement learning struggle to converge to efficient, cooperative policies. Indirect reciprocity (IR) constitutes a ...

extended-abstract

Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2453–2455

We introduce hybrid execution in multi-agent reinforcement learning (MARL), a new paradigm in which agents aim to successfully complete cooperative tasks with arbitrary communication levels at execution time by taking advantage of information-sharing ...

extended-abstract

JaxMARL: Multi-Agent RL Environments and Algorithms in JAX

AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2444–2446

Benchmarks play an important role in the development of machine learning algorithms, with reinforcement learning (RL) research having been heavily influenced by the available environments. However, RL environments are traditionally run on the CPU, ...

Applied Filters

People

Names

Institutions

Authors

Editors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Reproducibility Badges

Publication Date

Save to Binder

Upcoming Conferences