research-article

Multi-agent deep reinforcement learning based real-time planning approach for responsive customized bus routes

Authors:

Xing WanAuthors Info & Claims

Volume 188, Issue C

https://doi.org/10.1016/j.cie.2023.109840

Published: 01 February 2024 Publication History

Abstract

Customized bus can meet many passengers’ personalized travel demand in a public transportation system by providing an innovative shared travel service. Customized bus offers multiple bus routes that jointly form a route network to serve its passengers. It must frequently adjust the station sequences of each bus route in response to trip cancellations and new trip bookings during its operation. Different from relevant research works in the literature, this paper proposes a multi-agent deep reinforcement learning based real-time planning approach for tackling the multiple customized bus routes planning problem. We model the problem as a multi-agent Markov decision process for the first time in literature where a separate agent is assigned to each route to plan its station sequence. We then develop a new multi-agent system. Each agent in the system is powered by an encoder–decoder neural network that consolidates the station sequence decision policy for each bus route. We employ a policy gradient-based reinforcement learning algorithm to train the network parameters of the multi-agent system so as to maximize the number of passengers served while ensuring the customized bus service quality and minimizing the operating cost of all customized bus routes. On three (six) problem instances in offline (online) scenarios, the trained multi-agent system can significantly outperform several existing algorithms in terms of the total cost, adaptiveness and computation time.

Highlights

•

This paper studies an emerging multiple routes planning problem for customized bus.

•

An innovative deep reinforcement learning based approach (MRL-RP) is proposed.

•

A new multi-agent system powered by encoder–decoder neural network is developed.

•

Empirical results show that MRL-RP outperforms some state-of-the-art algorithms.

References

[1]

Ai G., Zuo X., Chen G., Wu B., Deep reinforcement learning based dynamic optimization of bus timetable, Applied Soft Computing 131 (2022),.

Digital Library

[2]

Asghari M., Alehashem S.M.J.M., Rekik Y., Environmental and social implications of incorporating carpooling service on a customized bus system, Computers & Operations Research 142 (2022),.

Digital Library

[3]

Bono G., Dibangoye J.S., Simonin O., Matignon L., Pereyron F., Solving multi-agent routing problems using deep attention mechanisms, IEEE Transactions on Intelligent Transportation Systems 22 (12) (2021) 7804–7813,.

[4]

Chen X., Wang Y., Ma X., Integrated optimization for commuting customized bus stop planning, routing design, and timetable development with passenger spatial-temporal accessibility, IEEE Transactions on Intelligent Transportation Systems 22 (4) (2021) 2060–2075,.

[5]

Chen X., Wang Y., Wang Y., Qu X., Ma X., Customized bus route design with pickup and delivery and time windows: Model, case study and comparative analysis, Expert Systems with Applications 168 (2021),.

[6]

Darwish, A., Khalil, M., & Badawi, K. (2020). Optimising Public Bus Transit Networks Using Deep Reinforcement Learning. In 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC) (pp. 1–7). Rhodes, Greece: https://doi.org/10.1109/ITSC45102.2020.9294710.

[7]

Du W., Ding S., A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications, Artificial Intelligence Review 54 (5) (2021) 3215–3238,.

Digital Library

[8]

Foerster, J., Farquhar, G., Afouras, T., Nardelli, N., & Whiteson, S. (2018). Counterfactual Multi-Agent Policy Gradients. In Proceedings of the AAAI conference on artificial intelligence (pp. 2974–2982). New Orleans, Lousiana, USA: https://doi.org/10.1609/aaai.v32i1.11794.

[9]

Gronauer S., Diepold K., Multi-agent deep reinforcement learning: a survey, Artificial Intelligence Review 55 (2) (2022) 895–943,.

Digital Library

[10]

Guo R., Guan W., Zhang W., Meng F., Zhang Z., Customized bus routing problem with time window restrictions: model and case study, Transportmetrica A: Transport Science 15 (2) (2019) 1804–1824,.

[11]

Han S., Fu H., Zhao J., Lin J., Zeng W., Modelling and simulation of hierarchical scheduling of real-time responsive customised bus, IET Intelligent Transport Systems 14 (12) (2020) 1615–1625,.

[12]

Haydari A., Yılmaz Y., Deep reinforcement learning for intelligent transportation systems: A survey, IEEE Transactions on Intelligent Transportation Systems 23 (1) (2022) 11–32,.

Digital Library

[13]

Helbing D., Brockmann D., Chadefaux T., Donnay K., Blanke U., Woolley-Meza O., Moussaid M., Johansson A., Krause J., Schutte S., Perc M., Saving human lives: What complexity science and information systems can contribute, Journal of Statistical Physics 158 (2015) 735–781,.

[14]

Huang D., Gu Y., Wang S., Liu Z., Zhang W., A two-phase optimization model for the demand-responsive customized bus network design, Transportation Research Part C (Emerging Technologies) 111 (2020) 1–21,.

[15]

Jia D., Guo H., Song Z., Shi L., Deng X., Perc M., Wang Z., Local and global stimuli in reinforcement learning, New Journal of Physics 23 (8) (2021),.

[16]

Jusup M., Holme P., Kanazawa K., Takayasu M., Romić I., Wang Z., Geček S., Lipić T., Podobnik B., Wang L., Luo W., Klanjšček T., Fan J., Boccaletti S., Perc M., Social physics, Physics Reports 948 (2022) 1–148,.

[17]

Karimi-Mamaghan M., Mohammadi M., Meyer P., Karimi-Mamaghan A.M., Talbi E.-G., Machine learning at the service of meta-heuristics for solving combinatorial optimization problems: A state-of-the-art, European Journal of Operational Research 296 (2) (2022) 393–422,.

[18]

Ke J., Xiao F., Yang H., Ye J., Learning to delay in ride-sourcing systems: A multi-agent deep reinforcement learning framework, IEEE Transactions on Knowledge and Data Engineering 34 (5) (2022) 2280–2292,.

[19]

Li, M., Qin, Z., Jiao, Y., Yang, Y., Wang, J., Wang, C., Wu, G., & Ye, J. (2019). Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning. In The world wide web conference (pp. 983–994). San Francisco, CA, USA: https://doi.org/10.1145/3308558.3313433.

[20]

Liang E., Wen K., Lam W.H.K., Sumalee A., Zhong R., An integrated reinforcement learning and centralized programming approach for online taxi dispatching, IEEE Transactions on Neural Networks and Learning Systems 33 (9) (2022) 4742–4756,.

[21]

Liu T., Ceder A.A., Analysis of a new public-transport-service concept: Customized bus in China, Transport Policy 39 (2015) 63–76,.

[22]

Lyu Y., Chow C.-Y., Lee V.C., Ng J.K., Li Y., Zeng J., CB-planner: A bus line planning framework for customized bus systems, Transportation Research Part C (Emerging Technologies) 101 (2019) 233–253,.

[23]

Ma H., Yang M., Li X., Integrated optimization of customized bus routes and timetables with consideration of holding control, Computers & Industrial Engineering 175 (2023),.

Digital Library

[24]

Mazyavkina N., Sviridov S., Ivanov S., Burnaev E., Reinforcement learning for combinatorial optimization: A survey, Computers & Operations Research 134 (2021),.

[25]

Menda K., Chen Y.-C., Grana J., Bono J.W., Tracey B.D., Kochenderfer M.J., Wolpert D., Deep reinforcement learning for event-driven multi-agent decision processes, IEEE Transactions on Intelligent Transportation Systems 20 (4) (2019) 1259–1268,.

[26]

Mnih V., Kavukcuoglu K., Silver D., Rusu A.A., Veness J., Bellemare M.G., Graves A., Riedmiller M., Fidjeland A.K., Ostrovski G., Petersen S., Beattie C., Sadik A., Antonoglou I., King H., Kumaran D., Wierstra D., Legg S., Hassabis D., Human-level control through deep reinforcement learning, Nature 518 (2021) 529–533,.

[27]

Nazari, M., Oroojlooy, A., Snyder, L., & Takac, M. (2018). Reinforcement Learning for Solving the Vehicle Routing Problem. In Advances in neural information processing systems (pp. 9839–9849). Montréal, Canada.

[28]

Nguyen T.T., Nguyen N.D., Nahavandi S., Deep reinforcement learning for multiagent systems: A review of challenges, solutions, and applications, IEEE Transactions on Cybernetics 50 (9) (2020) 3826–3839,.

[29]

Ren L., Fan X., Cui J., Shen Z., Lv Y., Xiong G., A multi-agent reinforcement learning method with route recorders for vehicle routing in supply chain management, IEEE Transactions on Intelligent Transportation Systems 23 (9) (2022) 16410–16420,.

Digital Library

[30]

Rubenstein M., Cornejo A., Nagpal R., Programmable self-assembly in a thousand-robot swarm, Science 345 (6198) (2014) 795–799,.

[31]

Shen C., Sun Y., Bai Z., Cui H., Real-time customized bus routes design with optimal passenger and vehicle matching based on column generation algorithm, Physica A. Statistical Mechanics and its Applications 571 (2021),.

[32]

Solomon M.M., Algorithms for the vehicle routing and scheduling problems with time window constraints, Operations Research 35 (2) (1987) 254–265,.

[33]

Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to Sequence Learning with Neural Networks. In Advances in neural information processing systems (pp. 3104–3112). Montreal, Quebec, Canada.

[34]

Tong L.C., Zhou L., Liu J., Zhou X., Customized bus service design for jointly optimizing passenger to vehicle assignment and vehicle routing, Transportation Research Part C (Emerging Technologies) 85 (2017) 451–475,.

[35]

Vansteenwegen P., Melis L., Aktaş D., Montenegro B.D.G., Vieira F.S., Sörensen K., A survey on demand-responsive public bus systems, Transportation Research Part C (Emerging Technologies) 137 (2022),.

[36]

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is All you Need. In Advances in neural information processing systems (pp. 5998–6008). Long Beach, CA, USA.

[37]

Veres M., Moussa M., Deep learning for intelligent transportation systems: A survey of emerging trends, IEEE Transactions on Intelligent Transportation Systems 21 (8) (2020) 3152–3168,.

[38]

Vinyals, O., Fortunato, M., & Jaitly, N. (2015). Pointer Networks. In Advances in neural information processing systems (pp. 2692–2700). Montreal, Quebec, Canada.

[39]

Wang Y., He H., Sun C., Learning to navigate through complex dynamic environment with modular deep reinforcement learning, IEEE Transactions on Games 10 (4) (2018) 400–412,.

[40]

Wang C., Ma C., Xu X., Multi-objective optimization of real-time customized bus routes based on two-stage method, Physica A. Statistical Mechanics and its Applications 537 (2020),.

[41]

Wang Q., Tang C., Deep reinforcement learning for transportation network combinatorial optimization: A survey, Knowledge-Based Systems 233 (2021),.

Digital Library

[42]

Wang J., Yamamoto T., Liu K., Key determinants and heterogeneous frailties in passenger loyalty toward customized buses: An empirical investigation of the subscription termination hazard of users, Transportation Research Part C (Emerging Technologies) 115 (2020),.

[43]

Wang Z., Yu J., Hao W., Xiang J., Joint optimization of running route and scheduling for the mixed demand responsive feeder transit with time-dependent travel times, IEEE Transactions on Intelligent Transportation Systems 22 (4) (2021) 2498–2509,.

[44]

Yu J.J.Q., Yu W., Gu J., Online vehicle routing with neural combinatorial optimization and deep reinforcement learning, IEEE Transactions on Intelligent Transportation Systems 20 (10) (2019) 3806–3817,.

[45]

Zhang K., He F., Zhang Z., Lin X., Li M., Multi-vehicle routing problems with soft time windows: A multi-agent reinforcement learning approach, Transportation Research Part C (Emerging Technologies) 121 (2020),.

[46]

Zhang Z., Liu H., Zhou M., Wang J., Solving dynamic traveling salesman problems with deep reinforcement learning, IEEE Transactions on Neural Networks and Learning Systems early access (2021) 1–14,.

[47]

Zhang J., Wang D.Z.W., Meng M., Analyzing customized bus service on a multimodal travel corridor: An analytical modeling approach, Journal of Transportation Engineering, Part A: Systems 143 (11) (2017),.

[48]

Zhang J., Wang D.Z., Meng M., Which service is better on a linear travel corridor: Park and ride or on-demand public bus?, Transportation Research Part A: Policy and Practice 118 (2018) 803–818,.

[49]

Zhao, Y., Chen, G., Ma, H., Zuo, X., & Ai, G. (2022). Dynamic Bus Holding Control Using Spatial-Temporal Data – A Deep Reinforcement Learning Approach. In AI 2022: Advances in artificial intelligence (pp. 661–674). Cham: https://doi.org/10.1007/978-3-031-22695-3_46.

[50]

Zhao J., Mao M., Zhao X., Zou J., A hybrid of deep reinforcement learning and local search for the vehicle routing problems, IEEE Transactions on Intelligent Transportation Systems 22 (11) (2021) 7208–7218,.

Digital Library

[51]

Zhao R., Wang D., Yan R., Mao K., Shen F., Wang J., Machine health monitoring using local feature-based gated recurrent unit networks, IEEE Transactions on Industrial Electronics 65 (2) (2018) 1539–1548,.

Recommendations

Deep reinforcement learning for multi-agent interaction
Multi-agent systems research in the United Kingdom

The development of autonomous agents which can interact with other agents to accomplish a given task is a core area of research in artificial intelligence and machine learning. Towards this goal, the Autonomous Agents Research Group develops novel ...
Mediated Multi-Agent Reinforcement Learning
AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems

The majority of Multi-Agent Reinforcement Learning (MARL) literature equates the cooperation of self-interested agents in mixed environments to the problem of social welfare maximization, allowing agents to arbitrarily share rewards and private ...
Assured Deep Multi-Agent Reinforcement Learning for Safe Robotic Systems
Agents and Artificial Intelligence
Abstract
Using multi-agent reinforcement learning to find solutions to complex decision-making problems in shared environments has become standard practice in many scenarios. However, this is not the case in safety-critical scenarios, where the ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Computers and Industrial Engineering

Computers and Industrial Engineering Volume 188, Issue C

Feb 2024

1029 pages

Issue’s Table of Contents

Elsevier Ltd.

Publisher

Pergamon Press, Inc.

United States

Publication History

Published: 01 February 2024

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

Figures

Tables

Media

View Issue’s Table of Contents