Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3534678.3542679acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open access

Reinforcement Learning Enhances the Experts: Large-scale COVID-19 Vaccine Allocation with Multi-factor Contact Network

Published: 14 August 2022 Publication History

Abstract

In the fight against the COVID-19 pandemic, vaccines are the most critical resource but are still in short supply around the world. Therefore, efficient vaccine allocation strategies are urgently called for, especially in large-scale metropolis where uneven health risk is manifested in nearby neighborhoods. However, there exist several key challenges in solving this problem: (1) great complexity in the large scale scenario adds to the difficulty in experts' vaccine allocation decision making; (2) heterogeneous information from all aspects in the metropolis' contact network makes information utilization difficult in decision making; (3) when utilizing the strong decision-making ability of reinforcement learning (RL) to solve the problem, poor explainability limits the credibility of the RL strategies. In this paper, we propose a reinforcement learning enhanced experts method. We deal with the great complexity via a specially designed algorithm aggregating blocks in the metropolis into communities and we hierarchically integrate RL among the communities and experts solution within each community. We design a self-supervised contact network representation algorithm to fuse the heterogeneous information for efficient vaccine allocation decision making. We conduct extensive experiments in three metropolis with real-world data and prove that our method outperforms the best baseline, reducing 9.01% infections and 12.27% deaths.We further demonstrate the explainability of the RL model, adding to its credibility and also enlightening the experts in turn.

Supplemental Material

MP4 File
Presentation video for paper 'Reinforcement Learning Enhances the Experts: Large Scale COVID-19 Vaccines Allocation with Multi-factor Contact Network'

References

[1]
Hamsa Bastani, Kimon Drakopoulos, Vishal Gupta, Ioannis Vlachogiannis, Christos Hadjicristodoulou, Pagona Lagiou, Gkikas Magiorkinis, Dimitrios Paraskevis, and Sotirios Tsiodras. 2021. Efficient and targeted COVID-19 border testing via reinforcement learning. Nature, Vol. 599, 7883 (2021), 108--113.
[2]
Bryan P Bednarski, Akash Deep Singh, and William M Jones. 2021. On collaborative reinforcement learning to optimize the redistribution of critical medical supplies throughout the COVID-19 pandemic. Journal of the American Medical Informatics Association, Vol. 28, 4 (2021), 874--878.
[3]
Kate M Bubar, Kyle Reinholt, Stephen M Kissler, Marc Lipsitch, Sarah Cobey, Yonatan H Grad, and Daniel B Larremore. 2021. Model-informed COVID-19 vaccine prioritization strategies by age and serostatus. Science, Vol. 371, 6532 (2021), 916--921.
[4]
Hui Cao and Simin Huang. 2012. Principles of scarce medical resource allocation in natural disaster relief: a simulation approach. Medical Decision Making, Vol. 32, 3 (2012), 470--476.
[5]
Serina Chang, Emma Pierson, Pang Wei Koh, Jaline Gerardin, Beth Redbird, David Grusky, and Jure Leskovec. 2021. Mobility network models of COVID-19 explain inequities and inform reopening. Nature, Vol. 589, 7840 (2021), 82--87.
[6]
Serina Chang, Mandy L Wilson, Bryan Lewis, Zakaria Mehrab, Komal K Dudakiya, Emma Pierson, Pang Wei Koh, Jaline Gerardin, Beth Redbird, David Grusky, et almbox. [n.d.]. Supporting covid-19 policy response with large-scale mobility-based modeling. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining.
[7]
Lin Chen, Fengli Xu, Zhenyu Han, Kun Tang, Pan Hui, James Evans, and Yong Li. 2021. Strategic COVID-19 vaccine distribution can simultaneously elevate social utility and equity. arXiv preprint arXiv:2111.06689 (2021).
[8]
Carlos Del Rio, Saad B Omer, and Preeti N Malani. 2021. Winter of omicron-the evolving COVID-19 pandemic. JAMA (2021).
[9]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
[10]
Ezekiel J Emanuel, Govind Persad, Adam Kern, Allen Buchanan, Cécile Fabre, Daniel Halliday, Joseph Heath, Lisa Herzog, RJ Leland, Ephrem T Lemango, et al. 2020. An ethical framework for global vaccine allocation. Science, Vol. 369, 6509 (2020), 1309--1312.
[11]
Jim AC Everett, Clara Colombatto, Edmond Awad, Paulo Boggio, Björn Bos, William J Brady, Megha Chawla, Vladimir Chituc, Dongil Chung, Moritz A Drupp, et al. 2021. Moral dilemmas and trust in leaders during a global health crisis. Nature human behaviour, Vol. 5, 8 (2021), 1074--1088.
[12]
Samuel Greydanus, Anurag Koul, Jonathan Dodge, and Alan Fern. 2018. Visualizing and understanding atari agents. In International conference on machine learning. PMLR, 1792--1801.
[13]
Qianyue Hao, Lin Chen, Fengli Xu, and Yong Li. 2020. Understanding the urban pandemic spreading of COVID-19 with real world mobility data. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 3485--3492.
[14]
Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.
[15]
Yuanshuang Jiang, Linfang Hou, Yuxiang Liu, Zhuoye Ding, Yong Zhang, and Shengzhong Feng. 2020. Epidemic Control Based on Reinforcement Learning Approaches. (2020).
[16]
Soheyl Khalilpourazari and Hossein Hashemi Doulabi. 2021. Designing a hybrid reinforcement learning based algorithm with application in prediction of the COVID-19 pandemic in Quebec. Annals of Operations Research (2021), 1--45.
[17]
Gloria Hyunjung Kwak, Lowell Ling, and Pan Hui. 2021. Deep reinforcement learning approaches for global public health strategies for COVID-19 pandemic. Plos one, Vol. 16, 5 (2021), e0251550.
[18]
Laura Matrajt, Julia Eaton, Tiffany Leung, and Elizabeth R Brown. 2021. Vaccine optimization for COVID-19: Who to vaccinate first? Science Advances, Vol. 7, 6 (2021), eabf1374.
[19]
John N Nkengasong, Nicaise Ndembi, Akhona Tshangela, and Tajudeen Raji. 2020. COVID-19 vaccines: how to ensure Africa has access.
[20]
Abu Quwsar Ohi, MF Mridha, Muhammad Mostafa Monowar, Md Hamid, et al. 2020. Exploring optimal control of epidemic spread using reinforcement learning. Scientific reports, Vol. 10, 1 (2020), 1--19.
[21]
World Health Organization et al. 2020. WHO SAGE values framework for the allocation and prioritization of COVID-19 vaccination, 14 September 2020. Technical Report. World Health Organization.
[22]
Ferran Parés, Dario Garcia Gasulla, Armand Vilalta, Jonatan Moreno, Eduard Ayguadé, Jesús Labarta, Ulises Cortés, and Toyotaro Suzumura. 2017. Fluid communities: a competitive, scalable and diverse community detection algorithm. In International conference on complex networks and their applications. Springer, 229--240.
[23]
Govind Persad, Ezekiel J Emanuel, Samantha Sangenito, Aaron Glickman, Steven Phillips, and Emily A Largent. 2021. Public perspectives on COVID-19 vaccine prioritization. JAMA network open, Vol. 4, 4 (2021), e217943--e217943.
[24]
Govind Persad, Monica E Peek, and Ezekiel J Emanuel. 2020. Fairly prioritizing groups for access to COVID-19 vaccines. Jama, Vol. 324, 16 (2020), 1601--1602.
[25]
Stefano Giovanni Rizzo. 2020. Balancing precision and recall for costeffective epidemic containment. (2020).
[26]
Harald Schmidt, Rebecca Weintraub, Michelle A Williams, Kate Miller, Alison Buttenheim, Emily Sadecki, Helen Wu, Aditi Doiphode, Neha Nagpal, Lawrence O Gostin, et al. 2021. Equitable allocation of COVID-19 vaccines in the United States. Nature Medicine, Vol. 27, 7 (2021), 1298--1307.
[27]
John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, and Pieter Abbeel. 2015. High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438 (2015).
[28]
John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017).
[29]
Max Schwarzer, Ankesh Anand, Rishab Goel, R Devon Hjelm, Aaron Courville, and Philip Bachman. 2020. Data-efficient reinforcement learning with self-predictive representations. arXiv preprint arXiv:2007.05929 (2020).
[30]
Aravind Srinivas, Michael Laskin, and Pieter Abbeel. 2020. Curl: Contrastive unsupervised representations for reinforcement learning. arXiv preprint arXiv:2004.04136 (2020).
[31]
Andrew J Stier, Marc G Berman, and Luis Bettencourt. 2020. COVID-19 attack rate increases with city size. arXiv preprint arXiv:2003.10376 (2020).
[32]
Adam Stooke, Kimin Lee, Pieter Abbeel, and Michael Laskin. 2021. Decoupling representation learning from reinforcement learning. In International Conference on Machine Learning. PMLR, 9870--9879.
[33]
Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li. 2015. Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853 (2015).
[34]
Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau, and Rob Fergus. 2019. Improving sample efficiency in model-free reinforcement learning from images. arXiv preprint arXiv:1910.01741 (2019).
[35]
Tao Yu, Cuiling Lan, Wenjun Zeng, Mingxiao Feng, Zhizheng Zhang, and Zhibo Chen. 2021. Playvirtual: Augmenting cycle-consistent virtual trajectories for reinforcement learning. Advances in Neural Information Processing Systems, Vol. 34 (2021).

Cited By

View all
  • (2024)DyPS: Dynamic Parameter Sharing in Multi-Agent Reinforcement Learning for Spatio-Temporal Resource AllocationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3672052(3128-3139)Online publication date: 25-Aug-2024
  • (2024)A guided twin delayed deep deterministic reinforcement learning for vaccine allocation in human contact networksApplied Soft Computing10.1016/j.asoc.2024.112322(112322)Online publication date: Oct-2024
  • (2023)GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement LearningProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599359(685-697)Online publication date: 6-Aug-2023
  • Show More Cited By

Index Terms

  1. Reinforcement Learning Enhances the Experts: Large-scale COVID-19 Vaccine Allocation with Multi-factor Contact Network

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
      August 2022
      5033 pages
      ISBN:9781450393850
      DOI:10.1145/3534678
      This work is licensed under a Creative Commons Attribution International 4.0 License.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 14 August 2022

      Check for updates

      Author Tags

      1. covid-19 pandemic
      2. model explainability
      3. reinforcement learning
      4. self-supervised representation learning
      5. vaccine allocation.

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      KDD '22
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)747
      • Downloads (Last 6 weeks)111
      Reflects downloads up to 14 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)DyPS: Dynamic Parameter Sharing in Multi-Agent Reinforcement Learning for Spatio-Temporal Resource AllocationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3672052(3128-3139)Online publication date: 25-Aug-2024
      • (2024)A guided twin delayed deep deterministic reinforcement learning for vaccine allocation in human contact networksApplied Soft Computing10.1016/j.asoc.2024.112322(112322)Online publication date: Oct-2024
      • (2023)GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement LearningProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599359(685-697)Online publication date: 6-Aug-2023
      • (2023)GCRL: Efficient Delivery Area Assignment for Last-mile Logistics with Group-based Cooperative Reinforcement Learning2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00269(3522-3534)Online publication date: Apr-2023
      • (2023)Using Reinforcement Learning for Optimizing COVID-19 Vaccine Distribution StrategiesMathematical Modeling and Intelligent Control for Combating Pandemics10.1007/978-3-031-33183-1_10(169-196)Online publication date: 12-May-2023

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media