Integrating Sentiment Analysis and Reinforcement Learning for Equitable Disaster Response: A Novel Approach
<p>Conceptual diagram of the proposed framework for disaster resource allocation.</p> "> Figure 2
<p>Temporal distribution of disaster-related tweets. Peaks align with major events (e.g., hurricanes and floods).</p> "> Figure 3
<p>Keyword distribution in raw dataset, indicating relative frequency of resource mentions.</p> "> Figure 4
<p>Screenshot of the gamified simulation environment in dash reflecting interactive resource allocation and visualization tools.</p> "> Figure 5
<p>Temporal trends for resource demands: (<b>a</b>) electricity, (<b>b</b>) food, (<b>c</b>) shelter, (<b>d</b>) medical, and (<b>e</b>) water. Each subfigure reflects sentiment-driven insights from tweets over the time period under study.</p> "> Figure 6
<p>Equity distribution: histogram of equity scores showing how resource allocation is spread across regions. Higher values indicate more uniform distribution, while lower values suggest resource concentration.</p> "> Figure 7
<p>Trade-offs between equity and satisfaction in resource allocation. Clusters in the upper right region imply that resource distribution meets a large proportion of demands while preserving fairness.</p> ">
Abstract
:1. Introduction
- We demonstrate how sentiment analysis of disaster-related social media posts can reveal immediate shifts in community needs, thereby facilitating more accurate and timely resource prioritization [16].
- We develop and analyze an RL agent that dynamically allocates resources under varying levels of uncertainty, achieving higher equity and satisfaction metrics compared to static baseline approaches [17].
- We introduce an interactive simulation platform designed to engage diverse stakeholders, bridging the gap between theoretical modeling and practical policymaking.
2. Related Work
2.1. Sentiment Analysis in Disaster Management
2.2. Reinforcement Learning in Dynamic Environments
2.3. Gamification in Policy Decision-Making
2.4. Current Gaps and Research Opportunities
- While sentiment analysis offers valuable clues regarding immediate needs, few studies harness these signals to actively shape resource allocation in a continuous feedback loop.
- Existing RL models for disaster response often ignore fluctuating public sentiment and lack the capacity to update policies as these sentiments change.
- Most gamification efforts in disaster management focus on theoretical or educational outcomes, leaving a gap in practical, interactive decision-support tools that can handle real-time data streams.
3. Methodology
- Social Media Data: Disaster-related tweets and posts serve as the real-time data source, capturing public sentiment and resource needs.
- Data Preprocessing and Sentiment Analysis: Text data undergo cleaning, tokenization, and sentiment scoring (e.g., via VADER). This stage translates raw user-generated text into actionable demand metrics for specific resources, such as water, food, shelter, medical supplies, and electricity.
- Reinforcement Learning Framework: The sentiment-driven resource demands feed into an RL model that learns to balance satisfaction (meeting demand) and equity (fair distribution across regions).
- Gamified Simulation Environment: A user-friendly simulation platform enabling interactive testing of resource allocation strategies, visualization of real-time outcomes, and stakeholder engagement.
- Output: The system outputs an adaptive, context-aware resource allocation plan that continually updates as disaster conditions evolve.
3.1. Data Collection
3.1.1. Initial Dataset Analysis
- Time Span: Tweets from 2011 to 2019, capturing a diverse array of disasters and peak event periods.
- Geographic Spread: Global coverage with a concentration in disaster-prone regions (e.g., areas with frequent hurricanes).
- Topical Breadth: Various resource demands, emotional responses, and general commentary.
3.1.2. Data Filtering and Reduction
- Keyword Filtering: Only tweets referencing critical resources (water, food, shelter, medicine, electricity) were retained, focusing the dataset on actionable demands.
- Data Quality Checks: Tweets lacking valid timestamps, clear geolocation cues, or complete metadata were removed to maintain consistency in temporal and spatial analyses.
- Deduplication: Duplicate or near-duplicate content was eliminated to avoid overweighting specific messages.
- Total Tweets: 34,122
- Resource-Specific Tweets: 17,960 (approx. 52.6%)
- Temporal Coverage: 2011 to 2019
- Keyword Counts: Food is the most frequently cited resource, followed by shelter and water.
3.2. Data Preprocessing
3.2.1. Data Cleaning
- Removal of URLs, Mentions, and Hashtags: Regular expressions strip out these elements to reduce noise.
- Lowercasing and Tokenization: Ensures uniform text representation across tweets.
- Language Filtering: Only English tweets were retained to avoid ambiguity in sentiment scoring.
3.2.2. Resource Keyword Extraction
3.2.3. Sentiment Scoring
3.2.4. Data Aggregation and Representation
3.3. Reinforcement Learning Framework
3.3.1. State Space
3.3.2. Action Space
3.3.3. Reward Function
- Demand Satisfaction: Proportion of total resource needs met, incentivizing coverage of the most urgent demands.
- Equity: Uniformity of allocations across regions, promoting fairness and preventing under-served areas.
- High Satisfaction, Low Equity: allocating resources primarily to high-demand regions maximizes satisfaction but may leave other regions under-served, reducing equity.
- High Equity, Moderate Satisfaction: ensuring fair distribution across regions can lower overall satisfaction as resources may not fully meet high-demand areas.
3.3.4. Training Procedure
- Learning Rate: Balances stability and adaptability in updating Q-values.
- Discount Factor: Emphasizes cumulative benefits over extended disaster timelines.
- Exploration Rate: Decreases from to over training, ensuring sufficient exploration before exploiting the learned policy.
3.4. Gamified Simulation Environment
- Real-Time Visualization: Time-series graphs illustrate resource demands, allocation levels, and the resulting satisfaction/equity metrics across regions.
- Parameter Adjustment: Users can modify , the equity–satisfaction trade-off parameter, or manually allocate resources to compare performance against the RL-driven policy.
- Scenario Simulation: Synthetic disaster scenarios, where resource demands evolve dynamically, enable stress-testing of the learned RL policy under various crisis conditions.
3.5. Experimental Setup
- Baselines:
- –
- Uniform Allocation, where resources are distributed equally across all regions.
- –
- Rule-Based Allocation, where allocations rely on historical demand averages.
- Evaluation Metrics:
- –
- Satisfaction: Fraction of total resource demand fulfilled.
- –
- Equity: Fairness of allocation, measured via the inverse coefficient of variation.
- –
- Adaptability: System responsiveness to real-time sentiment fluctuations.
- Implementation Details: Python libraries (NumPy, Pandas, and Dash) were used for data management and visualization. The Q-learning algorithm was implemented in a modular manner, allowing easy integration with the simulation platform.
4. Results and Discussion
4.1. Resource Demand Analysis
- Electricity Demand: As illustrated in Figure 5a, electricity demand exhibits a noticeable spike in 2013, underscoring a potential system-wide power shortage or infrastructure failure. The sporadic appearance of electricity-related tweets suggests that such demands, while critical, can be episodic and closely tied to specific high-impact events such as large-scale storm outages.
- Food and Shelter Demand:Figure 5b,c display recurring peaks in food and shelter demands, notably concentrated in 2013 and 2017. These results confirm the fundamental importance of these resources for immediate survival and longer-term stability in post-disaster conditions. The temporal patterns observed also highlight seasonal and event-driven spikes, which decision-makers can use to anticipate urgent relief measures.
- Medical Demand: Compared to other resources, medical demand appears relatively subdued until 2019, where a pronounced surge is evident (Figure 5d). This anomaly likely corresponds to a health-related disaster or a scenario producing significant casualties. While medical aid may not be as universally demanded as food or shelter, this spike underscores the essential nature of rapidly scaling medical services under certain disaster conditions.
- Water Demand:Figure 5e underscores water’s pivotal role across multiple disaster events. Elevated demand peaks in 2017, for example, potentially coincide with heatwaves or urban flooding that compromise safe drinking water. The consistency of these spikes across different events confirms water’s criticality for public health and underscores the need for prioritizing potable water distribution during emergencies.
4.2. Equity and Satisfaction Analysis
4.2.1. Equity Distribution
4.2.2. Equity vs. Satisfaction
4.3. Reinforcement Learning Performance
- Dynamically Adapt to real-time sentiment signals, recalibrating allocations as demands shift.
- Balance Equity and Satisfaction by penalizing skewed distributions through the reward function, thus avoiding extreme allocations to a single region.
- Learn Priority Resource Patterns, as high-value resources (e.g., water, shelter) receive more weight in environments where they are pivotal to life-saving measures.
4.4. Gamified Simulation Environment
- Interactive Dashboard: The environment renders real-time plots of satisfaction vs. equity and allows manual adjustment of allocation parameters. This feature demystifies RL outputs and engages users in iterative resource distribution planning.
- Dynamic Scenario Updates: The simulation utilizes incoming sentiment data to mimic evolving disaster scenarios, ensuring that users witness how changes in public sentiment directly influence RL-driven strategies.
- Performance Metrics: Key metrics, including resource utilization rates, equity scores, and unmet demands, are displayed. These indicators help in diagnosing potential inefficiencies and highlight trade-offs crucial for policymaking.
4.5. Implications for Policy and Practice
- Real-Time Responsiveness: By incorporating social media sentiment, the framework ensures a rapid detection of emerging resource shortages, thus empowering agencies to respond more effectively to sudden surges in demand.
- Equitable Allocation: Although equity scores vary across scenarios, the system provides a data-driven means to minimize resource disparities among affected regions, a factor essential for ethical and inclusive disaster response.
- Sustainability and Resilience: Efficient and fair distribution of resources helps reduce waste, thereby aligning with broader sustainability goals while bolstering the resilience of impacted communities.
- Stakeholder Engagement: The gamified simulation platform offers policymakers a tangible and interactive tool for exploring the trade-offs between resource availability and demand patterns, thus encouraging evidence-based decision-making.
4.6. Limitations and Possible Directions
4.7. Summary of Key Findings
- Substantial Gains in Resource Satisfaction: up to 30% increases over static methods.
- Moderate to High Equity Scores: approximately 0.5 equity, showcasing balanced allocation across regions.
- Robustness to Dynamic Scenarios: real-time data integration allows the system to respond promptly to shifting demands.
- Enhanced Policy Engagement: the gamified platform simplifies complex trade-offs and promotes iterative policy experimentation.
5. Conclusions and Future Work
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
RL | Reinforcement Learning |
NLP | Natural Language Processing |
VADER | Valence Aware Dictionary and Sentiment Reasoner |
SDG | Sustainable Development Goal |
References
- Haigh, R.; Amaratunga, D. An integrative review of the built environment discipline’s role in the development of society’s resilience to disasters. Int. J. Disaster Resil. Built Environ. 2010, 1, 11–24. [Google Scholar] [CrossRef]
- For Disaster Risk Reduction (UNDRR), U.N.O. Terminology: Disaster. Available online: https://www.undrr.org/terminology/disaster (accessed on 19 January 2025).
- Turoff, M.; Chumer, M.; Van De Walle, B.; Yao, X. The design of a dynamic emergency response management information system (DERMIS). J. Inf. Technol. Theory Appl. (JITTA) 2004, 5, 1–35. [Google Scholar]
- United Nations. Sustainable Development Goals. 2015. Available online: http://sdgs.un.org (accessed on 20 January 2025).
- Cutter, S.L.; Ash, K.D.; Emrich, C.T. The geographies of community disaster resilience. Glob. Environ. Change 2014, 18, 598–606. [Google Scholar] [CrossRef]
- Imran, M.; Castillo, C.; Lucas, J.; Meier, P.; Vieweg, S. AIDR: Artificial intelligence for disaster response. In Proceedings of the 23rd International Conference on World Wide Web, Seoul, Republic of Korea, 7–11 April 2014; pp. 159–162. [Google Scholar] [CrossRef]
- Minaee, S.; Kalchbrenner, N.; Cambria, E.; Nikzad, N.; Chenaghlu, M.; Gao, J. Deep learning–based text classification: A comprehensive review. Acm Comput. Surv. (Csur) 2021, 54, 1–40. [Google Scholar] [CrossRef]
- Pang, B.; Lee, L. Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2008, 2, 1–135. [Google Scholar] [CrossRef]
- Barbosa, L.; Feng, J. Robust sentiment detection on Twitter from biased and noisy data. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, Association for Computational Linguistics, Beijing, China, 23–27 August 2010; pp. 36–44. [Google Scholar]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction; MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
- Li, Y. Deep reinforcement learning: An overview. arXiv 2017, arXiv:1701.07274. [Google Scholar]
- Kamilaris, A.; Prenafeta-Boldú, F.X. Deep learning in agriculture: A survey. Comput. Electron. Agric. 2018, 147, 70–90. [Google Scholar] [CrossRef]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G.; et al. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef]
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal policy optimization algorithms. arXiv 2017, arXiv:1707.06347. [Google Scholar]
- Hamari, J.; Koivisto, J.; Sarsa, H. Does gamification work? A literature review of empirical studies on gamification. In Proceedings of the Annual Hawaii International Conference on System Sciences (HICSS), Waikoloa, HI, USA, 6–9 January 2014; pp. 3025–3034. [Google Scholar] [CrossRef]
- Burel, G.; Alani, H. Crisis Event Extraction Service (CREES)—Automatic Detection and Classification of Crisis-related Content on Social Media. In Proceedings of the 15th International Conference on Information Systems for Crisis Response and Management, Rochester, NY, USA, 20–23 May 2018. [Google Scholar]
- Li, Y.; Yu, C.; Shahidehpour, M.; Yang, T.; Zeng, Z.; Chai, T. Deep Reinforcement Learning for Smart Grid Operations: Algorithms, Applications, and Prospects; IEEE: Piscataway, NJ, USA, 2023. [Google Scholar]
- Imran, M.; Castillo, C.; Diaz, F.; Vieweg, S. Processing social media messages in mass emergency: A survey. Acm Comput. Surv. (CSUR) 2015, 47, 1–38. [Google Scholar] [CrossRef]
- Ragini, J.R.; Anand, P.R.; Bhaskar, V. Big data analytics for disaster response and recovery through sentiment analysis. Int. J. Inf. Manag. 2018, 42, 13–24. [Google Scholar] [CrossRef]
- Castillo, C. Big Crisis Data: Social Media in Disasters and Time-Critical Situations; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
- Alam, F.; Ofli, F.; Imran, M. Crisismmd: Multimodal twitter datasets from natural disasters. In Proceedings of the International AAAI Conference on Web and Social Media, Palo Alto, CA, USA, 25–28 June 2018; Volume 12, pp. 395–402. [Google Scholar]
- Yum, S. Social media-based demographic and sentiment analysis for disaster responses. J. Emerg. Manag. 2024, 22, 89–99. [Google Scholar] [CrossRef] [PubMed]
- Hutto, C.; Gilbert, E. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the International AAAI Conference on Web and Social Media, Ann Arbor, MI, USA, 1–4 June 2014; Volume 8, pp. 216–225. [Google Scholar]
- Li, X.; Ding, H.; Yuan, H.; Zhang, W.; Pang, J.; Cheng, G.; Chen, K.; Liu, Z.; Loy, C.C. Transformer-based visual segmentation: A survey. In IEEE Transactions on Pattern Analysis and Machine Intelligence; IEEE: Piscataway, NJ, USA, 2024. [Google Scholar]
- Holler, J.; Vuorio, R.; Qin, Z.; Tang, X.; Jiao, Y.; Jin, T.; Singh, S.; Wang, C.; Ye, J. Deep reinforcement learning for multi-driver vehicle dispatching and repositioning problem. In Proceedings of the 2019 IEEE International Conference on Data Mining (ICDM), Beijing, China, 8–11 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1090–1095. [Google Scholar]
- Cui, J.; Liu, Y.; Nallanathan, A. Multi-agent reinforcement learning-based resource allocation for UAV networks. IEEE Trans. Wirel. Commun. 2019, 19, 729–743. [Google Scholar] [CrossRef]
- Rahman, R.; Hasan, S. A deep learning approach for network-wide dynamic traffic prediction during hurricane evacuation. Transp. Res. Part Emerg. Technol. 2023, 152, 104126. [Google Scholar] [CrossRef]
- Li, S.; Wu, T. Deep reinforcement learning-based decision support system for transportation infrastructure management under hurricane events. Struct. Saf. 2022, 99, 102254. [Google Scholar] [CrossRef]
- Harteveld, C. Triadic Game Design: Balancing Reality, Meaning and Play; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
- Kankanamge, N.; Yigitcanlar, T.; Goonetilleke, A.; Kamruzzaman, L. How can gamification be incorporated into disaster emergency planning? A systematic review of the literature. Int. J. Disaster Resil. Built Environ. 2020, 11, 158–175. [Google Scholar] [CrossRef]
- Tsai, M.H.; Chang, Y.L.; Shiau, J.S.; Wang, S.M. Exploring the effects of a serious game-based learning package for disaster prevention education: The case of Battle of Flooding Protection. Int. J. Disaster Risk Reduct. 2020, 43, 101393. [Google Scholar] [CrossRef]
- Weyrich, P.; Ruin, I.; Terti, G.; Scolobig, A. Using serious games to evaluate the potential of social media information in early warning disaster management. Int. J. Disaster Risk Reduct. 2021, 56, 102053. [Google Scholar] [CrossRef]
- Meng, L.; Dong, Z.S. Natural hazards Twitter dataset. arXiv 2020, arXiv:2004.14456. [Google Scholar]
- Bednarski, B.P.; Singh, A.D.; Jones, W.M. On collaborative reinforcement learning to optimize the redistribution of critical medical supplies throughout the COVID-19 pandemic. J. Am. Med. Inform. Assoc. 2021, 28, 874–878. [Google Scholar] [CrossRef]
- Bertsimas, D.; Farias, V.F.; Trichakis, N. On the efficiency-fairness trade-off. Manag. Sci. 2012, 58, 2234–2250. [Google Scholar] [CrossRef]
- Watkins, C.J.; Dayan, P. Q-learning. Mach. Learn. 1992, 8, 279–292. [Google Scholar] [CrossRef]
- Fernández Galeote, D.; Rajanen, M.; Rajanen, D.; Legaki, N.Z.; Langley, D.J.; Hamari, J. Gamification for climate change engagement: Review of corpus and future agenda. Environ. Res. Lett. 2021, 16, 063004. [Google Scholar] [CrossRef]
- Palen, L.; Vieweg, S.; Sutton, J.; Liu, S.B.; Hughes, A. Crisis informatics: Studying crisis in a networked world. In Proceedings of the Third International Conference on E-Social Science, Ann Arbor, MC, USA, 7–9 October 2007; pp. 7–9. [Google Scholar]
- Neppalli, K.V.; Caragea, C.; Squicciarini, A.C.; Tapia, A.; Stehle, S. Sentiment analysis during Hurricane Sandy in emergency response. Int. J. Disaster Risk Reduct. 2017, 21, 213–222. [Google Scholar] [CrossRef]
- Liu, H.; Zhang, J.; Zhou, Z.; Dai, Y.; Qin, L. A Deep Reinforcement Learning-Based Algorithm for Multi-Objective Agricultural Site Selection and Logistics Optimization Problem. Appl. Sci. 2024, 14, 8479. [Google Scholar] [CrossRef]
- Department, S.R. Penetration Rate of the Internet by Region 2025. Available online: https://www.statista.com/statistics/269329/penetration-rate-of-the-internet-by-region/ (accessed on 19 January 2025).
- Gizzi, F.T.; Kam, J.; Porrini, D. Time windows of opportunities to fight earthquake under-insurance: Evidence from Google Trends. Humanit. Soc. Sci. Commun. 2020, 7, 61. [Google Scholar] [CrossRef]
- Gizzi, F.T.; Potenza, M.R. Earthquake Insurance in California, USA: What does community-generated big data reveal to us? Big Data Cogn. Comput. 2022, 6, 60. [Google Scholar] [CrossRef]
Feature | Value |
---|---|
Total Tweets | 34,122 |
Tweets Mentioning Resources | 17,960 |
Temporal Range | 20 April 2011 to 9 September 2019 |
Resource Keyword Examples | Water: 2543 Food: 8907 Shelter: 9042 Medical: 237 Electricity: 99 |
Parameter | Symbol | Value | Description |
---|---|---|---|
Learning Rate | 0.1 | Weight of Q-value updates | |
Discount Factor | 0.9 | Priority of future rewards | |
Exploration Rate | Balance exploration/exploitation |
Actions | Water | Food | Shelter | Medical | Electricity |
---|---|---|---|---|---|
1 | 2.07 | 10.39 | 9.59 | 0.11 | 0.14 |
2 | 2.46 | 10.42 | 9.64 | 0.18 | 0.14 |
3 | 2.79 | 17.21 | 11.81 | 0.25 | 0.14 |
4 | 2.44 | 10.44 | 9.63 | 0.11 | 0.13 |
5 | 6.60 | 11.69 | 21.35 | 0.11 | 0.14 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alqithami, S. Integrating Sentiment Analysis and Reinforcement Learning for Equitable Disaster Response: A Novel Approach. Sustainability 2025, 17, 1072. https://doi.org/10.3390/su17031072
Alqithami S. Integrating Sentiment Analysis and Reinforcement Learning for Equitable Disaster Response: A Novel Approach. Sustainability. 2025; 17(3):1072. https://doi.org/10.3390/su17031072
Chicago/Turabian StyleAlqithami, Saad. 2025. "Integrating Sentiment Analysis and Reinforcement Learning for Equitable Disaster Response: A Novel Approach" Sustainability 17, no. 3: 1072. https://doi.org/10.3390/su17031072
APA StyleAlqithami, S. (2025). Integrating Sentiment Analysis and Reinforcement Learning for Equitable Disaster Response: A Novel Approach. Sustainability, 17(3), 1072. https://doi.org/10.3390/su17031072