Social Media Big Data Mining and Spatio-Temporal Analysis on Public Emotions for Disaster Mitigation
<p>Framework of the automatic emotion classification and disaster analysis.</p> "> Figure 2
<p>The study area of the 2013 Ya’an earthquake that was used in this paper.</p> "> Figure 3
<p>Structure of the text feature vector.</p> "> Figure 4
<p>Structure of the convolutional neural network (CNN) model used in the paper.</p> "> Figure 5
<p>Emotional distribution characteristics of the affected population. The figure (<b>a</b>), (<b>b</b>), (<b>c</b>), (<b>d</b>) and (<b>e</b>) describe the distribution of emotions in different time periods within 72 hours after the disaster. The figure (<b>f</b>) shows the distribution of emotions over 72 hours. Among them, each of red circle 1, red circle 2, and red circle 3 in the figures represent the same area. The blue circle 1 in (<b>b</b>) shows that compared with (<b>a</b>), new negative emotions emerged in same area.</p> "> Figure 6
<p>Changes in different emotion categories in data volume for different periods of time.</p> "> Figure 7
<p>Changes of the crowd amount in each small time period.</p> "> Figure 8
<p>The change characteristics of public spatio-temporal trajectory. This sequence diagram describes how the crowd moved in different small time periods after the earthquake. Among them, the figure (<b>a</b>) shows the trajectories of public change in 10 minutes after the earthquake. Three clusters were formed in this period. The figure (<b>b</b>) shows the location relationship between each cluster and shelters in the second ten minutes. The figure (<b>c</b>), (<b>d</b>) and (<b>e</b>) shows that all small clusters formed a large cluster over time and it had the largest population between 08:40 and 09:00 as in figure (<b>d</b>). The figure (<b>f</b>) and (<b>g</b>) shows crowd was gradually dissipating and leaving the shelter. From the whole process of analysis, we determined that: (1) When the earthquake happened, people rushed to the shelters in a very short time period. However, were these shelters reasonably laid out? We saw that some shelters did not contain many people, or even had no people. Therefore, the analysis results could be used as a reference for the rational layout of shelters. (2) The characteristics of crowd gathering and evacuation could be used as an effective reference to aid disaster reduction departments in dealing with future emergencies.</p> "> Figure 9
<p>Sequence diagram of positive emotion (the words in the text box are the hot words related to this emotion in the corresponding time period).</p> "> Figure 10
<p>Sequence diagram of anxiety.</p> "> Figure 11
<p>Sequence diagram of anger.</p> "> Figure 12
<p>Sequence diagram of sadness.</p> "> Figure 13
<p>Sequence diagram of fear.</p> "> Figure 14
<p>Classification accuracy of different emotions.</p> "> Figure 15
<p>Comparison of the number of pieces of address information with different accuracy in each city.</p> ">
Abstract
:1. Introduction
2. Framework to Analyze Public Emotion from Social Media Big Data
2.1. Social Media Data Acquisition and Parsing
2.2. Social Media Data Processing
2.3. Constructing the Word Vector List
2.3.1. Word Segmentation and the Removal of Stop Words
2.3.2. Construction of the Word Vector List
2.4. Model Training
2.4.1. Word Segmentation, Removal of Stop Words, and Construction of the Feature Matrix
2.4.2. Training Convolutional Neural Network Model
2.5. Emotion Classification
2.6. Spatio-Temporal Analysis of the Public Emotions
3. Spatio-Temporal Analysis of Public Emotional Information
3.1. Spatio-Temporal Assessment of the Affected Population
3.2. Emotional Spatio-Temporal Trajectory Mining
3.3. Post-Disaster Emotional Change Analysis
4. Evaluation of the Experimental Indicators
4.1. Accuracy Evaluation of the Emotion Classification
4.1.1. Experimental Corpus
4.1.2. Experimental Environment
4.1.3. Experimental Results and Accuracy Comparison
4.2. Evaluation of Spatio-Temporal Analysis Experiments
4.2.1. The Description of Data with Address Information
4.2.2. Evaluation of the Experimental Process and Results
5. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Goodchild, M.F. Citizens as sensors: Web 2.0 and the volunteering of geographic information. GeoFocus. Rev. Int. Ciencia Tecnol. Inf. Geogr. 2007, 7, 8–10. [Google Scholar]
- Zhou, Y.; Yang, L.; Van de Walle, B.; Han, C. Classification of microblogs for support emergency responses: Case study Yushu earthquake in China. In Proceedings of the 2013 46th Hawaii International Conference on System Sciences, Wailea, Maui, HI, USA, 7–10 January 2013; pp. 1553–1562. [Google Scholar]
- Qu, Y.; Huang, C.; Zhang, P.; Zhang, J. Microblogging after a major disaster in China: A case study of the 2010 Yushu earthquake. In Proceedings of the ACM 2011 Conference on Computer Supported Cooperative Work, Hangzhou, China, 19–23 March 2010; pp. 25–34. [Google Scholar]
- Chae, J.; Thom, D.; Jang, Y.; Kim, S.; Ertl, T.; Ebert, D.S. Special Section on Visual Analytics: Public behavior response analysis in disaster events utilizing visual analytics of microblog data. Comput. Graph. 2014, 38, 51–60. [Google Scholar] [CrossRef]
- Li, Z.; Wang, C.; Emrich, C.T.; Guo, D. A novel approach to leveraging social media for rapid flood mapping: A case study of the 2015 South Carolina floods. Cartogr. Geogr. Inf. Sci. 2017, 1–14. [Google Scholar] [CrossRef]
- Social Media in Disasters and Emergencies; The Drum: Washington, DC, USA, 3 March 2013.
- Ekman, P. Basic Emotions. In Handbook of Cognition & Emotion; Dalgleish, T., Power, M.J., Eds.; John Wiley & Sons: Hoboken, NY, USA, 1999. [Google Scholar]
- Gruebner, O.; Lowe, S.R.; Sykora, M.; Shankardass, K.; Subramanian, S.V.; Galea, S. A novel surveillance approach for disaster mental health. PLoS ONE 2017, 12, e0181233. [Google Scholar] [CrossRef] [PubMed]
- Lindell, M.K.; Prater, C.S.; Perry, R.W.; Nicholson, W.C. Fundamentals of Emergency Management; Emond Montgomery Publications: Toronto, ON, Canada, 2006. [Google Scholar]
- Camras, L.A. An Event—Emotion or Event—Expression Hypothesis? A Comment on the Commentaries on Bennett, Bendersky, and Lewis (2002). Infancy 2010, 6, 431–433. [Google Scholar] [CrossRef]
- Goltz, J.D.; Russell, L.A.; Bourque, L.B. Initial behavioural response to a rapid onset disaster. Int. J. Mass Emerg. Disasters 1992, 10, 43–69. [Google Scholar]
- Yi, J.; Nasukawa, T.; Bunescu, R.; Niblack, W. Sentiment Analyzer: Extracting Sentiments about a Given Topic using Natural Language Processing Techniques. In Proceedings of the IEEE International Conference on Data Mining, Melbourne, FL, USA, 22 November 2003; pp. 427–434. [Google Scholar]
- Xu, R.; Wong, K.F.; Xia, Y. Coarse-Fine Opinion Mining—WIA in NTCIR-7 MOAT Task. In Proceedings of the NTCIR 2008, Tokyo, Japan, 16–19 December 2008. [Google Scholar]
- Turney, P.D. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, Philadelphia, Pennsylvania, 6 July 2012; pp. 417–424. [Google Scholar]
- Xiao, Z.; Li, X.; Wang, L.; Yang, Q.; Du, J.; Sangaiah, A.K. Using convolution control block for Chinese sentiment analysis. J. Parallel Distrib. Comput. 2017, 116, 18–26. [Google Scholar] [CrossRef]
- Narayanan, V.; Arora, I.; Bhatia, A. Fast and accurate sentiment classification using an enhanced Naive Bayes model. In Proceedings of the International Conference on Intelligent Data Engineering and Automated Learning; Springer: Berlin/Heidelberg, Germany, 2013; pp. 194–201. [Google Scholar]
- Pang, T.B.; Pang, B.; Lee, L. Thumbs up? Sentiment Classification using Machine Learning. Empir. Methods Nat. Lang. Process. 2002, 10, 79–86. [Google Scholar]
- Jozefowicz, R.; Vinyals, O.; Schuster, M.; Shazeer, N.; Wu, Y. Exploring the Limits of Language Modeling. arXiv, 2016; arXiv:1602.02410. [Google Scholar]
- Bengio, Y.; Ducharme, R.; Vincent, P.; Jauvin, C. A neural probabilistic language model. J. Mach. Learn. Res. 2003, 3, 1137–1155. [Google Scholar]
- Norris, F.H.; Friedman, M.J.; Watson, P.J.; Byrne, C.M.; Diaz, E.; Kaniasty, K. 60,000 Disaster Victims Speak: Part I. An Empirical Review of the Empirical Literature, 1981–2001. Psychiatry-Interpers. Biol. Process. 2002, 65, 207–239. [Google Scholar] [CrossRef]
- Goldmann, E.; Galea, S. Mental health consequences of disasters. Ann. Rev. Public Health 2014, 35, 169. [Google Scholar] [CrossRef] [PubMed]
- Neria, Y.; Shultz, J.M. Mental health effects of Hurricane Sandy: Characteristics, potential aftermath, and response. J. Am. Med. Assoc. 2012, 308, 2571–2572. [Google Scholar] [CrossRef] [PubMed]
- Coyle, D.; Meier, P. New technologies in emergencies and conflicts: The role of information and social networks. Washington D 2009. Available online: https://www.popline.org/node/209135 (accessed on 12 January 2019).
- Tausczik, Y.R.; Pennebaker, J.W. The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods. J. Lang. Soc. Psychol. 2009, 29, 24–54. [Google Scholar] [CrossRef] [Green Version]
- Oh, O.; Kwon, K.H.; Rao, H.R. An Exploration of Social Media in Extreme Events: Rumor Theory and Twitter during the Haiti Earthquake 2010. In Proceedings of the International Conference on Information Systems, Icis 2010, Saint Louis, MO, USA, 12–15 December 2010; p. 231. [Google Scholar]
- Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G.; Dean, J. Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 2013, 26, 3111–3119. [Google Scholar]
- Mikolov, T.; Chen, K.; Corrado, G.; Dean, J. Efficient estimation of word representations in vector space. arXiv, 2013; arXiv:1301.3781. [Google Scholar]
- Mikolov, T.; Kopecky, J.; Burget, L.; Glembek, O.; Cernocky, J. Neural network based language models for highly inflective languages. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 19–24 April 2009; pp. 4725–4728. [Google Scholar]
- Xiong, F.; Deng, Y.; Tang, X. The Architecture of Word2vec and Its Applications. J. Nanjing Norm. Univ. 2015, 1, 43–48. [Google Scholar]
- Kim, Y. Convolutional Neural Networks for Sentence Classification; Association for Computational Linguistics: Doha, Qatar, 2014. [Google Scholar]
- Zhang, D.; Wang, D. Relation Classification: CNN or RNN? 2016. Available online: https://link.springer.com/chapter/10.1007/978-3-319-50496-4_60 (accessed on 12 January 2019).
- Pappas, N.; Popescu-Belis, A. Multilingual Hierarchical Attention Networks for Document Classification. arXiv, 2017; arXiv:1707.00896. [Google Scholar]
- Yin, W.; Kann, K.; Yu, M.; Schtze, H. Comparative Study of CNN and RNN for Natural Language Processing. arXiv, 2017; arXiv:1702.01923. [Google Scholar]
- Tang, D.; Qin, B.; Liu, T. Document modeling with gated recurrent neural network for sentiment classification. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, 17–21 September 2015; pp. 1422–1432. [Google Scholar]
- Report on Text Classification Using CNN, RNN & HAN. Available online: https://medium.com/jatana/report-on-text-classification-using-cnn-rnn-han-f0e887214d5f (accessed on 12 January 2019).
- Collobert, R.; Weston, J. A unified architecture for natural language processing:deep neural networks with multitask learning. In Proceedings of the International Conference on Machine Learning, Helsinki, Finland, 5–9 July 2008; pp. 160–167. [Google Scholar]
- Wang, Y.; Feng, S.; Wang, D.; Yu, G.; Zhang, Y. Multi-label Chinese Microblog Emotion Classification via Convolutional Neural Network. In Proceedings of the Web Technologies and Applications: 18th Asia-Pacific Web Conference, APWeb 2016, Suzhou, China, 23–25 September 2016. [Google Scholar]
- Silverman, B.W. Density Estimation for Statistics and Data Analysis; Chapman and Hall: San Francisco, CA, USA, 1986; pp. 296–297. [Google Scholar]
- Word2Vec. Available online: https://code.google.com/archive/p/word2vec/ (accessed on 12 January 2019).
- Abadi, M.; Agarwal, A.; Barham, P.; Brevdo, E.; Chen, Z.; Citro, C.; Corrado, G.S.; Davis, A.; Dean, J.; Devin, M. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. arXiv, 2016; arXiv:1603.04467. [Google Scholar]
Period of Time | Cluster | Emotion Categories | Major Emotion Category |
---|---|---|---|
08:02 h to 08:12 h (Figure 8a) | Cluster 1 | Anxious | Anxious |
Cluster 2 | Fearful | Fearful | |
Cluster 3 | Anxious, fearful, angry | Fearful | |
08:12 h to 08:22 h (Figure 8b) | Cluster 1 | Anxious, fearful, neutral | Fearful |
Cluster 2 | Anxious, angry, fearful | Fearful | |
Cluster 3 | Angry, fearful, positive | Fearful | |
08:22 h to 08:40 h (Figure 8c) | Cluster 1 | Anxious, angry, Fearful, sad, neutral, positive | Fearful |
8:40 to 9:30 (Figure 8d) | Cluster 1 | Anxious, angry, Fearful, neutral, positive | Fearful |
09:30 h to 10:30 h (Figure 8e) | Cluster 1 | Anxious, fearful, Neutral, positive anxious | Anxious, fearful anxious |
10:30 h to 11:30 h (Figure 8f) | Cluster 1 | Fearful, positive, neutral | Fearful |
Cluster 2 | Neutral, sad | Neutral, sad | |
11:30 h to 12:30 h (Figure 8g) | Cluster 1 | Fearful, neutral positive, anxious, sad | Positive |
Cluster 2 | Neutral, angry | Angry | |
Cluster 3 | Angry, anxious, neutral | Anxious | |
Cluster 4 | Neutral, positive | Neutral |
Emotional Category | Precision (P) | Recall (R) | Comprehensive Evaluation Index (F-1) |
---|---|---|---|
Positive | 82.25% | 80.00% | 82.54% |
Neutral | 84.21% | 87.91% | 86.02% |
Angry | 91.57% | 86.36% | 88.89% |
Sad | 88.54% | 85.00% | 86.73% |
Anxious | 78.47% | 85.27% | 81.77% |
Fearful | 84.69% | 85.57% | 85.13% |
City | The Proportion of Data with Accurate Address Information | The Proportion of Data with Rough Address Information | Total |
---|---|---|---|
Chengdu | 8.08% | 2.16% | 10.24% |
Ya’an | 12.18% | 13.91% | 26.09% |
Mianyang | 9.45% | 2.60% | 12.05% |
Leshan | 9.81% | 4.85% | 14.66% |
Meishan | 13.30 % | 7.44% | 20.74% |
Deyang | 8.71% | 5.79% | 14.50% |
Aba | 11.29% | 23.51% | 34.80% |
Ziyang | 5.10% | 3.10% | 8.20% |
Neijiang | 4.79% | 2.33% | 7.12% |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, T.; Xie, J.; Li, G.; Mou, N.; Li, Z.; Tian, C.; Zhao, J. Social Media Big Data Mining and Spatio-Temporal Analysis on Public Emotions for Disaster Mitigation. ISPRS Int. J. Geo-Inf. 2019, 8, 29. https://doi.org/10.3390/ijgi8010029
Yang T, Xie J, Li G, Mou N, Li Z, Tian C, Zhao J. Social Media Big Data Mining and Spatio-Temporal Analysis on Public Emotions for Disaster Mitigation. ISPRS International Journal of Geo-Information. 2019; 8(1):29. https://doi.org/10.3390/ijgi8010029
Chicago/Turabian StyleYang, Tengfei, Jibo Xie, Guoqing Li, Naixia Mou, Zhenyu Li, Chuanzhao Tian, and Jing Zhao. 2019. "Social Media Big Data Mining and Spatio-Temporal Analysis on Public Emotions for Disaster Mitigation" ISPRS International Journal of Geo-Information 8, no. 1: 29. https://doi.org/10.3390/ijgi8010029
APA StyleYang, T., Xie, J., Li, G., Mou, N., Li, Z., Tian, C., & Zhao, J. (2019). Social Media Big Data Mining and Spatio-Temporal Analysis on Public Emotions for Disaster Mitigation. ISPRS International Journal of Geo-Information, 8(1), 29. https://doi.org/10.3390/ijgi8010029