Abstract
Social media has become a substitute for social interaction, and the amount of medical and clinical information on the web is growing accordingly. Monitoring Personal Health Mentions (PHM) on social media is an active area of research that aims to predict whether a given piece of text reports a health condition. The basic approach is to look at how disease or symptom words are used in the text. However, because such words are often used figuratively, their presence does not always indicate a health condition. Prior work attempts to address this by combining contextual word representations with sentiment information. However, these methods are unable to capture the complete context in which a symptom word is used. In this work, we incorporate a permutation-based contextual word representation for health mention detection, which captures the context of disease words in the given text efficiently and hence improves the performance of the classifier. To evaluate the effectiveness of the proposed method, we perform experiments on a public benchmark dataset, which show an improvement of 5.5% in F-score over the state-of-the-art health mention detection classifier. (Code is available at https://github.com/pervaizniazi/Figurative-Mention.)
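As an illustration only, the sketch below shows why a permutation-based objective can expose a symptom word to context on both sides. It assumes that "permutation-based" refers to permutation language modelling of the kind used by XLNet, where each token is predicted from the tokens that precede it in a randomly sampled factorization order. The example sentence and the helper names `permutation_context` and `coverage` are hypothetical and are not taken from the paper or its repository.

```python
import random


def permutation_context(tokens, seed=0):
    """Sample one factorization order and return, for each position,
    the positions it may condition on (those earlier in the order)."""
    rng = random.Random(seed)
    order = list(range(len(tokens)))
    rng.shuffle(order)
    visible = {}
    seen = []
    for pos in order:
        visible[pos] = sorted(seen)  # context available when predicting pos
        seen.append(pos)
    return order, visible


def coverage(tokens, target, n_orders=200):
    """Union of the context positions available to `target`
    across many sampled factorization orders."""
    covered = set()
    for seed in range(n_orders):
        _, visible = permutation_context(tokens, seed)
        covered.update(visible[target])
    return covered


# Hypothetical figurative mention, made up for illustration.
tokens = "that exam gave me a heart attack".split()
target = tokens.index("heart")

order, visible = permutation_context(tokens, seed=1)
print("sampled order:", [tokens[i] for i in order])
print("context for 'heart':", [tokens[i] for i in visible[target]])
print("positions covered over many orders:", sorted(coverage(tokens, target)))
```

Under a single sampled order the symptom word sees only part of the sentence, but across many orders it is conditioned on words to its left and to its right, which is the intuition behind using such representations to separate figurative from literal mentions.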
Notes
1. This is an original tweet taken from Twitter.
Acknowledgement
The authors would like to thank Shoaib Ahmed Siddiqui and Muhammad Nabeel Asim for providing useful feedback during this work.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Khan, P.I., Razzak, I., Dengel, A., Ahmed, S. (2020). Improving Personal Health Mention Detection on Twitter Using Permutation Based Word Representation Learning. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science, vol 12532. Springer, Cham. https://doi.org/10.1007/978-3-030-63830-6_65
DOI: https://doi.org/10.1007/978-3-030-63830-6_65
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63829-0
Online ISBN: 978-3-030-63830-6
eBook Packages: Computer Science, Computer Science (R0)