Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Exploring Disorder-Aware Attention for Clinical Event Extraction

Published: 17 April 2020 Publication History

Abstract

Event extraction is one of the crucial tasks in biomedical text mining that aims to extract specific information concerning incidents embedded in the texts. In this article, we propose a deep learning framework that aims to identify the attributes (severity, course, temporal expression, and document creation time) associated with the medical concepts extracted from electronic medical records. The bi-directional long short-term memory network assisted by the attention mechanism is utilized to uncover the important aspects of the patient’s medical conditions. The attention mechanism specific to the medical disorder mention can focus on various parts of the sentence when different disorders are considered as input. The proposed methodology is evaluated on benchmark ShARe/CLEF eHealth Evaluation Lab 2014 shared task 2 datasets. In addition to the CLEF dataset, we also used the social media text, especially the medical blog posts. Experimental results of the proposed approach illustrate that our proposed approach achieves significant performance improvements over the state-of-the-art techniques and the highly competitive deep learning--based baseline methods.

Supplementary Material

a31-yadav-suppl.pdf (yadav.zip)
Supplemental movie, appendix, image and software files for, Exploring Disorder-Aware Attention for Clinical Event Extraction

References

[1]
Susannah Fox. 2011. The Social Life of Health Information, 2011. Pew Internet 8 American Life Project, Washington, DC.
[2]
Mark Dredze and Michael J. Paul. 2014. Natural language processing for health and social media. IEEE Intelligent Systems 29, 2 (2014), 64--67.
[3]
Yang Liu, Songhua Xu, Hong-Jun Yoon, and Georgia Tourassi. 2014. Extracting patient demographics and personal medical information from online health forums. In AMIA Annual Symposium Proceedings, Vol. 2014. American Medical Informatics Association, Bethesda, MD, 1825--1834.
[4]
Per E. Kummervold, Deede Gammon, Svein Bergvik, Jan-Are K. Johnsen, Toralf Hasvold, and Jan H. Rosenvinge. 2002. Social support in a wired world: Use of online mental health forums in Norway. Nordic Journal of Psychiatry 56, 1 (2002), 59--65.
[5]
George Hripcsak, Nicholas D. Soulakis, Li Li, Frances P. Morrison, Albert M. Lai, Carol Friedman, Neil S. Calman, and Farzad Mostashari. 2009. Syndromic surveillance using ambulatory electronic health records. Journal of the American Medical Informatics Association 16, 3 (2009), 354--361.
[6]
Carlo Combi and Yuval Shahar. 1997. Temporal reasoning and temporal data maintenance in medicine: issues and challenges. Computers in Biology and Medicine 27, 5 (1997), 353--368.
[7]
Jennifer D’Souza and Vincent Ng. 2014. Knowledge-rich temporal relation identification and classification in clinical notes. Database 2014 (2014), bau109.
[8]
Luis Fernandez-Luque, Randi Karlsen, and Jason Bonander. 2011. Review of extracting information from the Social Web for health personalization. Journal of Medical Internet Research 13, 1 (2011), e15.
[9]
Liadh Kelly, Lorraine Goeuriot, Hanna Suominen, Tobias Schreck, Gondy Leroy, Danielle L. Mowery, Sumithra Velupillai, et al. 2014. Overview of the ShARe/CLEF eHealth Evaluation Lab 2014. In Proceedings of the International Conference of the Cross-Language Evaluation Forum for European Languages. 172--191.
[10]
Li Zhou, Carol Friedman, Simon Parsons, and George Hripcsak. 2005. System architecture for temporal information extraction, representation and reasoning in clinical narrative reports. In AMIA Annual Symposium Proceedings, Vol. 2005. American Medical Informatics Association, Bethesda, MD, 869--873.
[11]
Buzhou Tang, Yonghui Wu, Min Jiang, Yukun Chen, Joshua C. Denny, and Hua Xu. 2013. A hybrid system for temporal information extraction from clinical text. Journal of the American Medical Informatics Association 20, 5 (2013), 828--835.
[12]
Chen Lin, Dmitriy Dligach, Timothy A. Miller, Steven Bethard, and Guergana K. Savova. 2015. Multilayered temporal modeling for the clinical domain. Journal of the American Medical Informatics Association 23, 2 (2015), 387--395.
[13]
Weiyi Sun, Anna Rumshisky, and Ozlem Uzuner. 2013. Annotating temporal information in clinical narratives. Journal of Biomedical Informatics 46 (2013), S5--S12.
[14]
Nishikant Johri, Yoshiki Niwa, and Veera Raghavendra Chikka. 2014. Optimizing Apache cTAKES for disease/disorder template filling: Team HITACHI in 2014 ShARe/CLEF eHealth Evaluation Lab. In Proceedings of the Conference and Labs of the Evaluation Forum (CLEF’14).
[15]
S. V. Ramanan and P. Senthil Nathan. 2014. Cocoa: Extending a rule-based system to tag disease attributes in clinical records. In Working Notes for the CLEF 2014 Conference.
[16]
Azadeh Nikfarjam, Abeed Sarker, Karen O’Connor, Rachel Ginn, and Graciela Gonzalez. 2015. Pharmacovigilance from social media: Mining adverse drug reaction mentions using sequence labeling with word embedding cluster features. Journal of the American Medical Informatics Association 22, 3 (2015), 671--681.
[17]
Glen Coppersmith, Mark Dredze, Craig Harman, Kristy Hollingshead, and Margaret Mitchell. 2015. CLPsych 2015 shared task: Depression and PTSD on Twitter. In Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality. 31--39.
[18]
Donghee Yvette Wohn, Casey Fiesler, Libby Hemphill, Munmun De Choudhury, and J. Nathan Matias. 2017. How to handle online risks? Discussing content curation and moderation in social media. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems. ACM, New York, NY, 1271--1276.
[19]
Alex Graves. 2013. Generating sequences with recurrent neural networks. arXiv:1308.0850.
[20]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv:1409.0473.
[21]
Shweta Yadav, Asif Ekbal, Sriparna Saha, Ankit Kumar, and Pushpak Bhattacharyya. 2019. Feature assisted stacked attentive shortest dependency path based Bi-LSTM model for protein--protein interaction. Knowledge-Based Systems 166 (2019), 18--29.
[22]
Dhanachandra Ningthoujam, Shweta Yadav, Pushpak Bhattacharyya, and Asif Ekbal. 2019. Relation extraction between the clinical entities based on the shortest dependency path based LSTM. arXiv:1903.09941.
[23]
Jason P. C. Chiu and Eric Nichols. 2016. Named entity recognition with bidirectional LSTM-CNNs. Transactions of the Association for Computational Linguistics 4 (2016), 357--370.
[24]
Peng Zhou, Zhenyu Qi, Suncong Zheng, Jiaming Xu, Hongyun Bao, and Bo Xu. 2016. Text classification improved by integrating bidirectional LSTM with two-dimensional max pooling. arXiv:1611.06639.
[25]
Yequan Wang, Minlie Huang, Xiaoyan Zhu, and Li Zhao. 2016. Attention-based LSTM for aspect-level sentiment classification. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 606--615.
[26]
Deepak Gupta, Rajkumar Pujari, Asif Ekbal, Pushpak Bhattacharyya, Anutosh Maitra, Tom Jain, and Shubhashis Sengupta. 2018. Can taxonomy help? Improving semantic question matching using question taxonomy. In Proceedings of the 27th International Conference on Computational Linguistics. 499--513. https://www.aclweb.org/anthology/C18-1042.
[27]
Anutosh Maitra, Shubhashis Sengupta, Abhisek Mukhopadhyay, Deepak Gupta, Rajkumar Pujari, Pushpak Bhattacharya, Asif Ekbal, and Tom Geo Jain. 2018. Semantic question matching in data constrained environment. In Text, Speech, and Dialogue, P. Sojka, A. Horák, I. Kopeček, and K. Pala (Eds.). Springer International Publishing, Cham, Switzerland, 267--276.
[28]
Danielle L. Mowery, Sumithra Velupillai, Brett R. South, Lee Christensen, David Martinez, Liadh Kelly, Lorraine Goeuriot, et al. 2014. Task 2: ShARe/CLEF eHealth Evaluation Lab 2014. In Proceedings of the Conference and Labs of the Evaluation Forum (CLEF’14).
[29]
Thierry Hamon, Cyril Grouin, and Pierre Zweigenbaum. 2014. Disease and disorder template filling using rule-based and statistical approaches. In Working Notes for the CLEF 2014 Conference. 79--90.
[30]
Tigran Mkrtchyan and Daniel Sonntag. 2014. Deep parsing at the CLEF2014 IE task (DFKI-Medical). In CEUR Workshop Proceedings, Vol. 1180. 138--146.
[31]
Konrad Herbst, Cindy Fähnrich, Mariana L. Neves, and Matthieu-P. Schapranow. 2014. Applying in-memory technology for automatic template filling in the clinical domain. In Working Notes for the CLEF 2014 Conference. 91--102.
[32]
Noémie Elhadad, Sameer Pradhan, Sharon Gorman, Suresh Manandhar, Wendy Chapman, and Guergana Savova. 2015. SemEval-2015 task 14: Analysis of clinical text. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval’15). 303--310.
[33]
Jun Xu, Yaoyun Zhang, Jingqi Wang, Yonghui Wu, Min Jiang, Ergin Soysal, and Hua Xu. 2015. UTH-CCB: The participation of the SemEval 2015 challenge—Task 14. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval’15). 311--314.
[34]
Parth Pathak, Pinal Patel, Vishal Panchal, Sagar Soni, Kinjal Dani, Amrish Patel, and Narayan Choudhary. 2015. ezDI: A supervised NLP system for clinical narrative analysis. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval’15). 412--416.
[35]
Weiyi Sun, Anna Rumshisky, and Ozlem Uzuner. 2013. Evaluating temporal relations in clinical text: 2012 i2b2 challenge. Journal of the American Medical Informatics Association 20, 5 (2013), 806--813.
[36]
Steven Bethard, Guergana Savova, Wei-Te Chen, Leon Derczynski, James Pustejovsky, and Marc Verhagen. 2016. SemEval-2016 Task 12: Clinical TempEval. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval’16). 1052--1062.
[37]
Steven Bethard, Leon Derczynski, Guergana Savova, James Pustejovsky, and Marc Verhagen. 2015. SemEval-2015 Task 6: Clinical TempEval. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval’15). 806--814.
[38]
Sean MacAvaney, Arman Cohan, and Nazli Goharian. 2017. GUIR at SemEval-2017 Task 12: A framework for cross-domain clinical temporal information extraction. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval’17). 1024--1029.
[39]
James Pustejovsky and Amber Stubbs. 2011. Increasing informativeness in temporal annotation. In Proceedings of the 5th Linguistic Annotation Workshop. 152--160.
[40]
Weiyi Sun, Anna Rumshisky, and Ozlem Uzuner. 2013. Temporal reasoning over clinical text: The state of the art. Journal of the American Medical Informatics Association 20, 5 (2013), 814--819.
[41]
Yu Long, Zhijing Li, Xuan Wang, and Chen Li. 2017. XJNLP at SemEval-2017 Task 12: Clinical temporal information ex-traction with a hybrid model. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval’17). 1014--1018.
[42]
Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, and Hannaneh Hajishirzi. 2016. Bidirectional attention flow for machine comprehension. arXiv:1611.01603.
[43]
Deepak Gupta, Sarah Kohail, and Pushpak Bhattacharyya. 2018. Combining graph-based dependency features with convolutional neural network for answer triggering. arXiv:1808.01650.
[44]
Deepak Gupta, Pabitra Lenka, Asif Ekbal, and Pushpak Bhattacharyya. 2018. Uncovering code-mixed challenges: A framework for linguistically driven question generation and neural based question answering. In Proceedings of the 22nd Conference on Computational Natural Language Learning. 119--130.
[45]
Deepak Gupta, Surabhi Kumari, Asif Ekbal, and Pushpak Bhattacharyya. 2018. MMQA: A multi-domain multi-lingual question-answering framework for English and Hindi. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC’18). https://www.aclweb.org/anthology/L18-1440.
[46]
Mahmood Yousefi Azar, Kairit Sirts, Diego Mollá Aliod, and Len Hamey. 2015. Query-based single document summarization using an ensemble noisy auto-encoder. In Proceedings of the Australasian Language Technology Association Workshop 2015. 2--10. https://www.aclweb.org/anthology/U15-1001.
[47]
Johan Hasselqvist, Niklas Helmertz, and Mikael Kågebäck. 2017. Query-based abstractive summarization using neural networks. arXiv:1712.06100.
[48]
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems. 3104--3112.
[49]
Preksha Nema, Mitesh M. Khapra, Anirban Laha, and Balaraman Ravindran. 2017. Diversity driven attention model for query-based abstractive summarization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1063--1072.
[50]
Ming Tan, Cicero dos Santos, Bing Xiang, and Bowen Zhou. 2015. LSTM-based deep learning models for non-factoid answer selection. arXiv:1511.04108.
[51]
Qin Chen, Qinmin Hu, Jimmy Xiangji Huang, Liang He, and Weijie An. 2017. Enhancing recurrent neural networks with positional attention for question answering. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 993--996.
[52]
Chuanqi Tan, Furu Wei, Nan Yang, Bowen Du, Weifeng Lv, and Ming Zhou. 2017. S-Net: From answer extraction to answer generation for machine reading comprehension. arXiv:1706.04815.
[53]
Tim Rocktäschel, Edward Grefenstette, Karl Moritz Hermann, Tomáš Kočiskỳ, and Phil Blunsom. 2015. Reasoning about entailment with neural attention. arXiv:1509.06664.
[54]
Mohammed Saeed, Christine Lieu, Greg Raber, and Roger G. Mark. 2002. MIMIC II: A massive temporal ICU patient database to support research in intelligent patient monitoring. In Proceedings of Computers in Cardiology. 641--644.
[55]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980.
[56]
Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global vectors for word representation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’14). 1532--1543. http://www.aclweb.org/anthology/D14-1162.
[57]
Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Retrieved February 13, 2020 from https://www.tensorflow.org/ (Software available from tensorflow.org.)
[58]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long and Short Papers). 4171--4186.
[59]
Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2019. Biobert: Pre-trained biomedical language representation model for biomedical text mining. arXiv:1901.08746.
[60]
Munmun De Choudhury, Sanket S. Sharma, Tomaz Logar, Wouter Eekhout, and René Clausen Nielsen. 2017. Gender and cross-cultural differences in social media disclosures of mental illness. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing. ACM, New York, NY, 353--369.
[61]
Sindhu Kiranmai Ernala, Tristan Labetoulle, Fred Bane, Michael L. Birnbaum, Asra F. Rizvi, John M. Kane, and Munmun De Choudhury. 2018. Characterizing audience engagement and assessing its impact on social media disclosures of mental illnesses. In Proceedings of the 12th International AAAI Conference on Web and Social Media.
[62]
Michael L. Birnbaum, Sindhu Kiranmai Ernala, Asra F. Rizvi, Munmun De Choudhury, and John M. Kane. 2017. A collaborative approach to identifying social media markers of schizophrenia by employing machine learning and clinical appraisals. Journal of Medical Internet Research 19, 8 (2017), e289.
[63]
Eva Sharma and Munmun De Choudhury. 2018. Mental health support and its relationship to linguistic accommodation in online communities. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. ACM, New York, NY, 641.
[64]
Sarmistha Dutta, Jennifer Ma, and Munmun De Choudhury. 2018. Measuring the impact of anxiety on online social interactions. In Proceedings of the 12th International AAAI Conference on Web and Social Media.
[65]
Shweta Yadav, Asif Ekbal, Sriparna Saha, and Pushpak Bhattacharyya. 2019. A unified multi-task adversarial learning framework for pharmacovigilance mining. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 5234--5245.
[66]
Shweta Yadav, Joy Sain, Amit Sheth, Asif Ekbal, Sriparna Saha, and Pushpak Bhattacharyya. 2018. Leveraging medical sentiment to understand patients health on social media. arXiv:1807.11172.
[67]
Shweta Yadav, Asif Ekbal, Sriparna Saha, and Pushpak Bhattacharyya. 2018. Medical sentiment analysis using social media: Towards building a patient assisted system. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC’18).
[68]
Shweta Yadav, Asif Ekbal, Sriparna Saha, Pushpak Bhattacharyya, and Amit Sheth. 2018. Multi-task learning framework for mining crowd intelligence towards clinical treatment. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers). 271--277.
[69]
Silvio Amir, Mark Dredze, and John W. Ayers. 2019. Mental health surveillance over social media with digital cohorts. In Proceedings of the 6th Workshop on Computational Linguistics and Clinical Psychology. 114--120.
[70]
Munmun De Choudhury, Emre Kiciman, Mark Dredze, Glen Coppersmith, and Mrinal Kumar. 2016. Discovering shifts to suicidal ideation from mental health content in social media. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, New York, NY, 2098--2110.

Cited By

View all
  • (2022)Construction and Application of Event Logic Graph: A SurveyDatabase Systems for Advanced Applications. DASFAA 2022 International Workshops10.1007/978-3-031-11217-1_12(160-174)Online publication date: 11-Apr-2022
  • (2021)LSLSD: Fusion Long Short-Level Semantic Dependency of Chinese EMRs for Event ExtractionApplied Sciences10.3390/app1116723711:16(7237)Online publication date: 5-Aug-2021
  • (2021)“When they say weed causes depression, but it’s your fav antidepressant”: Knowledge-aware attention framework for relationship extractionPLOS ONE10.1371/journal.pone.024829916:3(e0248299)Online publication date: 25-Mar-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications
ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 16, Issue 1s
Special Issue on Multimodal Machine Learning for Human Behavior Analysis and Special Issue on Computational Intelligence for Biomedical Data and Imaging
January 2020
376 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/3388236
Issue’s Table of Contents
© 2020 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 April 2020
Accepted: 01 November 2019
Revised: 01 November 2019
Received: 01 May 2019
Published in TOMM Volume 16, Issue 1s

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Neural networks
  2. attention
  3. clinical event extraction
  4. event extraction
  5. social media
  6. temporal event extraction

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Construction and Application of Event Logic Graph: A SurveyDatabase Systems for Advanced Applications. DASFAA 2022 International Workshops10.1007/978-3-031-11217-1_12(160-174)Online publication date: 11-Apr-2022
  • (2021)LSLSD: Fusion Long Short-Level Semantic Dependency of Chinese EMRs for Event ExtractionApplied Sciences10.3390/app1116723711:16(7237)Online publication date: 5-Aug-2021
  • (2021)“When they say weed causes depression, but it’s your fav antidepressant”: Knowledge-aware attention framework for relationship extractionPLOS ONE10.1371/journal.pone.024829916:3(e0248299)Online publication date: 25-Mar-2021
  • (2021)Overview of CCKS 2020 Task 3: Named Entity Recognition and Event Extraction in Chinese Electronic Medical RecordsData Intelligence10.1162/dint_a_000933:3(376-388)Online publication date: 8-Sep-2021
  • (2021)Hierarchical deep multi-modal network for medical visual question answeringExpert Systems with Applications10.1016/j.eswa.2020.113993164(113993)Online publication date: Feb-2021
  • (2021)Data structuring of electronic health records: a systematic reviewHealth and Technology10.1007/s12553-021-00607-wOnline publication date: 29-Oct-2021

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media