Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3394486.3412861acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs

Published: 20 August 2020 Publication History

Abstract

People increasingly search online for answers to their medical questions but the rate at which medical questions are asked online significantly exceeds the capacity of qualified people to answer them. This leaves many questions unanswered or inadequately answered. Many of these questions are not unique, and reliable identification of similar questions would enable more efficient and effective question answering schema. COVID-19 has only exacerbated this problem. Almost every government agency and healthcare organization has tried to meet the informational need of users by building online FAQs, but there is no way for people to ask their question and know if it is answered on one of these pages. While many research efforts have focused on the problem of general question similarity, these approaches do not generalize well to domains that require expert knowledge to determine semantic similarity, such as the medical domain. In this paper, we show how a double fine-tuning approach of pretraining a neural network on medical question-answer pairs followed by fine-tuning on medical question-question pairs is a particularly useful intermediate task for the ultimate goal of determining medical question similarity. While other pretraining tasks yield an accuracy below 78.7% on this task, our model achieves an accuracy of 82.6% with the same number of training examples, an accuracy of 80.0% with a much smaller training set, and an accuracy of 84.5% when the full corpus of medical question-answer data is used. We also describe a currently live system that uses the trained model to match user questions to COVID-related FAQs.

Supplementary Material

MP4 File (3394486.3412861.mp4)
Video presenting Curai's work on "Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs". In this work, we show how, when one does not have enough labeled data for a given task, a double fine-tuning approach of fine-tuning on an appropriate in-domain task before fine-tuning on the final task helps imbibe domain knowledge into the model and boosts model performance. As a part of this work, we also release a medical question-question similarity dataset. We apply our model to help match user's COVID-related questions to FAQs released by various institutes such as CDC and FDA.

References

[1]
2009. The Social Life of Health Information. https://www.pewresearch.org/ internet/2009/06/11/a-shifting-landscape/#fn-536-8
[2]
2019. Dr Google will see you now: Search giant wants to cash in on your medical queries. https://www.telegraph.co.uk/technology/2019/03/10/google-siftingone-billion-health-questions-day
[3]
Asma Ben Abacha and Dina Demner-Fushman. 2016. Recognizing Question Entailment for Medical Question Answering.
[4]
Asma Ben Abacha and Dina Demner-Fushman. 2019. A Question-Entailment Approach to Question Answering. arxiv: cs.CL/1901.08079
[5]
AIcrowd. 2019. MEDIQA 2019 - Recognizing Question Entailment (RQE). https://www.aicrowd.com/challenges/mediqa-2019-recognizing-question-entailment-rqe
[6]
Iz Beltagy, Kyle Lo, and Arman Cohan. 2019. SciBERT: Pretrained Contextualized Embeddings for Scientific Text. arxiv: cs.CL/1903.10676
[7]
Dasha Bogdanova, C'icero dos Santos, Luciano Barbosa, and Bianca Zadrozny. 2015. Detecting Semantically Equivalent Questions in Online User Forums. In Proceedings of the Nineteenth Conference on Computational Natural Language Learning. Association for Computational Linguistics, Beijing, China, 123--131. https://doi.org/10.18653/v1/K15-1013
[8]
Li Cai, Guangyou Zhou, Kang Liu, and Jun Zhao. 2011. Learning the latent topics for question retrieval in community qa. In Proceedings of 5th International Joint Conference on Natural Language Processing.
[9]
Yllias Chali and Rafat Islam. 2018. Question-Question Similarity in Online Forums. In Proceedings of the 10th Annual Meeting of the Forum for Information Retrieval Evaluation (FIRE'18). Association for Computing Machinery, 21--28.
[10]
Kornel Csernai. 2017. Quora. https://www.quora.com/q/quoradata/First-Quora-Dataset-Release-Question-Pairs
[11]
Arpita Das, Harish Yenala, Manoj Chinnakotla, and Manish Shrivastava. 2016. Together we stand: Siamese Networks for Similar Question Retrieval. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Berlin, Germany, 378--387.
[12]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arxiv: cs.CL/1810.04805
[13]
durakkerem. 2018. Medical Question Answer Datasets. https://github.com/durakkerem/Medical-Question-Answer-Datasets
[14]
Kexin Huang, Jaan Altosaar, and Rajesh Ranganath. 2019. ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission. arxiv: cs.CL/1904.05342
[15]
Jiwoon Jeon, W. Croft, and Joon Lee. 2005. Finding similar questions in large question and answer archives. International Conference on Information and Knowledge Management, Proceedings, 84--90.
[16]
Zongcheng Ji, Fei Xu, Bin Wang, and Ben He. 2012. Question-Answer Topic Model for Question Retrieval in Community Question Answering. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management. Association for Computing Machinery, 2471--2474.
[17]
Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2019. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. arxiv: cs.CL/1901.08746
[18]
Tao Lei, Hrishikesh Joshi, Regina Barzilay, Tommi Jaakkola, Kateryna Tymoshenko, Alessandro Moschitti, and Lluís Màrquez. 2016. Semi-supervised Question Retrieval with Gated Convolutions. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2016). https://doi.org/10.18653/v1/n16-1153
[19]
Yaliang Li, Liuyi Yao, Nan Du, Jing Gao, Qi Li, Chuishi Meng, Chenwei Zhang, and Wei Fan. 2018. Finding Similar Medical Questions from Question Answering Websites. arxiv: cs.AI/1810.05983
[20]
Xiaodong Liu, Pengcheng He, Weizhu Chen, and Jianfeng Gao. 2019. Multi-Task Deep Neural Networks for Natural Language Understanding. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/p19-1441
[21]
Lasse Regin Nielsen. 2017. Medical Question Answer Data. https://github.com/LasseRegin/medical-question-answer-data
[22]
Matthew Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, and Luke Zettlemoyer. 2018. Deep Contextualized Word Representations. https://doi.org/10.18653/v1/n18--1202
[23]
Jason Phang, Thibault Févry, and Samuel R. Bowman. 2018. Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks. arxiv: cs.CL/1811.01088
[24]
Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2019. Language models are unsupervised multitask learners. OpenAI Blog (2019).
[25]
Alexander Ratner, Stephen H. Bach, Henry Ehrenberg, Jason Fries, Sen Wu, and Christopher Ré. 2017. Snorkel. https://doi.org/10.14778/3157794.3157797
[26]
Alon Talmor and Jonathan Berant. 2019. MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension. arxiv: cs.CL/1905.13453
[27]
Xiaobing Xue, Jiwoon Jeon, and W. Bruce Croft. 2008. Retrieval Models for Question and Answer Archives. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery, New York, NY, USA, 475--482.
[28]
Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, and Quoc V. Le. 2019. XLNet: Generalized Autoregressive Pretraining for Language Understanding. arxiv: cs.CL/1906.08237
[29]
Kai Zhang, Wei Wu, Haocheng Wu, Zhoujun Li, and Ming Zhou. 2014. Question Retrieval with High Quality Answers in Community Question Answering. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. Association for Computing Machinery, 371--380.

Cited By

View all
  • (2024)Generative AI for Evidence-Based Medicine: A PICO GenAI for Synthesizing Clinical Case ReportsICC 2024 - IEEE International Conference on Communications10.1109/ICC51166.2024.10622271(1503-1508)Online publication date: 9-Jun-2024
  • (2024)A comparison review of transfer learning and self-supervised learningExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.122807242:COnline publication date: 16-May-2024
  • (2023)Identifying the Question Similarity of Regulatory Documents in the Pharmaceutical Industry by Using the Recognizing Question Entailment System: Evaluation StudyJMIR AI10.2196/434832(e43483)Online publication date: 26-Sep-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
August 2020
3664 pages
ISBN:9781450379984
DOI:10.1145/3394486
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 August 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. expert domains
  2. healthcare
  3. medicine
  4. question similarity
  5. transfer learning

Qualifiers

  • Research-article

Conference

KDD '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)25
  • Downloads (Last 6 weeks)2
Reflects downloads up to 24 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Generative AI for Evidence-Based Medicine: A PICO GenAI for Synthesizing Clinical Case ReportsICC 2024 - IEEE International Conference on Communications10.1109/ICC51166.2024.10622271(1503-1508)Online publication date: 9-Jun-2024
  • (2024)A comparison review of transfer learning and self-supervised learningExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.122807242:COnline publication date: 16-May-2024
  • (2023)Identifying the Question Similarity of Regulatory Documents in the Pharmaceutical Industry by Using the Recognizing Question Entailment System: Evaluation StudyJMIR AI10.2196/434832(e43483)Online publication date: 26-Sep-2023
  • (2023)Review of Natural Language Processing in PharmacologyPharmacological Reviews10.1124/pharmrev.122.00071575:4(714-738)Online publication date: 17-Mar-2023
  • (2023)Privacy-Preserving-Enabled Lightweight COVID-19 Simulation Model for Mobile Intelligent ApplicationIEEE Internet of Things Journal10.1109/JIOT.2022.316268710:8(6742-6755)Online publication date: 15-Apr-2023
  • (2023)A text style transfer system for reducing the physician–patient expertise gap: An analysis with automatic and human evaluationsExpert Systems with Applications10.1016/j.eswa.2023.120874233(120874)Online publication date: Dec-2023
  • (2023)KisanQRS: A deep learning-based automated query-response system for agricultural decision-makingComputers and Electronics in Agriculture10.1016/j.compag.2023.108180213(108180)Online publication date: Oct-2023
  • (2023)Understanding Visitors’ Curiosity in a Science Centre with Deep Question Processing NetworkInternational Journal of Artificial Intelligence in Education10.1007/s40593-023-00377-834:3(1072-1101)Online publication date: 20-Nov-2023
  • (2023)An answer recommendation framework for an online cancer community forumMultimedia Tools and Applications10.1007/s11042-023-15477-983:1(173-199)Online publication date: 15-May-2023
  • (2022)Identifying Similar Questions in the Medical Domain Using a Fine-tuned Siamese-BERT Model2022 IEEE 19th India Council International Conference (INDICON)10.1109/INDICON56171.2022.10040144(1-6)Online publication date: 24-Nov-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media