Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2930971.2930977acmotherconferencesArticle/Chapter ViewAbstractPublication PagessmsocietyConference Proceedingsconference-collections
research-article

Quantifying Self-Reported Adverse Drug Events on Twitter: Signal and Topic Analysis

Published: 11 July 2016 Publication History

Abstract

When a drug that is sold exhibits side effects, a well functioning ecosystem of pharmaceutical drug suppliers includes responsive regulators and pharmaceutical companies. Existing systems for monitoring adverse drug events, such as the Federal Adverse Events Reporting System (FAERS) in the US, have shown limited effectiveness due to the lack of incentives for healthcare professionals and patients. While social media present opportunities to mine information about adverse events in near real-time, there are still important questions to be answered in order to understand their impact on pharmacovigilance. First, it is not known how many relevant social media posts occur per day on platforms like Twitter, i.e., whether there is "enough signal" for a post-market pharmacovigilance program based on Twitter mining. Second, it is not known what other topics are discussed by users in posts mentioning pharmaceutical drugs.
In this paper, we outline how social media can be used as a human sensor for drug use monitoring. We introduce a large-scale, near real-time system for computational pharmacovigilance, and use our system to estimate the order of magnitude of the volume of daily self-reported pharmaceutical drug side effect tweets. The processing pipeline comprises a set of cascaded filters, followed by a supervised machine learning classifier. The cascaded filters quickly reduce the volume to a manageable sub-stream, from which a Support Vector Machine (SVM) based classifier identifies adverse events based on a rich set of features taking into account surface-textual properties, as well as domain knowledge about drugs, side effects and the Twitter medium. Using a dataset of 10,000 manually annotated tweets, a SVM classifier achieves F1=60.4% and AUC=0.894. The yield of the classifier for a drug universe comprising 2,600 keywords is 721 tweets per day. We also investigate what other topics are discussed in the posts mentioning pharmaceutical drugs. We conclude by suggesting an ecosystem where regulators and pharmaceutical companies utilize social media to obtain feedback about consequences of pharmaceutical drug use.

References

[1]
J. Bian, U. Topaloglu, and F. Yu. Towards Large-scale Twitter Mining for Drug-related Adverse Events. In Proceedings of the 2012 International Workshop on Smart Health and Wellbeing, SHB '12, pages 25--32, 2012.
[2]
K.E. Campbell, D.E. Oliver, and E.H. Shortliffe. The Unified Medical Language System: Toward a Collaborative Approach for Solving Terminological Problems. J. Am. Med. Inform. Assoc., 5:12--16, 1998.
[3]
C.-C. Chang and C.-J. Lin. LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol., 2(3):27:1--27:27, 2011.
[4]
B.W. Chee, R. Berlin, and B. Schatz. Predicting Adverse Drug Events from Personal Health Messages. In AMIA Annual Symposium Proceedings, pages 217--226, 2011.
[5]
S.J.W. Evans, P.C. Waller, and S. Davis. Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports. Pharmacoepidemiology and Drug Safety, 10:483--486, 2001.
[6]
C.C. Freifeld, J.S. Brownstein, C.M. Menone, W. Bao, R. Filice, T. Kass-Hout, and N. Dasgupta. Digital Drug Safety Surveillance: Monitoring Pharmaceutical Products in Twitter. Drug Safety, 37(5):343--350, 2014.
[7]
R. Ginn, P. Pimpalkhute, A. Nikfarjam, A. Pakti, K. O'Connor, A. Sarker, K. Smith, and G. Gonzalez. Mining Twitter for Adverse Drug Reaction Mentions: A Corpus and Classification Benchmark. In Proceedings of the 4th Workshop on Building and Evaluating Resources for Health and Biomedical Text Processing, pages 1--8, 2014.
[8]
K.L. Gwet. Handbook of Inter-Rater Reliability. Advanced Analytics LLC, 4th edition, 2014.
[9]
L.K. Hansen, A. Arvidsson, F.A. Nielsen, E. Colleoni, and M. Etter. Good Friends, Bad News - Affect and Virality in Twitter. In Future Information Technology, volume 185 of CCIS, pages 34--43. 2011.
[10]
L. Hazell and S.A.W. Shakir. Under-Reporting of Adverse Drug Reactions. Drug Safety, 29(5):385--396, 2006.
[11]
S. Karimi, A. Metke-Jimenez, and A. Nguyen. CADEminer: A system for mining consumer reports on adverse drug side effects. In Proceedings of the 8th Workshop on Exploiting Semantic Annotations in Information Retrieval, ESAIR '15, pages 47--50, 2015.
[12]
S. Karimi, C. Wang, A. Metke-Jimenez, R. Gaire, and C. Paris. Text and data mining techniques in adverse drug reaction detection. ACM Comput. Surv., 47(4):56:1--56:39, 2015.
[13]
M. Kuhn, M. Campilos, I. Letunic, L.J. Jensen, and P. Bork. A side effect resource to capture phenotypic effects of drugs. Molecular Systems Biology, 6(343), 2010.
[14]
J.R. Landis and G.G. Koch. The measurement of observer agreement for categorical data. Biometrics, 33(1):159--174, 1977.
[15]
R. Leaman, L. Wojtulewicz, R. Sullivan, A. Skariah, J. Yang, and G. Gonzalez. Towards internet-age pharmacovigilance: extracting adverse drug reactions from user posts to health-related social networks. In Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, pages 117--125, 2010.
[16]
J. Liu, A. Li, and S. Seneff. Automatic Drug Side Effect Discovery from Online Patient-Submitted Reviews: Focus on Statin Drugs. In Proceedings of the 1st International Conference on Advances in Information Mining and Management, pages 91--96, 2011.
[17]
M. Lui and T. Baldwin. langid.py: An Off-the-shelf Language Identification Tool. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012), pages 25--30, 2012.
[18]
A. Metke-Jimenez and S. Karimi. Concept extraction to identify adverse drug reactions in medical forums: A comparison of algorithms. http://arxiv.org/abs/1504.06936, 2015.
[19]
A. Metke-Jimenez, S. Karimi, and C. Paris. Evaluation of Text-Processing Algorithms for Adverse Drug Event Extraction from Social Media. In Proceedings of the International Workshop on Social Media Retrieval and Analysis, 2014.
[20]
A. Nikfarjam and G.H. Gonzalez. Pattern mining for extraction of mentions of adverse drug reactions from user comments. In AMIA Annual Symposium Proceedings, pages 1019--1026, 2011.
[21]
A. Nikfarjam, A. Sarker, K. O'Connor, R. Ginn, and G. Gonzalez. Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features. J. Am. Med. Inform. Assoc., 22:671--681, 2015.
[22]
M. Paul and M. Dredze. A Model for Mining Public Health Topics from Twitter. Technical report, Department of Computer Science, John Hopkins University, 2011.
[23]
A. Sarker, R. Ginn, A. Nikfarjam, K. O'Connor, K. Smith, S. Jayaraman, T. Upadhaya, and G. Gonzalez. Utilizing social media data for pharmacovigilance. Journal of Biomedical Informatics, 54:202--212, 2015.
[24]
A. Sarker and G. Gonzalez. Portable automatic text classification for adverse drug reaction detection via multi-corpus training. Journal of Biomedical Informatics, 53:196--207, 2015.
[25]
A. Sarker, A. Nikfarjam, and G. Gonzalez. Social media mining shared task workshop. In Proceedings of the Pacific Symposium on Biocomputing, pages 581--592, 2015.
[26]
S. Siegel and N.J. Castellan Jr. Nonparametric statistics for the behavioral sciences. McGraw--Hill, Inc., 2nd edition, 1988.
[27]
J.S. Uebersax. Diversity of decision-making models and the measurement of interrated agreement. Phychological Bulletin, 101(1):140--146, 1987.
[28]
S. Wang and C.D. Manning. Baselines and bigrams: Simple, good sentiment and topic classification. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2, ACL '12, pages 90--94, 2012.
[29]
World Health Organization. The Importance of Pharmacovigilance. http://bit.ly/1RI8HcE, 2002.
[30]
H. Wu, H. Fang, and S.J. Stanhope. An Early Warning System for Unrecognized Drug Side Effects Discovery. In Proceedings of the 21st International Conference on World Wide Web companion, pages 437--440, 2012.
[31]
X. Yan, J. Guo, S. Liu, X.-Q. Cheng, and Y. Wang. Clustering Short Text Using Ncut-weighted Non-negative Matrix Factorization. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM '12, pages 2259--2262, 2012.
[32]
C.C. Yang, H. Yang, L. Jiang, and M. Zhang. Social Media Mining for Drug Safety Signal Detection. In Proceedings of the 2012 International Workshop on Smart Health and Wellbeing, pages 33--40, 2012.
[33]
A. Yates, N. Goharian, and O. Frieder. Extracting Adverse Drug Reactions from Forum Posts and Linking them to Drugs. In Proceedings of the ACM SIGIR Workshop on Health Search & Discovery: Helping Users & Advancing Medicine, pages 55--58, 2013.
[34]
A. Yates, N. Goharian, and O. Frieder. Extracting adverse drug reactions from social media. In Proceedings of the 29th AAAI Conference on Artificial Intelligence, pages 2460--2467, 2015.
[35]
Q.T. Zeng, J. Crowell, G. Divita, L. Roth, and A.C. Browne. Identifying Consumer-Friendly Display (CFD) Names for Health Concepts. In AMIA Annual Symposium Proceedings, pages 859--863, 2005.

Cited By

View all
  • (2024)A Review on Adverse Drug Reaction Detection TechniquesARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY10.14500/aro.1138812:1(143-153)Online publication date: 7-Jun-2024
  • (2023)Machine Learning Applications in Pharmacovigilance: Scoping ReviewPharmacovigilance - Volume 210.5772/intechopen.107290Online publication date: 15-Feb-2023
  • (2021)Breast Cancer Identification from Patients’ Tweet Streaming Using Machine Learning Solution on SparkComplexity10.1155/2021/66535082021:1Online publication date: 28-Jan-2021
  • Show More Cited By
  1. Quantifying Self-Reported Adverse Drug Events on Twitter: Signal and Topic Analysis

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      SMSociety '16: Proceedings of the 7th 2016 International Conference on Social Media & Society
      July 2016
      220 pages
      ISBN:9781450339384
      DOI:10.1145/2930971
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 11 July 2016

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Classification
      2. Drug side effects
      3. Natural Language Processing
      4. Text Mining
      5. Twitter

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      SMSociety '16

      Acceptance Rates

      Overall Acceptance Rate 78 of 189 submissions, 41%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)9
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 13 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Review on Adverse Drug Reaction Detection TechniquesARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY10.14500/aro.1138812:1(143-153)Online publication date: 7-Jun-2024
      • (2023)Machine Learning Applications in Pharmacovigilance: Scoping ReviewPharmacovigilance - Volume 210.5772/intechopen.107290Online publication date: 15-Feb-2023
      • (2021)Breast Cancer Identification from Patients’ Tweet Streaming Using Machine Learning Solution on SparkComplexity10.1155/2021/66535082021:1Online publication date: 28-Jan-2021
      • (2021)Adverse Drug Events Detection, Extraction and Normalization from Online Comments of Chinese Patent MedicinesAdvances in Knowledge Discovery and Data Mining10.1007/978-3-030-75762-5_21(251-262)Online publication date: 9-May-2021
      • (2019)Nobody Said it Would be EasyProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331444(1387-1388)Online publication date: 18-Jul-2019
      • (2019)Harvesting Drug Effectiveness from Social MediaProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331263(55-64)Online publication date: 18-Jul-2019
      • (2019)MSAProceedings of the Twelfth ACM International Conference on Web Search and Data Mining10.1145/3289600.3290980(33-41)Online publication date: 30-Jan-2019
      • (2019)Predicting User Reported Symptoms Using a Gated Neural Network2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI.2019.00059(368-375)Online publication date: Nov-2019
      • (2019)Medical Social Media Text Classification Integrating Consumer Health TerminologyIEEE Access10.1109/ACCESS.2019.29219387(78185-78193)Online publication date: 2019
      • (2019)Using Arabic Tweets to Understand Drug Selling BehaviorsProcedia Computer Science10.1016/j.procs.2019.12.220164(559-564)Online publication date: 2019
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media