Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3152494.3152518acmotherconferencesArticle/Chapter ViewAbstractPublication PagescodsConference Proceedingsconference-collections
research-article

Stance classification of multi-perspective consumer health information

Published: 11 January 2018 Publication History

Abstract

While search engines are effective in answering direct factual questions such as, 'What are the symptoms of a disease X?', they are not so effective in addressing complex consumer health queries, which do not have a single definitive answer, such as, 'Is treatment X effective for disease Y?'. Instead, the users are presented with a vast number of search results with often contradictory perspectives and no definitive answer. We denote such queries as Multi-Perspective Consumer Health Information (MPCHI) queries for which there is no single 'Yes or No' answer. While ascertaining the credibility of the claims requires domain expertise, an efficient categorization of the search results according to their stance (support or oppose) to the queries will help the searcher in decision making. Hence, this paper focuses on the problem of stance classification for MPCHI data at sentence level, presenting a new data set for MPCHI queries. Unlike typical debate or argumentative text, the linguistic characteristics of MPCHI is quite different, with extensive use of scientific formal language and absence of opinion bearing words. Hence, such inherently different characteristic of MPCHI text requires going beyond traditional Bag of Words (BoW) features for stance classification. Hence, we propose using a rich non-traditional set of features such as medical semantic relations, stance vectors, sentiment polarity, textual entailment, and study their impact on MPCHI stance classification using an SVM and a neural network classifier. We find that using novel non-traditional features improves MPCHI stance classification performance over traditional BoW model by 24% for the SVM classifier, and 44% for the neural network classifier respectively, for the best feature combination.

References

[1]
Ameeta Agrawal and Aijun An. 2012. Unsupervised emotion detection from text using semantic and syntactic relations. In Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on, Vol. 1. IEEE, 346--353.
[2]
Rakesh Agrawal, Sridhar Rajagopalan, Ramakrishnan Srikant, and Yirong Xu. 2003. Mining Newsgroups Using Networks Arising from Social Behavior. In Proceedings of the 12th International Conference on World Wide Web (WWW '03). ACM, New York, NY, USA, 529--535.
[3]
Pranav Anand, Marilyn Walker, Rob Abbott, Jean E. Fox Tree, Robeson Bowmani, and Michael Minor. 2011. Cats Rule and Dogs Drool!: Classifying Stance in Online Debate. In Proceedings of the 2Nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA '11). Association for Computational Linguistics, Stroudsburg, PA, USA, 1--9. http://dl.acm.org/citation.cfm?id=2107653.2107654
[4]
Alexandra Balahur, Zornitsa Kozareva, and Andrés Montoyo. 2009. Determining the Polarity and Source of Opinions Expressed in Political Debates. In Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing '09). Springer-Verlag, Berlin, Heidelberg, 468--480.
[5]
O. Biran and O Rambow. 2011. Identifying Justifications in Written Dialogs,. In Proceedings of the 2011 IEEE Fifth International Conference on Semantic Computing.
[6]
Official Google Blog. 2015. Google Health Information Knowledge Graph. (2015). https://googleblog.blogspot.co.uk/2015/02/health-info-knowledge-graph.html
[7]
Clinton Burfoot, Steven Bird, and Timothy Baldwin. 2011. Collective Classification of Congressional Floor-debate Transcripts. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 (HLT '11). Association for Computational Linguistics, Stroudsburg, PA, USA, 1506--1515. http://dl.acm.org/citation.cfm?id=2002472.2002655
[8]
Jacob Cohen. 1968. Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychological bulletin 70, 4 (1968), 213.
[9]
Janez Demšar. 2006. Statistical comparisons of classifiers over multiple data sets. Journal of Machine learning research 7, Jan (2006), 1--30.
[10]
Shiri Dori-Hacohen. 2015. Controversy Detection and Stance Analysis. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '15). ACM, New York, NY, USA, 1057--1057.
[11]
Shiri Dori-Hacohen and James Allan. 2013. Detecting controversy on the web. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management (CIKM '13). ACM, New York, NY, USA, 1845--1848.
[12]
Shiri Dori-Hacohen and James Allan. 2015. Automated Controversy Detection on the Web. In Proceedings of the 37th European Conference on Information Retrieval. 423--434. ECIR submission.
[13]
Shiri Dori-Hacohen, Elad Yom-Tov, and James Allan. 2015. Navigating Controversy as a Complex Search Task. In ECIR 15. Electronic proceedings only.
[14]
Adam Faulkner. 2012. Automated Classification of Stance in Student Essays: An Approach Using Stance Target Information and the Wikipedia Link-Based Measure. In Proceedings of the Twenty-Seventh International Artificial Intelligence Research Conference.
[15]
W. Ferreira and A Vlachos. 2016. Emergent: a novel data-set for stance classification. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics.
[16]
Kazi Saidul Hasan and Vincent Ng. 2013. Stance Classification of Ideological Debates: Data, Models, Features, and Constraints. In Proceedings of the Sixth International Joint Conference on Natural Language Processing. 1348--1356.
[17]
Myungha Jang and James Allan. 2016. Improving Automated Controversy Detection on the Web. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '16). ACM, New York, NY, USA, 865--868.
[18]
Tony Mullen and Robert Malouf. 2008. Taking sides: User classification for informal online political discourse. Internet Research 18 (2008), 177--190.
[19]
Akiko Murakami and Rudy Raymond. 2010. Support or Oppose?: Classifying Positions in Online Debates from Reply Activities and Opinion Expressions. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters (COLING '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 869--875. http://dl.acm.org/citation.cfm?id=1944566.1944666
[20]
SEBASTIAN PAD?!, TAE-GIL NOH, ASHER STERN, RUI WANG, and ROBERTO ZANOLI. 2015. Design and realization of a modular architecture for textual entailment. Natural Language Engineering 21 (3 2015), 167--200. Issue 02.
[21]
Thomas C. Rindflesh and Marcelo Fiszman. 2003. The Interaction of Domain Knowledge and Linguistic Structure in Natural Language Processing: Interpreting Hypernymic Propositions in Biomedical Text. J. of Biomedical Informatics 36, 6 (Dec. 2003), 462--477.
[22]
Parinaz Sobhani, Diana Inkpen, and Stan Matwin. 2015. From argumentation mining to stance classification. NAACL HLT 2015 (2015).
[23]
Swapna Somasundaran and Janyce Wiebe. 2009. Recognizing Stances in Online Debates. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1 (ACL '09). Association for Computational Linguistics, Stroudsburg, PA, USA, 226--234. http://dl.acm.org/citation.cfm?id=1687878.1687912
[24]
Swapna Somasundaran and Janyce Wiebe. 2010. Recognizing Stances in Ideological On-line Debates. In Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text (CAAGET '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 116--124. http://dl.acm.org/citation.cfm?id=1860631.1860645
[25]
Dhanya Sridhar, Lise Getoor, and Marilyn Walker. 2014. Collective Stance Classification of Posts in Online Debate Forums. In ACL Joint Workshop on Social Dynamics and Personal Attributes in Social Media.
[26]
Matt Thomas, Bo Pang, and Lillian Lee. 2006. Get out the Vote: Determining Support or Opposition from Congressional Floor-debate Transcripts. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP '06). Association for Computational Linguistics, Stroudsburg, PA, USA, 327--335. http://dl.acm.org/citation.cfm?id=1610075.1610122
[27]
Marilyn A. Walker, Pranav Anand, Robert Abbott, and Ricky Grant. 2012. Stance Classification Using Dialogic Properties of Persuasion (NAACL HLT '12). Association for Computational Linguistics, Stroudsburg, PA, USA, 592--596. http://dl.acm.org/citation.cfm?id=2382029.2382124
[28]
Yi-Chia Wang and Carolyn P. Rosé. 2010. Making Conversational Structure Explicit: Identification of Initiation-response Pairs Within Online Discussions. In 2010 North American Chapter of the Association for Computational Linguistics (HLT '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 673--676. http://dl.acm.org/citation.cfm?id=1857999.1858096
[29]
WebMD. 2016. WebMD information on link between vaccine and autism. (2016). http://www.webmd.com/children/vaccines/features/vaccine-linked-to-autism#1
[30]
Ainur Yessenalina, Yisong Yue, and Claire Cardie. 2010. Multi-level Structured Models for Document-level Sentiment Classification. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 1046--1056. http://dl.acm.org/citation.cfm?id=1870658.1870760
[31]
Elad Yom-Tov and Luis Fernández-Luque. 2014. Information is in the eye of the beholder: Seeking information on the MMR vaccine through an Internet search engine. In AMIA 2014, American Medical Informatics Association Annual Symposium, Washington, DC, USA, November 15-19, 2014. http://knowledge.amia.org/56638-amia-1.1540970/t-004-1.1544972/f-004-1.1544973/a-226-1.1544995/a-227-1.1544992
[32]
Charles Zaiontz. 2016. Real Statistics Using Excel. (2016). http://www.real-statistics.com/statistics-tables/wilcoxon-signed-ranks-table/

Cited By

View all
  • (2024)Collaborative Knowledge Infusion for Low-Resource Stance DetectionBig Data Mining and Analytics10.26599/BDMA.2024.90200217:3(682-698)Online publication date: Sep-2024
  • (2024)Topic-Specific Political Stance Inference in Social Networks With Case StudiesIEEE Access10.1109/ACCESS.2024.336048712(21921-21935)Online publication date: 2024
  • (2023)Explainable Cross-Topic Stance Detection for Search ResultsProceedings of the 2023 Conference on Human Information Interaction and Retrieval10.1145/3576840.3578296(221-235)Online publication date: 19-Mar-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data
January 2018
379 pages
ISBN:9781450363419
DOI:10.1145/3152494
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 January 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. health information mining
  2. stance classification
  3. text processing

Qualifiers

  • Research-article

Conference

CoDS-COMAD '18

Acceptance Rates

CODS-COMAD '18 Paper Acceptance Rate 50 of 150 submissions, 33%;
Overall Acceptance Rate 197 of 680 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)3
Reflects downloads up to 23 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Collaborative Knowledge Infusion for Low-Resource Stance DetectionBig Data Mining and Analytics10.26599/BDMA.2024.90200217:3(682-698)Online publication date: Sep-2024
  • (2024)Topic-Specific Political Stance Inference in Social Networks With Case StudiesIEEE Access10.1109/ACCESS.2024.336048712(21921-21935)Online publication date: 2024
  • (2023)Explainable Cross-Topic Stance Detection for Search ResultsProceedings of the 2023 Conference on Human Information Interaction and Retrieval10.1145/3576840.3578296(221-235)Online publication date: 19-Mar-2023
  • (2023)Hashtag-Guided Low-Resource Tweet ClassificationProceedings of the ACM Web Conference 202310.1145/3543507.3583194(1415-1426)Online publication date: 30-Apr-2023
  • (2023)Big Data ML-Based Fake News Detection Using Distributed LearningIEEE Access10.1109/ACCESS.2023.326076311(29447-29463)Online publication date: 2023
  • (2022)A Tutorial on Stance DetectionProceedings of the Fifteenth ACM International Conference on Web Search and Data Mining10.1145/3488560.3501391(1626-1628)Online publication date: 11-Feb-2022
  • (2022)Few-Shot Stance Detection via Target-Aware Prompt DistillationProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531979(837-847)Online publication date: 6-Jul-2022
  • (2022)Opinion Mining System For Domain Specific Mobile Application Development2022 IEEE Delhi Section Conference (DELCON)10.1109/DELCON54057.2022.9753286(1-5)Online publication date: 11-Feb-2022
  • (2021)Stance Detection: Concepts, Approaches, Resources, and Outstanding IssuesProceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3404835.3462815(2673-2676)Online publication date: 11-Jul-2021
  • (2021)HeadlineStanceCheckerWeb Semantics: Science, Services and Agents on the World Wide Web10.1016/j.websem.2021.10066071:COnline publication date: 1-Nov-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media