Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3295750.3298933acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article
Public Access

HAIR: Towards Developing a Global Self-Updating Peer Support Group Meeting List Using Human-Aided Information Retrieval

Published: 08 March 2019 Publication History

Abstract

Alcoholics Anonymous (AA) is the largest grassroots peer support group for any health condition. While AA meeting attendance is particularly important for people who are newly sober, newcomers often have trouble finding meetings because of a lack of global up-to-date meeting list due to preference for regional autonomy in AA's organizational structure. Detection of regional webpages containing meetings and extraction of day, time, and address of meetings from those pages are essential steps in making the information available and up-to-date in a global meeting list. However, varied structure of the webpages and the meetings pose challenges in achieving the goal with traditional information retrieval methods. In this paper we propose HAIR: a semi-automated human-aided information retrieval technique and explore its potential to solve this problem. We describe future directions in developing this critical tool and discuss major implications of our work in pointing to the importance of context-specific rather than context-agnostic semi-automated in-formation retrieval techniques by conceptualizing the proposed methods and results in a broader context.

References

[1]
Ansam A. AbdulHussien. 2017. Comparison of Machine Learning Algorithms to Classify Web Pages. International Journal of Advanced Computer Science and Applications (ijacsa) 8, 11.
[2]
Anagnostopoulos, C. Anagnostopoulos, V. Loumos, and E. Kayafas. 2004. Classifying Web pages employing a probabilistic neural network. IEE Proceedings - Software 151, 3: 139--150.
[3]
Howard C. Becker. 2008. Alcohol dependence, withdrawal, and relapse. Alcohol Research & Health 31, 4: 348--361.
[4]
Mike Cassidy. 2014. Centaur Chess Shows Power of Teaming Human and Machine. Huffington Post. Retrieved August 13, 2018 from https://www.huffingtonpost.com/mike-cassidy/centaur-chess-shows-power_b_6383606.html
[5]
Chia-Hui Chang and Shao-Chen Lui. 2001. IEPAD: Information Extraction Based on Pattern Discovery. In Proceedings of the 10th International Conference on World Wide Web (WWW '01), 681--688.
[6]
Michael Chau and Hsinchun Chen. 2008. A Machine Learning Approach to Web Page Filtering Using Content and Structure Analysis. Decis. Support Syst. 44, 2: 482--494.
[7]
Yinlin Chen, Paul Logasa Bogen II, Haowei Hsieh, Edward A. Fox, and Lillian N. Cassel. 2012. Categorization of Computing Education Resources with Utilization of Crowdsourcing. In Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '12), 121--124.
[8]
Zhe Chen. 2014. A Semiautomatic Approach for Accurate and Low-Effort Spreadsheet Data Extraction.
[9]
Justin Cheng and Michael S. Bernstein. 2015. Flock: Hybrid Crowd-Machine Learning Classifiers. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15), 600--611.
[10]
Ed H. Chi. 2017. Technical Perspective: Humans and Computers Working Together on Hard Tasks. Commun. ACM 60, 9: 92--92.
[11]
M. I. Devi, R. Rajaram, and K. Selvakuberan. 2007. Machine Learning Techniques for Automated Web Page Classification Using URL Features. In International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007), 116--120.
[12]
M. Ferri, L. Amato, and M. Davoli. 2006. Alcoholics Anonymous and other 12-step programmes for alcohol dependence. The Cochrane Database of Systematic Reviews, 3: CD005032.
[13]
Dayne Freitag. 2000. Machine Learning for Information Extraction in Informal Domains. Machine Learning 39, 2--3: 169--202.
[14]
F. S. Gharehchopogh and Z. A. Khalifelu. 2011. Analysis and evaluation of unstructured data: text mining versus natural language processing. In 2011 5th International Conference on Application of Information and Communication Technologies (AICT), 1--4.
[15]
Kotaro Hara and Jon E. Froehlich. 2015. Characterizing and Visualizing Physical World Accessibility at Scale Using Crowdsourcing, Computer Vision, and Machine Learning. SIGACCESS Access. Comput., 113: 13--21.
[16]
Yun Huang, Yifeng Huang, Na Xue, and Jeffrey P. Bigham. 2017. Leveraging Complementary Contributions of Different Workers for Efficient Crowdsourcing of Video Captions. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems (CHI '17), 4617--4626.
[17]
Zhongjun Jin, Michael R. Anderson, Michael Cafarella, and H. V. Jagadish. 2017. Foofah: Transforming Data By Example. In Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD '17), 683--698.
[18]
Thorsten Joachims. 1998. Text categorization with Support Vector Machines: Learning with many relevant features. In Machine Learning: ECML-98, Claire Nédellec and Céline Rouveirol (eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 137--142.
[19]
Herbert D. Kleber, Roger D. Weiss, Raymond F. Anton, Tony P. George, Shelly F. Greenfield, Thomas R. Kosten, Charles P. O'Brien, Bruce J. Rounsaville, Eric C. Strain, Douglas M. Ziedonis, Grace Hennessy, Hilary Smith Connery, John S. McIntyre, Sara C. Charles, Daniel J. Anzia, Ian A. Cook, Molly T. Finnerty, Bradley R. Johnson, James E. Nininger, Paul Summergrad, Sherwyn M. Woods, Joel Yager, Robert Pyles, C. Deborah Cross, Roger Peele, John P. D. Shemo, Lawrence Lurie, R. Dale Walker, Mary Ann Barnovitz, Sheila Hafter Gray, Sunil Saxena, Tina Tonnu, Robert Kunkle, Amy B. Albert, Laura J. Fochtmann, Claudia Hart, Darrel Regier, Work Group on Substance Use Disorders, American Psychiatric Association, and Steering Committee on Practice Guidelines. 2007. Treatment of patients with substance use disorders, second edition. American Psychiatric Association. The American Journal of Psychiatry 164, 4 Suppl: 5--123.
[20]
Erdal Kuzey and Gerhard Weikum. 2012. Extraction of Temporal Facts and Events from Wikipedia. In Proceedings of the 2Nd Temporal Web Analytics Workshop (TempWeb '12), 25--32.
[21]
Gierad Laput, Walter S. Lasecki, Jason Wiese, Robert Xiao, Jeffrey P. Bigham, and Chris Harrison. 2015. Zensors: Adaptive, Rapidly Deployable, Human-Intelligent Sensor Feeds. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15), 1935--1944.
[22]
Wendy Lehnert, Stephen Soderland, David Aronow, FangFang Feng, and Avinoam Shmueli. 1995. Inductive text classification for medical applications. Journal of Experimental & Theoretical Artificial Intelligence 7, 1: 49--80.
[23]
Camelia Lemnaru and Rodica Potolea. 2012. Imbalanced Classification Problems: Systematic Study, Issues and Best Practices. In Enterprise Information Systems (Lecture Notes in Business Information Processing), 35--50.
[24]
Xuanchong Li, Kaimin Chang, Yueran Yuan, and Alexander Hauptmann. 2015. Massive Open Online Proctor: Protecting the Credibility of MOOCs Certificates. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15), 1129--1137.
[25]
Henry Lieberman, Karthik Dinakar, and Birago Jones. 2013. Crowdsourced Ethics with Personalized Story Matching. In CHI '13 Extended Abstracts on Human Factors in Computing Systems (CHI EA '13), 709--714.
[26]
Xiao Ling and Daniel S. Weld. 2010. Temporal Information Extraction. In Twenty-Fourth AAAI Conference on Artificial Intelligence. Retrieved September 21, 2018 from https://www.aaai.org/ocs/index.php/AAAI/AAAI10/paper/view/1805
[27]
Huaxi Liu, Ning Wang, and Xiangran Ren. 2015. CrowdSR: A Crowd Enabled System for Semantic Recovering of Web Tables. In Web-Age Information Management (Lecture Notes in Computer Science), 581--583.
[28]
Brian McLernon and Nicholas Kushmerick. 2006. Transductive Pattern Learning for Information Extraction. UNIVERSITY COLL DUBLIN (IRELAND), UNIVERSITY COLL DUBLIN (IRELAND). Retrieved August 19, 2018 from http://www.dtic.mil/docs/citations/ADA456766
[29]
Dunja Mladenic. 1998. Turning Yahoo into an Automatic Web-Page Classifier.
[30]
Raymond J. Mooney and Razvan Bunescu. 2005. Mining Knowledge from Text Using Information Extraction. SIGKDD Explor. Newsl. 7, 1: 3--10.
[31]
Rudolf H. Moos and Bernice S. Moos. 2006. Participation in Treatment and Alcoholics Anonymous: A 16-Year Follow-Up of Initially Untreated Individuals. Journal of clinical psychology 62, 6: 735--750.
[32]
Clifton Phua, Damminda Alahakoon, and Vincent Lee. 2004. Minority Report in Fraud Detection: Classification of Skewed Data. SIGKDD Explor. Newsl. 6, 1: 50--59.
[33]
Xiaoguang Qi and Brian D. Davison. 2006. Knowing a Web Page by the Company It Keeps. In Proceedings of the 15th ACM International Conference on Information and Knowledge Management (CIKM '06), 228--237.
[34]
Xiaoguang Qi and Brian D. Davison. 2009. Web Page Classification: Features and Algorithms. ACM Comput. Surv. 41, 2: 12:1--12:31.
[35]
Alan Ritter, Mausam, Oren Etzioni, and Sam Clark. 2012. Open Domain Event Extraction from Twitter. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '12), 1104--1112.
[36]
Sabirat Rubya. 2017. Facilitating Peer Support for Recovery from Substance Use Disorders. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems (CHI EA '17), 172--177.
[37]
Sabirat Rubya and Svetlana Yarosh. 2017. Video-Mediated Peer Support in an Online Community for Recovery from Substance Use Disorders. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17), 1454--1469.
[38]
E. Saraç and S. A. Özel. 2013. Web page classification using firefly optimization. In 2013 IEEE INISTA, 1--5.
[39]
Thimo Schulze, Simone Krug, and Martin Schader. 2012. Workers' Task Choice in Crowdsourcing and Human Computation Markets. ICIS 2012 Proceedings. Retrieved from http://aisel.aisnet.org/icis2012/proceedings/ResearchInProgress/40
[40]
Vinay Shashidhar, Nishant Pandey, and Varun Aggarwal. 2015. Spoken English Grading: Machine Learning with Crowd Intelligence. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '15), 2089--2097.
[41]
Wongkot Sriurai, Phayung Meesad, and Choochart Haruechaiyasak. 2010. Hierarchical Web Page Classification Based on a Topic Model and Neighboring Pages Integration. arXiv:1003.1510 {cs}. Retrieved August 13, 2018 from http://arxiv.org/abs/1003.1510
[42]
Chong Sun, Narasimhan Rampalli, Frank Yang, and AnHai Doan. 2014. Chimera: Large-scale Classification Using Machine Learning, Rules, and Crowdsourcing. Proc. VLDB Endow. 7, 13: 1529--1540.
[43]
Makoto Tsukada, Takashi Washio, and Hiroshi Motoda. 2001. Automatic Web-Page Classification by Using Machine Learning Methods. In Proceedings of the First Asia-Pacific Conference on Web Intelligence: Research and Development (WI '01), 303--313. Retrieved January 11, 2017 from http://dl.acm.org/citation.cfm?id=645960.673927
[44]
Yafang Wang, Mingjie Zhu, Lizhen Qu, Marc Spaniol, and Gerhard Weikum. 2010. Timely YAGO: Harvesting, Querying, and Visualizing Temporal Knowledge from Wikipedia. In Proceedings of the 13th International Conference on Extending Database Technology (EDBT '10), 697--700.
[45]
Shomir Wilson, Florian Schaub, Rohan Ramanath, Norman Sadeh, Fei Liu, Noah A. Smith, and Frederick Liu. 2016. Crowdsourcing Annotations for Websites' Privacy Policies: Can It Really Work? In Proceedings of the 25th International Conference on World Wide Web (WWW '16), 133--143.
[46]
Yang Yang, Bin B. Zhu, Rui Guo, Linjun Yang, Shipeng Li, and Nenghai Yu. 2008. A Comprehensive Human Computation Framework: With Application to Image Labeling. In Proceedings of the 16th ACM International Conference on Multimedia (MM '08), 479--488.
[47]
Dahai Yao, Hailong Sun, and Xudong Liu. 2015. Combining Crowd Contributions with Machine Learning to Detect Malicious Mobile Apps. In Proceedings of the 7th Asia-Pacific Symposium on Internetware (Internetware '15), 120--123.
[48]
Svetlana Yarosh. 2013. Shifting Dynamics or Breaking Sacred Traditions?: The Role of Technology in Twelve-step Fellowships. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13), 3413--3422.
[49]
D. Yue, G. Yu, D. Shen, and X. Yu. 2014. A Weighted Aggregation Rule in Crowdsourcing Systems for High Result Accuracy. In 2014 IEEE 12th International Conference on Dependable, Autonomic and Secure Computing, 265--270.
[50]
Alcohol Facts and Statistics | National Institute on Alcohol Abuse and Alcoholism (NIAAA). Retrieved September 6, 2018 from https://www.niaaa.nih.gov/alcohol-health/overview-alcohol-consumption/alcohol-facts-and-statistics
[51]
Alcoholics Anonymous?: A.A. Near You. Retrieved September 23, 2018 from https://www.aa.org/pages/en_US/find-aa-resources
[52]
Amazon Mechanical Turk. Retrieved September 23, 2018 from https://www.mturk.com/

Cited By

View all
  • (2023)AI-Based Intrusion Detection Systems for In-Vehicle Networks: A SurveyACM Computing Surveys10.1145/357095455:11(1-40)Online publication date: 9-Feb-2023
  • (2022)The Work of Digital Social Re-entry in Substance Use Disorder RecoveryProceedings of the ACM on Human-Computer Interaction10.1145/35556586:CSCW2(1-33)Online publication date: 11-Nov-2022
  • (2022)Maintaining ValuesProceedings of the ACM on Human-Computer Interaction10.1145/35555506:CSCW2(1-28)Online publication date: 11-Nov-2022
  • Show More Cited By

Index Terms

  1. HAIR: Towards Developing a Global Self-Updating Peer Support Group Meeting List Using Human-Aided Information Retrieval

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CHIIR '19: Proceedings of the 2019 Conference on Human Information Interaction and Retrieval
      March 2019
      463 pages
      ISBN:9781450360258
      DOI:10.1145/3295750
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 08 March 2019

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. alcoholics anonymous
      2. classification
      3. crowdsourcing
      4. human computation
      5. information retrieval
      6. peer support

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      CHIIR '19
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 55 of 163 submissions, 34%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)97
      • Downloads (Last 6 weeks)7
      Reflects downloads up to 16 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)AI-Based Intrusion Detection Systems for In-Vehicle Networks: A SurveyACM Computing Surveys10.1145/357095455:11(1-40)Online publication date: 9-Feb-2023
      • (2022)The Work of Digital Social Re-entry in Substance Use Disorder RecoveryProceedings of the ACM on Human-Computer Interaction10.1145/35556586:CSCW2(1-33)Online publication date: 11-Nov-2022
      • (2022)Maintaining ValuesProceedings of the ACM on Human-Computer Interaction10.1145/35555506:CSCW2(1-28)Online publication date: 11-Nov-2022
      • (2022)ColorCook: Augmenting Color Design for Dashboarding with Domain-Associated PalettesProceedings of the ACM on Human-Computer Interaction10.1145/35555346:CSCW2(1-25)Online publication date: 11-Nov-2022
      • (2022)Auggie: Encouraging Effortful Communication through Handcrafted Digital ExperiencesProceedings of the ACM on Human-Computer Interaction10.1145/35551526:CSCW2(1-25)Online publication date: 11-Nov-2022
      • (2022)"For an App Supposed to Make Its Users Feel Better, It Sure is a Joke" - An Analysis of User Reviews of Mobile Mental Health ApplicationsProceedings of the ACM on Human-Computer Interaction10.1145/35551466:CSCW2(1-29)Online publication date: 11-Nov-2022
      • (2022)Understanding Account Deletion and Relevant Dark Patterns on Social MediaProceedings of the ACM on Human-Computer Interaction10.1145/35551426:CSCW2(1-43)Online publication date: 11-Nov-2022
      • (2021)Comparing Generic and Community-Situated Crowdsourcing for Data Validation in the Context of Recovery from Substance Use DisordersProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445399(1-17)Online publication date: 6-May-2021
      • (2019)An interview with Lana YaroshUbiquity10.1145/33386282019:June(1-7)Online publication date: 11-Jun-2019

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media