Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3127404.3127426acmotherconferencesArticle/Chapter ViewAbstractPublication PageschinesecscwConference Proceedingsconference-collections
research-article

Recommending Answerers for Stack Overflow with LDA Model

Published: 22 September 2017 Publication History

Abstract

Stack Overflow is the largest Community-based Question Answering (CQA) site for software developers. Its popularity is mainly attributed to the timely answers provided by a great number of developers. When having problems in learning and using new technologies, developers resort to Stack Overflow for help. However, it is difficult to recommend questions to the potential answerers due to the huge numbers of questions. In order to improve the accuracy of question recommending, we need to find out the members who are interested in the fields related to the questions and match the ability of developers with the difficulty of questions. To do so, we need to pay close attention to the behavior of developers. This paper presents a model with two prediction approaches, namely, the traditional feature-based approach and LDA (Latent Dirichlet Allocation) based approach. When a new question arrives, this model will use LDA method to label the question and indicate the proper category to which the question belongs according to latent semantic feature and content feature. Then, with the traditional features of the question and the asker information, the model will recommend the appropriate developers to answer this new question.

References

[1]
Asaduzzaman M, Mashiyat A S, Roy C K, et al. Answering questions about unanswered questions of stack overflow{C}Proceedings of the 10th Working Conference on Mining Software Repositories. IEEE Press, 2013: 97--100.
[2]
Ahasanuzzaman M, Roy C K, et al. Mining duplicate questions in stack overflow{C}Proceedings of the 13th International Workshop on Mining Software Repositories. ACM, 2016: 402--412.
[3]
Xu B, Ye D, Xing Z, et al. Predicting semantically linkable knowledge in developer online forums via convolutional neural network{C}Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering. ACM, 2016: 51--62.
[4]
Yao Y, Tong H, Xie T, et al. Detecting high-quality posts in community question answering sites{J}. Information Sciences, 2015, 302: 70--82.
[5]
Baltadzhieva A, Chrupała G. Question Quality in Community Question Answering Forums: a survey{J}. ACM SIGKDD Explorations Newsletter, 2015, 17(1): 8--13.
[6]
Correa D, Sureka A. Chaff from the wheat: characterization and modeling of deleted questions on stack overflow{C}Proceedings of the 23rd international conference on World wide web. ACM, 2014: 631--642.
[7]
Beyer S, Pinzger M. Synonym suggestion for tags on stack overflow{C}Proceedings of the 2015 IEEE 23rd International Conference on Program Comprehension. IEEE Press, 2015: 94--103.
[8]
Beyer S, Pinzger M. Grouping android tag synonyms on stack overflow{C}Proceedings of the 13th International Workshop on Mining Software Repositories. ACM, 2016: 430--440.
[9]
Halavais A, Kwon K H, Havener S, et al. Badges of friendship: social influence and badge acquisition on Stack Overflow{C}2014 47th Hawaii International Conference on System Sciences. IEEE, 2014: 1607--1615.
[10]
Vasilescu B, Filkov V, Serebrenik A. StackOverflow and GitHub: Associations between software development and crowdsourced knowledge{C}Social Computing (SocialCom), 2013 International Conference on. IEEE, 2013: 188--195.
[11]
Wang S, Lo D, Jiang L. An empirical study on developer interactions in stackoverflow{C}Proceedings of the 28th Annual ACM Symposium on Applied Computing. ACM, 2013: 1019--1024.
[12]
Lin B, Serebrenik A. Recognizing gender of stack overflow users{C}Proceedings of the 13th International Workshop on Mining Software Repositories. ACM, 2016: 425--429.
[13]
Bazelli B, Hindle A, Stroulia E. On the Personality Traits of StackOverflow Users{C}ICSM. 2013: 460--463.
[14]
Chang S, Pal A. Routing questions for collaborative answering in community question answering{C}Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. ACM, 2013: 494--501.
[15]
Guo J, Xu S, Bao S, et al. Tapping on the potential of q&a community by recommending answer providers{C}Proceedings of the 17th ACM conference on Information and knowledge management. ACM, 2008: 921--930.
[16]
Jurczyk P, Agichtein E. Discovering authorities in question answer communities by using link analysis{C}Proceedings of the sixteenth ACM conference on Conference on information and knowledge management. ACM, 2007: 919--922.
[17]
Zhou T C, Lyu M R, King I. A classification-based approach to question routing in community question answering{C}Proceedings of the 21st International Conference on World Wide Web. ACM, 2012: 783--790.
[18]
Hanrahan B V, Convertino G, Nelson L. Modeling problem difficulty and expertise in stackoverflow{C}//Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work Companion. ACM, 2012: 91--94.
[19]
Movshovitz-Attias D, Movshovitz-Attias Y, Steenkiste P, et al. Analysis of the reputation system and user contributions on a question answering website: Stackoverflow{C}//Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on. IEEE, 2013: 886--893.
[20]
Pinto G, Castor F, Liu Y D. Mining questions about software energy consumption{C}//Proceedings of the 11th Working Conference on Mining Software Repositories. ACM, 2014: 22--31.
[21]
Bajaj K, Pattabiraman K, Mesbah A. Mining questions asked by web developers{C}//Proceedings of the 11th Working Conference on Mining Software Repositories. ACM, 2014: 112--121.
[22]
Gantayat N, Dhoolia P, Padhye R, et al. The synergy between voting and acceptance of answers on stackoverflow, or the lack thereof{C}//Proceedings of the 12th Working Conference on Mining Software Repositories. IEEE Press, 2015: 406--409.
[23]
Zagalsky A, Barzilay O, Yehudai A. Example overflow: Using social media for code recommendation{C}//Proceedings of the Third International Workshop on Recommendation Systems for Software Engineering. IEEE Press, 2012: 38--42.
[24]
Subramanian S, Inozemtseva L, Holmes R. Live API documentation{C}//Proceedings of the 36th International Conference on Software Engineering. ACM, 2014: 643--652.
[25]
Atwood J, "Stackoverflow careers: Amplifying your awesome," Code Horror, 2009, http://www.codinghorror.com/blog/2009/11/stack-overflow-careers-amplifying-your-awesome.html
[26]
Chang S, Pal A. Routing questions for collaborative answering in Community Question Answering{C}// Ieee/acm International Conference on Advances in Social Networks Analysis and Mining. IEEE, 2013:494--501.
[27]
J. Guo, S. Xu, S. Bao, and Y. Yu, "Tapping on the potential of q&a community by recommending answer providers," Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08, pp. 921--930, 2008.
[28]
Guo J, Xu S, Bao S, et al. Tapping on the potential of q&a community by recommending answer providers{C}// ACM Conference on Information and Knowledge Management, CIKM 2008, Napa Valley, California, Usa, October. DBLP, 2008:921--930.
[29]
Jurczyk P, Agichtein E. Discovering authorities in question answer communities by using link analysis{C}// Sixteenth ACM Conference on Information and Knowledge Management, CIKM 2007, Lisbon, Portugal, November. DBLP, 2007:919--922.
[30]
Zhou T C, Lyu M R, King I. A classification-based approach to question routing in community question answering{C}// Proceedings of the 21st international conference companion on World Wide Web. ACM, 2012:783--790.
[31]
Hu H, Zhang H, Xuan J, et al. Effective Bug Triage Based on Historical Bug-Fix Information{C}// IEEE, International Symposium on Software Reliability Engineering. IEEE, 2014:122--132.
[32]
Choetkiertikul M, Avery D, Dam H K, et al. Who Will Answer My Question on Stack Overflow?{C}// Software Engineering Conference. IEEE, 2015:155--164.
[33]
Zhao W X, Jiang J, Weng J, et al. Comparing Twitter and Traditional Media Using Topic Models{M}// Advances in Information Retrieval. Springer Berlin Heidelberg, 2011:338--349.
[34]
Titov I, Mcdonald R. Modeling online reviews with multi-grain topic models{J}. 2012:111--120.

Cited By

View all
  • (2023)Finding Experts in Community Question Answering System Using Trie String Matching Algorithm with Domain KnowledgeIETE Journal of Research10.1080/03772063.2023.218123370:3(2602-2614)Online publication date: 23-Feb-2023
  • (2022)More Gamification Is Not Always BetterProceedings of the ACM on Human-Computer Interaction10.1145/35555536:CSCW2(1-32)Online publication date: 11-Nov-2022
  • (2022)A Dual-Attention Heterogeneous Graph Neural Network for Expert Recommendation in Online Agricultural Question and Answering Communities2022 IEEE 25th International Conference on Computer Supported Cooperative Work in Design (CSCWD)10.1109/CSCWD54268.2022.9776176(926-931)Online publication date: 4-May-2022
  • Show More Cited By

Index Terms

  1. Recommending Answerers for Stack Overflow with LDA Model

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ChineseCSCW '17: Proceedings of the 12th Chinese Conference on Computer Supported Cooperative Work and Social Computing
    September 2017
    269 pages
    ISBN:9781450353526
    DOI:10.1145/3127404
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 September 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. LDA
    2. Stack Overflow
    3. classifier
    4. recommender system

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ChineseCSCW '17

    Acceptance Rates

    ChineseCSCW '17 Paper Acceptance Rate 21 of 84 submissions, 25%;
    Overall Acceptance Rate 21 of 84 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Finding Experts in Community Question Answering System Using Trie String Matching Algorithm with Domain KnowledgeIETE Journal of Research10.1080/03772063.2023.218123370:3(2602-2614)Online publication date: 23-Feb-2023
    • (2022)More Gamification Is Not Always BetterProceedings of the ACM on Human-Computer Interaction10.1145/35555536:CSCW2(1-32)Online publication date: 11-Nov-2022
    • (2022)A Dual-Attention Heterogeneous Graph Neural Network for Expert Recommendation in Online Agricultural Question and Answering Communities2022 IEEE 25th International Conference on Computer Supported Cooperative Work in Design (CSCWD)10.1109/CSCWD54268.2022.9776176(926-931)Online publication date: 4-May-2022
    • (2022)An empirical study of COVID-19 related posts on Stack OverflowJournal of Systems and Software10.1016/j.jss.2021.111089182:COnline publication date: 22-Apr-2022
    • (2022)Augmenting Textbooks with cQA Question-Answers and Annotated YouTube Videos to Increase Its RelevanceNeural Processing Letters10.1007/s11063-022-10897-455:1(551-588)Online publication date: 30-Jun-2022
    • (2021)TSAR-based Expert Recommendation Mechanism for Community Question Answering2021 IEEE 24th International Conference on Computer Supported Cooperative Work in Design (CSCWD)10.1109/CSCWD49262.2021.9437843(162-167)Online publication date: 5-May-2021
    • (2021)Selecting the most helpful answers in online health question answering communitiesJournal of Intelligent Information Systems10.1007/s10844-021-00640-1Online publication date: 18-May-2021
    • (2020)How Fast and Effectively Can Code Change History Enrich Stack Overflow?2020 IEEE 20th International Conference on Software Quality, Reliability and Security (QRS)10.1109/QRS51102.2020.00066(467-478)Online publication date: Dec-2020
    • (2019)Why is Developing Machine Learning Applications Challenging? A Study on Stack Overflow Posts2019 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)10.1109/ESEM.2019.8870187(1-11)Online publication date: Sep-2019

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media