DOI: 10.1007/978-3-642-28601-8_27
Article

Bootstrap-Based Equivalent Pattern Learning for Collaborative Question Answering

Published: 11 March 2012

Abstract

Semantically similar questions are repeatedly submitted to collaborative question answering systems even though equivalent questions have already received best answers. To address this problem, we propose a precise approach that automatically finds an answer to such a question by identifying "equivalent" questions that have already been submitted and answered. Our method is based on a new pattern generation method, T-IPG, which automatically extracts equivalent question patterns. Taking the patterns extracted from training data as seed patterns, we further propose a bootstrap-based pattern learning method that extends these seeds with additional equivalent patterns. The resulting patterns can be used to match a new question to an equivalent one that has already been answered, and thus to suggest potential answers automatically. We experimented with this approach on a large collection of more than 200,000 real questions drawn from the Yahoo! Answers archive, automatically acquiring over 16,991 equivalent question patterns. These patterns allow our method to obtain over 57% recall and over 54% precision in automatically suggesting an answer to new questions, significantly improving over baseline methods.
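The general idea of growing a pattern set from seeds and then matching new questions against it can be illustrated with a minimal sketch. This is not the T-IPG method or the authors' bootstrap procedure, whose details the abstract does not give. It assumes a toy representation in which a pattern is a string template with a single `<X>` slot, all patterns belong to one equivalence class, and a list of question pairs known to be equivalent is available. The names `normalize`, `match`, `bootstrap`, `suggest_answer`, and the sample data are hypothetical.

```python
import re

SLOT = "<X>"  # assumed placeholder for the variable part of a question


def normalize(text):
    """Lowercase, strip punctuation, and collapse whitespace."""
    text = re.sub(r"[^a-z0-9 ]", "", text.lower())
    return re.sub(r"\s+", " ", text).strip()


def match(pattern, question):
    """Return the slot filler if the question fits the pattern, else None.

    Assumes the pattern contains exactly one SLOT.
    """
    prefix, suffix = pattern.split(SLOT)
    regex = "^" + re.escape(prefix) + "(.+?)" + re.escape(suffix) + "$"
    m = re.match(regex, normalize(question))
    return m.group(1) if m else None


def bootstrap(seed_patterns, equivalent_pairs, rounds=3):
    """Extend seed patterns using pairs of questions known to be equivalent.

    If one question of a pair matches a known pattern and its slot filler
    also occurs in the other question, the other question is generalized
    into a new pattern by replacing the filler with SLOT.
    """
    patterns = set(seed_patterns)
    for _ in range(rounds):
        learned = set()
        for q1, q2 in equivalent_pairs:
            for a, b in ((q1, q2), (q2, q1)):
                b_norm = normalize(b)
                for p in patterns:
                    filler = match(p, a)
                    if filler and filler in b_norm and filler != b_norm:
                        learned.add(b_norm.replace(filler, SLOT, 1))
        if learned <= patterns:  # nothing new was learned, so stop early
            break
        patterns |= learned
    return patterns


def suggest_answer(question, patterns, archive):
    """Look up an answer by slot filler; archive maps filler -> best answer."""
    for p in patterns:
        filler = match(p, question)
        if filler in archive:
            return archive[filler]
    return None


# Toy data, invented for illustration only.
seeds = {"how do i <X>"}
pairs = [("How do I reset my router?",
          "What is the best way to reset my router?")]
archive = {"reset my router": "Hold the reset button for ten seconds."}

patterns = bootstrap(seeds, pairs)
print(sorted(patterns))
print(suggest_answer("What is the best way to reset my router?",
                     patterns, archive))
```

A real system would need several equivalence classes, multi-slot patterns, and a filter against overly general patterns; the sketch omits all of these to keep the seed-then-extend loop visible.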


Cited By

  • (2015) Recognition of Patient-Related Named Entities in Noisy Tele-Health Texts. ACM Transactions on Intelligent Systems and Technology 6:4 (1-23). DOI: 10.1145/2651444. Online publication date: 24-Jul-2015

      Published In

      CICLing'12: Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part II
      March 2012
      514 pages
      ISBN: 9783642286001
      Editor: Alexander Gelbukh

      Sponsors

      • Springer
      • National Polytechnic Institute, Mexico
      • The Association for Computational Linguistics
      • Natural Language and Text Processing Laboratory, CIC-IPN
      • Indian Institute of Technology, Delhi

      Publisher

      Springer-Verlag

      Berlin, Heidelberg

      Author Tags

      1. bootstrap
      2. collaborative question answering
      3. equivalent pattern
      4. pattern extension
