Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Deep Learning Approaches to Semantic Relevance Modeling for Chinese Question-Answer Pairs

Published: 01 December 2011 Publication History

Abstract

The human-generated question-answer pairs in the Web social communities are of great value for the research of automatic question-answering technique. Due to the large amount of noise information involved in such corpora, it is still a problem to detect the answers even though the questions are exactly located. Quantifying the semantic relevance between questions and their candidate answers is essential to answer detection in social media corpora. Since both the questions and their answers usually contain a small number of sentences, the relevance modeling methods have to overcome the problem of word feature sparsity. In this article, the deep learning principle is introduced to address the semantic relevance modeling task. Two deep belief networks with different architectures are proposed by us to model the semantic relevance for the question-answer pairs. According to the investigation of the textual similarity between the community-driven question-answering (cQA) dataset and the forum dataset, a learning strategy is adopted to promote our models’ performance on the social community corpora without hand-annotating work. The experimental results show that our method outperforms the traditional approaches on both the cQA and the forum corpora.

References

[1]
Agichtein, E., Castillo, C., Donato, D., Gionis, A., and Mishne, G. 2008. Finding high-quality content in social media. In Proceedings of the International Conference on Web Search and Web Data Mining (WSDM’08). 183--194.
[2]
Berger, A., Caruana, R., Cohn, D., Freitag, D., and Mittal, V. 2000. Bridging the lexical chasm: Statistical approaches to answering. In Proceedings of the 23rd Annual International ACM Conference on Research and Development in Information Retrieval (SIGIR’00). 192--199.
[3]
Bernhard, D. and Gurevych, I. 2009. Combining lexical semantic resources with question & answer archives for translation-based answering. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (IJCNLP’09). 728--736.
[4]
Cong, G., Wang, L., Lin, C.-Y., Song, Y.-I., and Sun, Y. 2008. Finding question-answer pairs from online forums. In Proceedings of the 31st Annual International ACM Conference on Research and Development in Information Retrieval (SIGIR’08). 467--474.
[5]
Ding, S., Cong, G., Lin, C.-Y., and Zhu, X. 2008. Using conditional random fields to extract contexts and answers of questions from online forums. In Proceedings of the Conference on Human Language Translation (HLT’08). 710--718.
[6]
Duan, H., Cao, Y., Lin, C.-Y., and Yu, Y. 2008. Searching questions by identifying question topic and question focus. In Proceedings of the Conference on Human Language Translation (HLT’08). 156--164.
[7]
Feng, D., Shaw, E., Kim, J., and Hovy, E. H. 2006. An intelligent discussion-bot for answering student queries in threaded discussions. In Proceedings of the International Conference on Intelligent User Interfaces. C. Paris and C. L. Sidner Eds., ACM, 171--177.
[8]
Hinton, G. E. 2002. Training products of experts by minimizing contrastive divergence. Neural Comput. 14.
[9]
Hinton, G. E. and Osindero, S. 2006. A fast learning algorithm for deep belief nets. Neural Comput. 18.
[10]
Hinton, G. E. and Salakhutdinov, R. R. 2006. Reducing the dimensionality of data with neural networks. Science 313, 5786, 504--507.
[11]
Hong, L. and Davison, B. D. 2009. A classification-based approach to question answering in discussion boards. In Proceedings of the 32nd International ACM Conference on Research and Development in Information Retrieval (SIGIR’09). 171--178.
[12]
Huang, J., Zhou, M., and Yang, D. 2007. Extracting chatbot knowledge from online discussion forums. In Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI’07). 423--428.
[13]
Jeon, J., Croft, W. B., and Lee, J. H. 2005. Finding similar questions in large question and answer archives. In Proceedings of the International Conference on Information and Knowledge Management (CIKM’05). 84--90.
[14]
Jeon, J., Croft, W. B., Lee, J. H., and Park, S. 2006. A framework to predict the quality of answers with non-textual features. In Proceedings of the International ACM Conference on Research and Development in Information Retrieval (SIGIR’06). 228--235.
[15]
Jijkoun, V. and de Rijke, M. 2005. Retrieving answers from frequently asked questions pages on the web. In Proceedings of the International Conference on Information and Knowledge Management (CIKM’05). 76--83.
[16]
Lee, J.-T., Kim, S.-B., Song, Y.-I., and Rim, H.-C. 2008. Bridging lexical gaps between queries and questions on large online Q&A collections with compact translation models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’08). 410--418.
[17]
Li, F., Tang, Y., Huang, M., and Zhu, X. 2009. Answering opinion questions with random walks on graphs. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (IJCNLP’09). 737--745.
[18]
Riezler, S., Vasserman, A., Tsochantaridis, I., Mittal, V., and Liu, Y. 2007. Statistical machine translation for query expansion in answer retrieval. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics (ACL’07). 464--471.
[19]
Salakhutdinov, R. and Hinton, G. 2009. Semantic hashing. Int. J. Approx. Reasoning 50, 7, 969--978.
[20]
Shrestha, L. and McKeown, K. 2004. Detection of question-answer pairs in email conversations. In Proceedings of the International Conference on Computer Linguistics (COLING’04). 889--895.
[21]
Surdeanu, M., Ciaramita, M., and Zaragoza, H. 2008. Learning to rank answers on large online QA collections. In Proceedings of the Conference on Human Language Translation (HLT’08). 719--727.
[22]
Tomasoni, M. and Huang, M. 2010. Metadata-aware measures for answer summarization in community question answering. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL’10). 760--769.
[23]
Wang, B., Liu, B., Sun, C., Wang, X., and Sun, L. 2009. Extracting Chinese question-answer pairs from online forums. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (SMC’09). 1159--1164.

Cited By

View all
  • (2022)Evaluating Simple and Complex Models’ Performance When Predicting Accepted Answers on Stack Overflow2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)10.1109/SEAA56994.2022.00014(29-38)Online publication date: Aug-2022
  • (2021)Recognizing Named Entity in Electric Power DocumentsAdvances in Natural Computation, Fuzzy Systems and Knowledge Discovery10.1007/978-3-030-70665-4_71(652-660)Online publication date: 27-Jun-2021
  • (2020)The influence of semantic link network on the ability of question-answering systemFuture Generation Computer Systems10.1016/j.future.2020.02.042108(1-14)Online publication date: Jul-2020
  • Show More Cited By

Index Terms

  1. Deep Learning Approaches to Semantic Relevance Modeling for Chinese Question-Answer Pairs

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Asian Language Information Processing
    ACM Transactions on Asian Language Information Processing  Volume 10, Issue 4
    December 2011
    112 pages
    ISSN:1530-0226
    EISSN:1558-3430
    DOI:10.1145/2025384
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 December 2011
    Accepted: 01 April 2011
    Revised: 01 February 2011
    Received: 01 November 2010
    Published in TALIP Volume 10, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Deep belief network
    2. question-answer pairs
    3. semantic relevance

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 21 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Evaluating Simple and Complex Models’ Performance When Predicting Accepted Answers on Stack Overflow2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)10.1109/SEAA56994.2022.00014(29-38)Online publication date: Aug-2022
    • (2021)Recognizing Named Entity in Electric Power DocumentsAdvances in Natural Computation, Fuzzy Systems and Knowledge Discovery10.1007/978-3-030-70665-4_71(652-660)Online publication date: 27-Jun-2021
    • (2020)The influence of semantic link network on the ability of question-answering systemFuture Generation Computer Systems10.1016/j.future.2020.02.042108(1-14)Online publication date: Jul-2020
    • (2020)Social QA in non-CQA platformsFuture Generation Computer Systems10.1016/j.future.2019.12.023105:C(631-649)Online publication date: 1-Apr-2020
    • (2019)Learning semantic representation with neural networks for community question answering retrievalKnowledge-Based Systems10.1016/j.knosys.2015.11.00293:C(75-83)Online publication date: 1-Jan-2019
    • (2018)A unified model for document-based question answering based on human-like reading strategyProceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence10.5555/3504035.3504110(604-611)Online publication date: 2-Feb-2018
    • (2018)Response Selection and Automatic Message-Response Expansion in Retrieval-Based QA Systems using Semantic Dependency Pair ModelACM Transactions on Asian and Low-Resource Language Information Processing10.1145/322918418:1(1-24)Online publication date: 12-Nov-2018
    • (2017)Using re-ranking to boost deep learning based community question retrievalProceedings of the International Conference on Web Intelligence10.1145/3106426.3106442(807-814)Online publication date: 23-Aug-2017
    • (2017)Sparseness Analysis in the Pretraining of Deep Neural NetworksIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2016.254168128:6(1425-1438)Online publication date: Jun-2017
    • (2017)Modeling and Learning Distributed Word Representation with Metadata for Question RetrievalIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.266562529:6(1226-1239)Online publication date: 1-Jun-2017
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media