research-article

Open Domain Question Answering via Semantic Enrichment

Authors:

Ming-Wei ChangAuthors Info & Claims

WWW '15: Proceedings of the 24th International Conference on World Wide Web

Pages 1045 - 1055

https://doi.org/10.1145/2736277.2741651

Published: 18 May 2015 Publication History

Abstract

Most recent question answering (QA) systems query large-scale knowledge bases (KBs) to answer a question, after parsing and transforming natural language questions to KBs-executable forms (e.g., logical forms). As a well-known fact, KBs are far from complete, so that information required to answer questions may not always exist in KBs. In this paper, we develop a new QA system that mines answers directly from the Web, and meanwhile employs KBs as a significant auxiliary to further boost the QA performance. Specifically, to the best of our knowledge, we make the first attempt to link answer candidates to entities in Freebase, during answer candidate generation. Several remarkable advantages follow: (1) Redundancy among answer candidates is automatically reduced. (2) The types of an answer candidate can be effortlessly determined by those of its corresponding entity in Freebase. (3) Capitalizing on the rich information about entities in Freebase, we can develop semantic features for each answer candidate after linking them to Freebase. Particularly, we construct answer-type related features with two novel probabilistic models, which directly evaluate the appropriateness of an answer candidate's types under a given question. Overall, such semantic features turn out to play significant roles in determining the true answers from the large answer candidate pool. The experimental results show that across two testing datasets, our QA system achieves an 18%~54% improvement under F_1 metric, compared with various existing QA systems.

References

[1]

K. Balog and R. Neumayer. Hierarchical target type identification for entity-oriented queries. In CIKM, pages 2391--2394. ACM, 2012.

Digital Library

[2]

J. Berant, A. Chou, R. Frostig, and P. Liang. Semantic parsing on Freebase from question-answer pairs. In EMNLP, pages 1533--1544, 2013.

[3]

J. Berant and P. Liang. Semantic parsing via paraphrasing. In ACL, 2014.

[4]

C. Bishop et al. Pattern recognition and machine learning, volume 1. springer New York, 2006.

Digital Library

[5]

D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. JMLR, 3:993--1022, 2003.

Digital Library

[6]

K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor. Freebase: a collaboratively created graph database for structuring human knowledge. In SIGMOD, pages 1247--1250. ACM, 2008.

Digital Library

[7]

E. Brill, S. Dumais, and M. Banko. An analysis of the AskMSR question-answering system. In EMNLP, pages 257--264, 2002.

Digital Library

[8]

E. Brill, J. J. Lin, M. Banko, S. T. Dumais, and A. Y. Ng. Data-intensive question answering. In TREC, 2001.

[9]

C. Burges. From RankNet to LambdaRank to LambdaMART: An overview. Learning, 11:23--581, 2010.

[10]

C. Burges, K. M. Svore, P. N. Bennett, A. Pastusiak, and Q. Wu. Learning to rank using an ensemble of lambda-gradient models. In Yahoo! Learning to Rank Challenge, pages 25--35, 2011.

Digital Library

[11]

S. Chaturvedi, V. Castelli, R. Florian, R. M. Nallapati, and H. Raghavan. Joint question clustering and relevance prediction for open domain non-factoid question answering. In WWW, pages 503--514, 2014.

Digital Library

[12]

J. Chu-Carroll, J. Prager, C. Welty, K. Czuba, and D. Ferrucci. A multi-strategy and multi-source approach to question answering. Technical report, DTIC Document, 2006.

[13]

S. Cucerzan and A. Sil. The msr systems for entity linking and temporal slot filling at TAC 2013. In Text Analysis Conference, 2013.

[14]

X. Dong, K. Murphy, E. Gabrilovich, G. Heitz, W. Horn, N. Lao, T. Strohmann, S. Sun, and W. Zhang. Knowledge vault: A Web-scale approach to probabilistic knowledge fusion. In SIGKDD, pages 601--610, 2014.

Digital Library

[15]

O. Etzioni. Search needs a shake-up. Nature, 476(7358):25--26, 2011.

[16]

A. Fader, S. Soderland, and O. Etzioni. Identifying relations for open information extraction. In EMNLP, pages 1535--1545, 2011.

Digital Library

[17]

A. Fader, L. Zettlemoyer, and O. Etzioni. Paraphrase-driven learning for open question answering. In ACL, pages 1608--1618, 2013.

[18]

A. Fader, L. Zettlemoyer, and O. Etzioni. Open question answering over curated and extracted knowledge bases. In SIGKDD. ACM, 2014.

Digital Library

[19]

C. Fellbaum. WordNet: An electronic lexical database. 1998. http://www. cogsci. princeton. edu/wn, 2010.

[20]

D. Ferrucci, E. Brown, J. Chu-Carroll, J. Fan, D. Gondek, A. A. Kalyanpur, A. Lally, J. Murdock, E. Nyberg, J. Prager, et al. Building watson: An overview of the DeepQA project. AI magazine, 31(3):59--79, 2010.

Digital Library

[21]

J. Friedman. Greedy function approximation: a gradient boosting machine. Annals of Statistics, pages 1189--1232, 2001.

[22]

S. Harabagiu, D. Moldovan, M. Pasca, R. Mihalcea, M. Surdeanu, R. Bunescu, R. Girju, V. Rus, and P. Morarescu. FALCON: Boosting knowledge for answer engines. In TREC, volume 9, pages 479--488, 2000.

[23]

G. Hinton and R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786):504--507, 2006.

[24]

E. Hovy, L. Gerber, U. Hermjakob, M. Junk, and C. Lin. Question answering in Webclopedia. In TREC, volume 9, 2000.

[25]

J. Ko, E. Nyberg, and L. Si. A probabilistic graphical model for joint answer ranking in question answering. In SIGIR on Rearch and Development in IR, pages 343--350. ACM, 2007.

Digital Library

[26]

C. Kwok, O. Etzioni, and D. Weld. Scaling question answering to the Web. TOIS, 19(3):242--262, 2001.

Digital Library

[27]

A. Lally, J. Prager, M. McCord, B. Boguraev, S. Patwardhan, J. Fan, P. Fodor, and J. Chu-Carroll. Question analysis: How watson reads a clue. IBM Journal of Research and Development, 56(3.4):2--1, 2012.

Digital Library

[28]

X. Li and D. Roth. Learning question classifiers. In ICCL, pages 1--7, 2002.

Digital Library

[29]

D. C. Liu and J. Nocedal. On the limited memory BFGS method for large scale optimization. Mathematical programming, 45(1-3):503--528, 1989.

Digital Library

[30]

X. Luo, H. Raghavan, V. Castelli, S. Maskey, and R. Florian. Finding what matters in questions. In HLT-NAACL, pages 878--887, 2013.

[31]

E. Marsh and D. Perzanowski. MUC-7 evaluation of ie technology: Overview of results. In MUC-7, volume 20, 1998.

[32]

B. Min, R. Grishman, L. Wan, C. Wang, and D. Gondek. Distant supervision for relation extraction with an incomplete knowledge base. In HLT-NAACL, pages 777--782, 2013.

[33]

J. W. Murdock, A. Kalyanpur, C. Welty, J. Fan, D. A. Ferrucci, D. Gondek, L. Zhang, and H. Kanayama. Typing candidate answers using type coercion. IBM Journal of Research and Development, 56(3.4):7--1, 2012.

Digital Library

[34]

S. Na, I. Kang, S. Lee, and J. Lee. Question answering approach using a WordNet-based answer type taxonomy. In TREC, 2002.

[35]

C. Pinchak and D. Lin. A probabilistic answer type model. In EACL, 2006.

[36]

S. Robertson and H. Zaragoza. On rank-based effectiveness measures and optimization. Information Retrieval, 10(3):321--339, 2007.

Digital Library

[37]

N. Schlaefer, P. Gieselmann, T. Schaaf, and A. Waibel. A pattern learning approach to question answering within the ephyra framework. In Text, speech and dialogue, pages 687--694. Springer, 2006.

Digital Library

[38]

F. M. Suchanek, G. Kasneci, and G. Weikum. YAGO: a core of semantic knowledge. In WWW, pages 697--706. ACM, 2007.

Digital Library

[39]

C. Tsai, W. Yih, and C. Burges. Web-based question answering: Revisiting AskMSR. Technical Report MSR-TR-2015-20, Microsoft Research, 2015.

[40]

C. Unger, L. Buhmann, J. Lehmann, A. Ngonga Ngomo, D. Gerber, and P. Cimiano. Template-based question answering over RDF data. In WWW, pages 639--648. ACM, 2012.

Digital Library

[41]

E. M. Voorhees and D. M. Tice. Building a question answering test collection. In SIGIR on Rearch and Development in IR, pages 200--207. ACM, 2000.

Digital Library

[42]

R. West, E. Gabrilovich, K. Murphy, S. Sun, R. Gupta, and D. Lin. Knowledge base completion via search-based question answering. In WWW, pages 515--526, 2014.

Digital Library

[43]

R. W. White, M. Richardson, and W. Yih. Questions vs. queries in informational search tasks. Technical Report MSR-TR-2014-96, Microsoft Research, 2014.

[44]

M. Yahya, K. Berberich, S. Elbassuoni, M. Ramanath, V. Tresp, and G. Weikum. Natural language questions for the Web of data. In EMNLP-CoNLL, pages 379--390, 2012.

Digital Library

[45]

X. Yao and B. Van Durme. Information extraction over structured data: Question answering with Freebase. In ACL, 2014.

[46]

L. Zou, R. Huang, H. Wang, J. X. Yu, W. He, and D. Zhao. Natural language question answering over RDF: a graph data driven approach. In SIGMOD, pages 313--324. ACM, 2014.

Digital Library

Cited By

Setty VKlein MBen-David AJäschke RKelly M(2024)Extreme Classification for Answer Type Prediction in Question AnsweringProceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries10.1109/JCDL57899.2023.00041(232-236)Online publication date: 26-Jun-2024
https://dl.acm.org/doi/10.1109/JCDL57899.2023.00041
Sharma SJain STiwari Mlal S(2024)Classifying the state of knowledge-based question answering: patterns, progress, and prospectsInternational Journal of Computers and Applications10.1080/1206212X.2024.2426512(1-13)Online publication date: 19-Nov-2024
https://doi.org/10.1080/1206212X.2024.2426512
Andreasen TBordogna GTré GKacprzyk JLarsen HZadrożny S(2024)The power and potentials of Flexible Query Answering Systems: A critical and comprehensive analysisData & Knowledge Engineering10.1016/j.datak.2023.102246149(102246)Online publication date: Jan-2024
https://doi.org/10.1016/j.datak.2023.102246
Show More Cited By

Index Terms

Open Domain Question Answering via Semantic Enrichment
1. Computing methodologies
  1. Artificial intelligence
    1. Knowledge representation and reasoning
      1. Logic programming and answer set programming
2. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Recommendations

Table Cell Search for Question Answering
WWW '16: Proceedings of the 25th International Conference on World Wide Web

Tables are pervasive on the Web. Informative web tables range across a large variety of topics, which can naturally serve as a significant resource to satisfy user information needs. Driven by such observations, in this paper, we investigate an ...
Fusing Essential Knowledge for Text-Based Open-Domain Question Answering
Advances in Knowledge Discovery and Data Mining
Abstract
Question answering (QA) systems can be classified as either text-based QA systems or knowledge base QA (KBQA) systems, depending on the used knowledge source. KBQA systems are generally domain-specific and can’t deal with a variety of questions in ...
Quality-aware collaborative question answering: methods and evaluation
WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining

Community Question Answering (QA) portals contain questions and answers contributed by hundreds of millions of users. These databases of questions and answers are of great value if they can be used directly to answer questions from any user. In this ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '15: Proceedings of the 24th International Conference on World Wide Web

May 2015

1460 pages

ISBN:9781450334693

General Chairs:
Aldo Gangemi
National Research Council, Italy & Paris 13 University-CNRS, France
,
Stefano Leonardi
Sapienza University of Rome, Italy
,
Alessandro Panconesi
Sapienza University of Rome, Italy

Copyright © 2015 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

Sponsors

IW3C2: International World Wide Web Conference Committee

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 18 May 2015

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

WWW '15

Sponsor:

IW3C2

WWW '15: 24th International World Wide Web Conference

May 18 - 22, 2015

Florence, Italy

Acceptance Rates

WWW '15 Paper Acceptance Rate 131 of 929 submissions, 14%;

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

51
Total Citations
View Citations
675
Total Downloads

Downloads (Last 12 months)15
Downloads (Last 6 weeks)1

Reflects downloads up to 23 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Setty VKlein MBen-David AJäschke RKelly M(2024)Extreme Classification for Answer Type Prediction in Question AnsweringProceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries10.1109/JCDL57899.2023.00041(232-236)Online publication date: 26-Jun-2024
https://dl.acm.org/doi/10.1109/JCDL57899.2023.00041
Sharma SJain STiwari Mlal S(2024)Classifying the state of knowledge-based question answering: patterns, progress, and prospectsInternational Journal of Computers and Applications10.1080/1206212X.2024.2426512(1-13)Online publication date: 19-Nov-2024
https://doi.org/10.1080/1206212X.2024.2426512
Andreasen TBordogna GTré GKacprzyk JLarsen HZadrożny S(2024)The power and potentials of Flexible Query Answering Systems: A critical and comprehensive analysisData & Knowledge Engineering10.1016/j.datak.2023.102246149(102246)Online publication date: Jan-2024
https://doi.org/10.1016/j.datak.2023.102246
Franciscatto MErpen de Bona LTrois CDidonet Del FabroFabro MDamasceno Lima J(2024)Situational Data Integration in Question Answering systems: a survey over two decadesKnowledge and Information Systems10.1007/s10115-024-02136-066:10(5875-5918)Online publication date: 18-Jun-2024
https://doi.org/10.1007/s10115-024-02136-0
Guo MZhou ZGotz DWang Y(2023)GRAFS: Graphical Faceted Search System to Support Conceptual Understanding in Exploratory SearchACM Transactions on Interactive Intelligent Systems10.1145/358831913:2(1-36)Online publication date: 31-Mar-2023
https://dl.acm.org/doi/10.1145/3588319
Peng Y(2023)Query-Driven Knowledge Graph Construction using Question Answering and Multimodal FusionCompanion Proceedings of the ACM Web Conference 202310.1145/3543873.3587567(1119-1126)Online publication date: 30-Apr-2023
https://dl.acm.org/doi/10.1145/3543873.3587567
Wang NChen TRen CWang H(2023)Document-level relation extraction with multi-layer heterogeneous graph attention networkEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.106212123(106212)Online publication date: Aug-2023
https://doi.org/10.1016/j.engappai.2023.106212
Qamar FLatif SShah A(2023)Techniques, datasets, evaluation metrics and future directions of a question answering systemKnowledge and Information Systems10.1007/s10115-023-02019-w66:4(2235-2268)Online publication date: 22-Dec-2023
https://dl.acm.org/doi/10.1007/s10115-023-02019-w
Chi RLiu X(2023)Improvement of Graph Convolution Network of Missing Data Based on P SystemsAdvanced Intelligent Computing Technology and Applications10.1007/978-981-99-4752-2_25(298-309)Online publication date: 31-Jul-2023
https://doi.org/10.1007/978-981-99-4752-2_25
Houssein EIbrahem NZaki ASayed A(2022)Semantic Protocol and Resource Description Framework Query Language: A Comprehensive ReviewMathematics10.3390/math1017320310:17(3203)Online publication date: 5-Sep-2022
https://doi.org/10.3390/math10173203
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents