Mining the web for answers to natural language questions

DR Radev, H Qi, Z Zheng, S Blair-Goldensohn… - Proceedings of the …, 2001 - dl.acm.org
DR Radev, H Qi, Z Zheng, S Blair-Goldensohn, Z Zhang, W Fan, J Prager
Proceedings of the tenth international conference on Information and …, 2001dl.acm.org
The web is now becoming one of the largest information and knowledge repositories. Many
large scale search engines (Google, Fast, Northern Light, etc.) have emerged to help users
find information. In this paper, we study how we can effectively use these existing search
engines to mine the Web and discover the" correct" answers to factual natural language
questions. We propose a probabilistic algorithm called QASM (Question Answering using
Statistical Models) that learns the best query paraphrase of a natural language question. We …
The web is now becoming one of the largest information and knowledge repositories. Many large scale search engines (Google, Fast, Northern Light, etc.) have emerged to help users find information. In this paper, we study how we can effectively use these existing search engines to mine the Web and discover the "correct" answers to factual natural language questions.We propose a probabilistic algorithm called QASM (Question Answering using Statistical Models) that learns the best query paraphrase of a natural language question. We validate our approach for both local and web search engines using questions from the TREC evaluation. We also show how this algorithm can be combined with another algorithm (AnSel) to produce precise answers to natural language questions.
ACM Digital Library