Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1099554.1099725acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Query expansion using term relationships in language models for information retrieval

Published: 31 October 2005 Publication History

Abstract

Language Modeling (LM) has been successfully applied to Information Retrieval (IR). However, most of the existing LM approaches only rely on term occurrences in documents, queries and document collections. In traditional unigram based models, terms (or words) are usually considered to be independent. In some recent studies, dependence models have been proposed to incorporate term relationships into LM, so that links can be created between words in the same sentence, and term relationships (e.g. synonymy) can be used to expand the document model. In this study, we further extend this family of dependence models in the following two ways: (1) Term relationships are used to expand query model instead of document model, so that query expansion process can be naturally implemented; (2) We exploit more sophisticated inferential relationships extracted with Information Flow (IF). Information flow relationships are not simply pairwise term relationships as those used in previous studies, but are between a set of terms and another term. They allow for context-dependent query expansion. Our experiments conducted on TREC collections show that we can obtain large and significant improvements with our approach. This study shows that LM is an appropriate framework to implement effective query expansion.

References

[1]
A. Berger and J. Lafferty (1999). Information Retrieval as Statistical Translation. In Proceedings of the 22th ACM SIGIR Conference on Research and Development in IR, pp.222--229.
[2]
P. D. Bruza and D. Song (2002). Inferring Query Models by Computing Information Flow. In Proceedings of the 11th International ACM Conference on Information and Knowledge Management, pp.260--269.
[3]
C. Burgess, K. Livesay and K. Lund (1998). Explorations in Context Space: Words, Sentences, Discourse. Discourse Processes, 25(2&3), 211--257.
[4]
G. Cao, J. Y. Nie and J. Bai (2005). Integrating Term Relationships into Language Models. In Proceedings of the 28th ACM SIGIR Conference on Research and Development in IR, pp.298--305.
[5]
W.B. Croft and J. Lafferty (2002). Language Models for Information Retrieval. Kluwer Int. Series on Information Retrieval, Vol. 13, Kluwer Academic Publishers.
[6]
J. F. Gao, J. Y. Nie, G. Wu and G. Cao (2004). Dependence Language Model for Information Retrieval. In Proceedings of the 27th ACM SIGIR Conference on Research and Development in IR, pp.170--177.
[7]
J. F. Gao, J. Y. Nie, J. Zhang, E. Xun, M. Zhou and C. Huang (2001). Improving Query Translation for CLIR using Statistical Models. In Proceedings of the 24th ACM SIGIR Conference on Research and Development in IR, pp. 96--104.
[8]
J. Lafferty and C. Zhai (2001). Document Language Models, Query Models, and Risk Minimization for Information Retrieval. In Proceedings of the 24th ACM SIGIR Conference on Research and Development in IR, pp.111--119.
[9]
V. Lavrenko and W. B. Croft (2001). Relevance-based Language Models. In Proceedings of the 24th ACM SIGIR Conference on Research and Development in IR, pp.120--127.
[10]
K. Lund and C. Burgess (1996). Producing High-dimensional Semantic Spaces from Lexical Co-occurrence. Behavior Research Methods, Instruments, & Computers, 28(2), 203--208.
[11]
K. Ng. (1999). A Maximum Likelihood Ratio Information Retrieval Model. In TREC-8 Workshop notebook.
[12]
J. Ponte and W. B. Croft (1998). A Language Modeling Approach to Information Retrieval. In Proceedings of the 21st ACM SIGIR Conference on Research and Development in IR, pp.275--281.
[13]
Y. Qiu and H. P. Frei (1993). Concept Based Query Expansion. In Proceedings of the 16th ACM SIGIR Conference on Research and Development in IR, pp.160--169.
[14]
H. Schutze and J. O. Pedersen (1997). A Co-occurrence based Thesaurus and Two Applications to Information Retrieval. Information Processing and Management, 33(3), 307--318.
[15]
D. Song and P. D. Bruza (2003). Towards Context-sensitive Information Inference. Journal of the American Society for Information Science and Technology (JASIST), Vol. 54, 321--334.
[16]
J. Xu and W. B. Croft (1996). Query Expansion Using Local and Global Document Analysis. In Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in IR, pp.4--11.
[17]
C. Zhai and J. Lafferty (2001). A Study of Smoothing Methods for Language Models Applied to Ad hoc Information Retrieval. In Proceedings of the 24th ACM SIGIR Conference on Research and Development in IR, pp.334--342.
[18]
C. Zhai and J. Lafferty (2001). Model-based Feedback in the Language Modeling Approach to Information Retrieval. In Proceedings of the 10th International Conference on Information and Knowledge Management, pp.403--410.

Cited By

View all
  • (2024)Envisioning Information Access Systems: What Makes for Good Tools and a Healthy Web?ACM Transactions on the Web10.1145/364946818:3(1-24)Online publication date: 26-Feb-2024
  • (2023)Recent Query Reformulation Approaches for Information Retrieval System - A SurveyRecent Advances in Computer Science and Communications10.2174/266625581566622040409192016:1Online publication date: Jan-2023
  • (2023)A discriminative method for global query expansion and term reweighting using co-occurrence graphsJournal of Information Science10.1177/016555152199804749:1(183-206)Online publication date: 1-Feb-2023
  • Show More Cited By

Index Terms

  1. Query expansion using term relationships in language models for information retrieval

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management
    October 2005
    854 pages
    ISBN:1595931406
    DOI:10.1145/1099554
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 31 October 2005

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. information flow
    2. language model
    3. query expansion
    4. term relationships

    Qualifiers

    • Article

    Conference

    CIKM05
    Sponsor:
    CIKM05: Conference on Information and Knowledge Management
    October 31 - November 5, 2005
    Bremen, Germany

    Acceptance Rates

    CIKM '05 Paper Acceptance Rate 77 of 425 submissions, 18%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)31
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Envisioning Information Access Systems: What Makes for Good Tools and a Healthy Web?ACM Transactions on the Web10.1145/364946818:3(1-24)Online publication date: 26-Feb-2024
    • (2023)Recent Query Reformulation Approaches for Information Retrieval System - A SurveyRecent Advances in Computer Science and Communications10.2174/266625581566622040409192016:1Online publication date: Jan-2023
    • (2023)A discriminative method for global query expansion and term reweighting using co-occurrence graphsJournal of Information Science10.1177/016555152199804749:1(183-206)Online publication date: 1-Feb-2023
    • (2023)SPRF: A semantic Pseudo-relevance Feedback enhancement for information retrieval via ConceptNetKnowledge-Based Systems10.1016/j.knosys.2023.110602274(110602)Online publication date: Aug-2023
    • (2022)Automatic Recognition and Extraction of English Verb Types Based on Index Line ClusteringMobile Information Systems10.1155/2022/26526222022Online publication date: 1-Jan-2022
    • (2022)A Multi-Dimensional Semantic Pseudo-Relevance Feedback Information Retrieval Model2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)10.1109/WI-IAT55865.2022.00141(866-872)Online publication date: Nov-2022
    • (2022)A probabilistic framework for integrating sentence-level semantics via BERT into pseudo-relevance feedbackInformation Processing and Management: an International Journal10.1016/j.ipm.2021.10273459:1Online publication date: 9-Apr-2022
    • (2022)A COVID-19 Search Engine (CO-SE) with Transformer-based architectureHealthcare Analytics10.1016/j.health.2022.1000682(100068)Online publication date: Nov-2022
    • (2022)An automatic query expansion based on hybrid CMO-COOT algorithm for optimized information retrievalThe Journal of Supercomputing10.1007/s11227-021-04171-y78:6(8625-8643)Online publication date: 12-Jan-2022
    • (2021)Time segment language model for microblog retrievalNeural Computing and Applications10.1007/s00521-020-05534-xOnline publication date: 3-Jan-2021
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media