Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1835449.1835506acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

DivQ: diversification for keyword search over structured databases

Published: 19 July 2010 Publication History

Abstract

Keyword queries over structured databases are notoriously ambiguous. No single interpretation of a keyword query can satisfy all users, and multiple interpretations may yield overlapping results. This paper proposes a scheme to balance the relevance and novelty of keyword search results over structured databases. Firstly, we present a probabilistic model which effectively ranks the possible interpretations of a keyword query over structured data. Then, we introduce a scheme to diversify the search results by re-ranking query interpretations, taking into account redundancy of query results. Finally, we propose α-nDCG-W and WS-recall, an adaptation of α-nDCG and S-recall metrics, taking into account graded relevance of subtopics. Our evaluation on two real-world datasets demonstrates that search results obtained using the proposed diversification algorithms better characterize possible answers available in the database than the results of the initial relevance ranking.

References

[1]
Agrawal, R., Gollapudi, S., Halverson, A., & Leong, S. Diversifying Search Results. WSDM 2009.
[2]
Carbonell, J., & Goldstein, J. The use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries. In Proceedings of the SIGIR 1998.
[3]
Chakaravarthy, V. T., Gupta, H., Roy, P., & Mohania, M. Efficiently Linking Text Documents with Relevant Structured Information. In Proceedings of the VLDB 2006.
[4]
Chen, H., & Karger, D. R. Less is More. Probabilistic Models for Retrieving Fewer Relevant Documents. SIGIR'06
[5]
Chen, Z., & Li, T. Addressing Diverse User Preferences in SQL-Query-Result Navigation. SIGMOD 2007
[6]
Clarke, C. L., Kolla, M., Cormack, G. V., Vechtomova, O., Ashkan, A., Büttcher, S., MacKinnon, I. Novelty and Diversity in Information Retrieval Evaluation. SIGIR 2008.
[7]
Clough, P., Sanderson, M., Abouammoh, M., Navarro, S., Paramita, M.: Multiple Approaches to Analysing Query Diversity. In Proceedings of SIGIR2009.
[8]
Demidova, E., Zhou, X. & Nejdl, W. IQP: Incremental Query Construction, a Probabilistic Approach. In ICDE 2010.
[9]
Gollapudi, S., Sharma, A. An Axiomatic Approach for Result Diversification. In Proceedings of WWW 2009.
[10]
Hearst, M. A. Clustering versus Faceted Categories for Information Exploration. Commun, ACM 49, April 2006.
[11]
Hristidis, V., Gravano, L., Papakonstantinou, Y. Efficient IR-Style Keyword Search over Relational Databases. VLDB 03.
[12]
Järvelin, K., & Kekäläinen, J. Cumulated Gain-based Evaluation of IR Techniques. ACM Trans. Inf. Syst., 2002.
[13]
Kandogan, E., Krishnamurthy, R., Raghavan, S., Vaithyanathan, S., & Zhu, H. Avatar Semantic Search: A Database Approach to Information retrieval. SIGMOD 2006.
[14]
Liu, B., & Jagadish, H. V. Using Trees to Depict a Forest. In Proceedings of the VLDB 2009.
[15]
Manning, C. D., Raghavan, P. and Schütze, H. Introduction to Information Retrieval, Cambridge University Press. 2008.
[16]
Tata, S., & Lohman, G. M. SQAK: doing more with keywords. In Proceedings of the SIGMOD 2008.
[17]
Tran, T., Cimiano, P., Rudolph, S., & Studer, R. Ontology-based Interpretation of Keywords for Semantic Search. In Proceedings of the ISWC 2007.
[18]
vanLeuken, R., Pueyo, L., Olivares, X., & Zwol, R. Visual Diversification of Image Search Results. WWW 2009.
[19]
Vee, E., Srivastava, U., & Shanmugasund, J. Efficient Computation of Diverse Query Results. ICDE 2008.
[20]
Wang, J., & Zhu, J. Portfolio Theory of Information Retrieval. In Proceedings of the SIGIR 2009.
[21]
Zhou, Q., Wang, C., Xiong, M., Wang, H., Yu, Y. SPARK: Adapting Keyword Query to Semantic Search. ISWC 2007.

Cited By

View all
  • (2024)Scalable Diversified Top-k Pattern Matching in Big GraphsBig Data Research10.1016/j.bdr.2024.10046436:COnline publication date: 28-May-2024
  • (2023)Extracting Top-$k$ Frequent and Diversified Patterns in Knowledge GraphsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3233594(1-18)Online publication date: 2023
  • (2023)Keyword-based Socially Tenuous Group Queries2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00079(965-977)Online publication date: Apr-2023
  • Show More Cited By

Index Terms

  1. DivQ: diversification for keyword search over structured databases

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
    July 2010
    944 pages
    ISBN:9781450301534
    DOI:10.1145/1835449
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 July 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. diversity
    2. query intent
    3. ranking in databases

    Qualifiers

    • Research-article

    Conference

    SIGIR '10
    Sponsor:

    Acceptance Rates

    SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 24 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Scalable Diversified Top-k Pattern Matching in Big GraphsBig Data Research10.1016/j.bdr.2024.10046436:COnline publication date: 28-May-2024
    • (2023)Extracting Top-$k$ Frequent and Diversified Patterns in Knowledge GraphsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3233594(1-18)Online publication date: 2023
    • (2023)Keyword-based Socially Tenuous Group Queries2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00079(965-977)Online publication date: Apr-2023
    • (2021)Mining Diversified Top-r Lasting Cohesive Subgraphs on Temporal NetworksIEEE Transactions on Big Data10.1109/TBDATA.2021.3058294(1-1)Online publication date: 2021
    • (2021)Natural language query formalization to SPARQL for querying knowledge bases using RasaProgress in Artificial Intelligence10.1007/s13748-021-00271-111:3(193-206)Online publication date: 11-Dec-2021
    • (2020)DivDBProceedings of the VLDB Endowment10.14778/3402755.34027794:12(1395-1398)Online publication date: 3-Jun-2020
    • (2020)BROADProceedings of the VLDB Endowment10.14778/3402755.34027694:12(1355-1358)Online publication date: 3-Jun-2020
    • (2020)Diversified spatial keyword search on RDF dataThe VLDB Journal10.1007/s00778-020-00610-z29:5(1171-1189)Online publication date: 12-Mar-2020
    • (2020)Supervised Learning Methods for Diversification of Image Search ResultsAdvances in Information Retrieval10.1007/978-3-030-45442-5_20(158-165)Online publication date: 14-Apr-2020
    • (2019)Diversified top-k search with relaxed graph simulationSocial Network Analysis and Mining10.1007/s13278-019-0599-19:1Online publication date: 27-Sep-2019
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media