Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2783258.2788564acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Leveraging Knowledge Bases for Contextual Entity Exploration

Published: 10 August 2015 Publication History

Abstract

Users today are constantly switching back and forth from applications where they consume or create content (such as e-books and productivity suites like Microsoft Office and Google Docs) to search engines where they satisfy their information needs. Unfortunately, though, this leads to a suboptimal user experience as the search engine lacks any knowledge about the content that the user is authoring or consuming in the application. As a result, productivity suites are starting to incorporate features that let the user "explore while they work". Existing work in the literature that can be applied to this problem takes a standard bag-of-words information retrieval approach, which consists of automatically creating a query that includes not only the target phrase or entity chosen by the user but also relevant terms from the context. While these approaches have been successful, they are inherently limited to returning results (documents) that have a syntactic match with the keywords in the query.
We argue that the limitations of these approaches can be overcome by leveraging semantic signals from a knowledge graph built from knowledge bases such as Wikipedia. We present a system called Lewis for retrieving contextually relevant entity results leveraging a knowledge graph, and perform a large scale crowdsourcing experiment in the context of an e-reader scenario, which shows that Lewis can outperform the state-of-the-art contextual entity recommendation systems by more than 20% in terms of the MAP score.

Supplementary Material

M4V File (p1949.m4v)

References

[1]
G. Adomavicius and A. Tuzhilin. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17(6):734--749, 2005.
[2]
A. Agarwal, S. Chakrabarti, and S. Aggarwal. Learning to rank networked entities. In Proc. of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006.
[3]
K. Balog, A. P. de Vries, P. Serdyukov, P. Thomas, and T. Westerveld. Overview of the trec 2009 entity track. In Proc. of the Text Retrieval Conference Working Notes, 2009.
[4]
K. Balog and H. Ramampiaro. Cumulative citation recommendation: Classification vs. ranking. In Proc. of the International ACM SIGIR Conference, 2013.
[5]
I. Bordino, Y. Mejova, and M. Lalmas. Penguins in sweaters, or serendipitous entity search on user-generated content. In Proc. of the ACM International Conference on Information Knowledge Management, 2013.
[6]
C. Buckley and S. E. Robertson. Relevance feedback track overview: Trec 2008. In Proc. of the Text Retrieval Conference, 2008.
[7]
C. Buckley, G. Salton, J. Allan, and A. Singhal. Automatic query expansion using smart: Trec 3. In Proc. of the Text Retrieval Conference, 1994.
[8]
W. Chen, W. Hsu, and M. L. Lee. Tagcloud-based explanation with feedback for recommender systems. In Proc. of the International ACM SIGIR Conference, 2013.
[9]
S. Cucerzan. Large-scale named entity disambiguation based on wikipedia data. In EMNLP-CoNLL, 2007.
[10]
J. Dalton, L. Dietz, and J. Allan. Entity query feature expansion using knowledge base links. In Proc. of the International ACM SIGIR conference on Research and Development in Information Retrieval, 2014.
[11]
L. Finkelstein, E. Gabrilovich, Y. Matias, E. Rivlin, Z. Solan, G. Wolfman, and E. Ruppin. Placing search in context: The concept revisited. In Proc. of the International World Wide Web Conference, 2001.
[12]
L. C. Freeman. A set of measures of centrality based on betweenness. Sociometry, pages 35--41, 1977.
[13]
A. Fuxman, P. Pantel, Y. Lv, A. Chandra, P. Chilakamarri, M. Gamon, D. Hamilton, B. Kohlmeier, D. Narayanan, E. Papalexakis, and B. Zhao. Contextual insights. In Proc. of the Companion Publication of the International Conference on World Wide Web Companion, 2014.
[14]
S. Gottipati and J. Jiang. Linking entities to a knowledge base with query expansion. In Proc. of the Conference on Empirical Methods in Natural Language Processing, 2011.
[15]
S. Gouws, G. Van Rooyen, and H. A. Engelbrecht. Measuring conceptual similarity by spreading activation over wikipedia's hyperlink structure. In Proc. of Workshop on The People's Web Meets NLP: Collaboratively Constructed Semantic Resources, 2010.
[16]
J. L. Herlocker, J. A. Konstan, and J. Riedl. Explaining collaborative filtering recommendations. In Proc. of the ACM Conference on Computer Supported Cooperative Work, 2000.
[17]
J. Hoffart, S. Seufert, D. B. Nguyen, M. Theobald, and G. Weikum. Kore: Keyphrase overlap relatedness for entity disambiguation. In Proc. of the ACM International Conference on Information and Knowledge Management, 2012.
[18]
G. Jeh and J. Widom. Scaling personalized web search. In Proc. of the International Conference on World Wide Web, 2003.
[19]
R. Kraft, C. C. Chang, F. Maghoul, and R. Kumar. Searching with context. In Proc. of the International World Wide Web Conference, 2006.
[20]
S. Kulkarni, A. Singh, G. Ramakrishnan, and S. Chakrabarti. Collective annotation of wikipedia entities in web text. In Proc. of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2009.
[21]
V. Lavrenko and W. B. Croft. Relevance-based language models. In Proc. of the International ACM SIGIR Conference, 2001.
[22]
S. Lee, S.-i. Song, M. Kahng, D. Lee, and S.-g. Lee. Random walk based entity ranking on graph for multidimensional recommendation. In Proc. of the ACM Conference on Recommender Systems, 2011.
[23]
Y. Lv and A. Fuxman. In situ insights. In Proc. of the International ACM SIGIR Conference, 2015.
[24]
Y. Lv, T. Moon, P. Kolari, Z. Zheng, X. Wang, and Y. Chang. Learning to model relatedness for news recommendation. In Proc. of the International World Wide Web Conference, 2011.
[25]
Y. Lv and C. Zhai. Positional relevance model for pseudo-relevance feedback. In Proc. of the International ACM SIGIR Conference, 2010.
[26]
R. Mihalcea, C. Corley, and C. Strapparava. Corpus-based and knowledge-based measures of text semantic similarity. In Proc. of the National Conference on Artificial Intelligence, 2006.
[27]
R. Mihalcea and A. Csomai. Wikify!: linking documents to encyclopedic knowledge. In Proc. of the ACM Conference on Information and Knowledge Management, 2007.
[28]
D. Milne and I. Witten. An effective, low-cost measure of semantic relatedness obtained from wikipedia links. In Proc. of AAAI Workshop on Wikipedia and Artificial Intelligence, 2008.
[29]
D. Odijk, E. Meij, and M. de Rijke. Feeding the second screen: Semantic linking based on subtitles. In Proc. of the Conference on Open Research Areas in Information Retrieval, 2013.
[30]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. 1999.
[31]
D. Petkova and W. B. Croft. Proximity-based document representation for named entity retrieval. In Proc. of the ACM Conference on Information and Knowledge Management, 2007.
[32]
B. Ribeiro-Neto, M. Cristo, P. B. Golgher, and E. Silva de Moura. Impedance coupling in content-targeted advertising. In Proc. of the International ACM SIGIR Conference, 2005.
[33]
S. Robertson and I. Soboroff. The trec 2002 filtering track report. In Proc. of the Text Retrieval Conference, 2002.
[34]
S. E. Robertson and K. S. Jones. Relevance weighting of search terms. Journal of the American Society of Information Science, 27(3):129--146, 1976.
[35]
J. J. Rocchio. Relevance feedback in information retrieval. In In The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice-Hall Inc., 1971.
[36]
M. Strube and S. P. Ponzetto. Wikirelate! computing semantic relatedness using wikipedia. In Proc. of the AAAI Conference on Artificial Intelligence.
[37]
P. Symeonidis, A. Nanopoulos, and Y. Manolopoulos. Providing justifications in recommender systems. IEEE Transactions on Systems, Man and Cybernetics, Part A, 38(6):1262--1272, 2008.
[38]
A.-M. Vercoustre, J. A. Thom, and J. Pehcevski. Entity ranking in wikipedia. In Proc. of the ACM Symposium on Applied Computing, 2008.
[39]
J. Vig, S. Sen, and J. Riedl. Tagsplanations: explaining recommendations using tags. In Proc. of the International Conference on Intelligent User Interfaces, 2009.
[40]
N. Voskarides, D. Odijk, M. Tsagkias, W. Weerkamp, and M. de Rijke. Query-dependent contextualization of streaming data. In Proc. of the European Conference on Information Retrieval, 2014.
[41]
E. Yeh, D. Ramage, C. D. Manning, E. Agirre, and A. Soroa. Wikiwalk: Random walks on wikipedia for semantic relatedness. In Proc. of the Workshop on Graph-based Methods for Natural Language Processing, 2009.
[42]
M. A. Yosef, J. Hoffart, I. Bordino, M. Spaniol, and G. Weikum. Aida: An online tool for accurate disambiguation of named entities in text and tables. Proc. of the VLDB Endowment, 4(12):1450--1453, 2011.
[43]
C. Yu, L. V. Lakshmanan, and S. Amer-Yahia. Recommendation diversification using explanations. In Proc. of the IEEE International Conference on Data Engineering, 2009.
[44]
M. Zhou and K. C.-C. Chang. Entity-centric document filtering: boosting feature mapping through meta-features. In Proc. of the ACM International Conference on Information and Knowledge Management, 2013.

Cited By

View all
  • (2021)Graph Representation Learning in Document WikificationDocument Analysis and Recognition – ICDAR 2021 Workshops10.1007/978-3-030-86159-9_37(509-524)Online publication date: 2-Sep-2021
  • (2018)Utilizing Entities for an Enhanced Search ExperienceEntity-Oriented Search10.1007/978-3-319-93935-3_9(299-336)Online publication date: 3-Oct-2018
  • (2016)Context-Sensitive Auto-Completion for Searching with Entities and CategoriesProceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval10.1145/2911451.2911461(1097-1100)Online publication date: 7-Jul-2016
  • Show More Cited By

Index Terms

  1. Leveraging Knowledge Bases for Contextual Entity Exploration

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
    August 2015
    2378 pages
    ISBN:9781450336642
    DOI:10.1145/2783258
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 August 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. context
    2. context-selection betweenness
    3. entity recommendation
    4. knowledge base
    5. semantic

    Qualifiers

    • Research-article

    Conference

    KDD '15
    Sponsor:

    Acceptance Rates

    KDD '15 Paper Acceptance Rate 160 of 819 submissions, 20%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Graph Representation Learning in Document WikificationDocument Analysis and Recognition – ICDAR 2021 Workshops10.1007/978-3-030-86159-9_37(509-524)Online publication date: 2-Sep-2021
    • (2018)Utilizing Entities for an Enhanced Search ExperienceEntity-Oriented Search10.1007/978-3-319-93935-3_9(299-336)Online publication date: 3-Oct-2018
    • (2016)Context-Sensitive Auto-Completion for Searching with Entities and CategoriesProceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval10.1145/2911451.2911461(1097-1100)Online publication date: 7-Jul-2016
    • (2015)In Situ InsightsProceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/2766462.2767696(655-664)Online publication date: 9-Aug-2015

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media