research-article

Leveraging Knowledge Bases for Contextual Entity Exploration

Authors:

Yuanhua Lv

KDD '15: Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Pages 1949 - 1958

https://doi.org/10.1145/2783258.2788564

Published: 10 August 2015

Abstract

Users today constantly switch back and forth between applications where they consume or create content (such as e-books and productivity suites like Microsoft Office and Google Docs) and search engines where they satisfy their information needs. Unfortunately, this leads to a suboptimal user experience, because the search engine lacks any knowledge about the content that the user is authoring or consuming in the application. As a result, productivity suites are starting to incorporate features that let users "explore while they work". Existing work in the literature that can be applied to this problem takes a standard bag-of-words information retrieval approach: it automatically creates a query that includes not only the target phrase or entity chosen by the user but also relevant terms from the context. While these approaches have been successful, they are inherently limited to returning results (documents) that have a syntactic match with the keywords in the query.
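To make the bag-of-words baseline concrete, here is a minimal sketch of contextual query construction. It is not taken from the paper: the function names, the tiny stopword list, and the frequency-based term selection are all assumptions chosen for illustration. It also shows the syntactic-match limitation, since a document that says "stream" does not match a query containing "river".

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = frozenset({
    "the", "a", "an", "of", "and", "in", "to", "is", "was", "on",
    "for", "by", "with", "that", "it", "as", "at",
})


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_contextual_query(target, context, num_terms=3):
    """Return the target phrase plus the most frequent context terms.

    Term selection by raw frequency is an illustrative stand-in for
    whatever weighting a real system would use.
    """
    target_tokens = set(tokenize(target))
    counts = Counter(
        tok for tok in tokenize(context)
        if tok not in STOPWORDS and tok not in target_tokens
    )
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [target] + [term for term, _ in ranked[:num_terms]]


def matches(query_terms, document):
    """Purely syntactic match: true if any query token occurs in the document."""
    doc_tokens = set(tokenize(document))
    return any(tok in doc_tokens for term in query_terms for tok in tokenize(term))


if __name__ == "__main__":
    context = ("Lewis and Clark explored the river. "
               "The river expedition followed the river west.")
    query = build_contextual_query("Lewis", context, num_terms=2)
    print(query)
    print(matches(query, "The river bends north"))
    print(matches(["river"], "A stream ran west"))
```

In this sketch the query for the target "Lewis" picks up "river" and "clark" from the context, yet a document about a "stream" is missed because no keyword matches literally.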

We argue that the limitations of these approaches can be overcome by leveraging semantic signals from a knowledge graph built from knowledge bases such as Wikipedia. We present a system called Lewis that leverages a knowledge graph to retrieve contextually relevant entity results, and we perform a large-scale crowdsourcing experiment in an e-reader scenario, which shows that Lewis can outperform state-of-the-art contextual entity recommendation systems by more than 20% in terms of the mean average precision (MAP) score.
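For readers unfamiliar with the evaluation metric, the following is a minimal sketch of how mean average precision (MAP) is commonly computed over ranked result lists with binary relevance judgments. The function names and the toy data are assumptions for illustration; the paper's actual evaluation setup is not described on this page.

```python
def average_precision(ranked, relevant):
    """Average precision of one ranked list against a set of relevant items."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            # Precision at this rank, counted only at relevant positions.
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """Mean of average precision over (ranked_list, relevant_set) pairs."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


if __name__ == "__main__":
    # Toy judgments, invented for this example.
    runs = [
        (["a", "b", "c"], {"a", "c"}),
        (["b", "a"], {"a"}),
    ]
    print(round(mean_average_precision(runs), 4))
```

A relative improvement such as "more than 20%" would then compare the MAP of two systems evaluated on the same set of queries and judgments.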

Supplementary Material

M4V File (p1949.m4v), 2885.37 MB

Index Terms

Leveraging Knowledge Bases for Contextual Entity Exploration
1. Information systems
  1. Information systems applications

Information

Published In

KDD '15: Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 2015

2378 pages

ISBN: 9781450336642

DOI: 10.1145/2783258

General Chairs: Longbing Cao (University of Technology, Sydney), Chengqi Zhang (University of Technology, Sydney)

Program Chairs: Thorsten Joachims (Cornell University), Geoff Webb (Monash University), Dragos D. Margineantu (Boeing Research), Graham Williams (Australian Taxation Office)

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

KDD '15

KDD '15: The 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 10 - 13, 2015

Sydney, NSW, Australia

Acceptance Rates

KDD '15 Paper Acceptance Rate: 160 of 819 submissions, 20%

Overall Acceptance Rate: 1,133 of 8,635 submissions, 13%

Bibliometrics

Total Citations: 4
Total Downloads: 624
Downloads (Last 12 months): 5
Downloads (Last 6 weeks): 0

Reflects downloads up to 18 Nov 2024

Cited By

Saeidi M, Milios E, Zeh N (2021). Graph Representation Learning in Document Wikification. Document Analysis and Recognition – ICDAR 2021 Workshops, pp. 509-524. Online publication date: 2-Sep-2021.
https://doi.org/10.1007/978-3-030-86159-9_37

Balog K (2018). Utilizing Entities for an Enhanced Search Experience. Entity-Oriented Search, pp. 299-336. Online publication date: 3-Oct-2018.
https://doi.org/10.1007/978-3-319-93935-3_9

Schmidt A, Hoffart J, Milchevski D, Weikum G, Perego R, Sebastiani F, Aslam J, Ruthven I, Zobel J (2016). Context-Sensitive Auto-Completion for Searching with Entities and Categories. Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pp. 1097-1100. Online publication date: 7-Jul-2016.
https://dl.acm.org/doi/10.1145/2911451.2911461

Lv Y, Fuxman A, Baeza-Yates R, Lalmas M, Moffat A, Ribeiro-Neto B (2015). In Situ Insights. Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 655-664. Online publication date: 9-Aug-2015.
https://dl.acm.org/doi/10.1145/2766462.2767696
