Abstract
In this paper, we introduce Contextual Entity Ranking (CoER) for the task of entity retrieval. CoER utilizes a textual knowledge graph to learn entities’ representations that are contextualized based on a given query. With these contextual representations and the query, CoER includes a set of models that learn to rank relevant entities. The introduced ranking models measure semantic relevance between entities’ contextual representations and the textual query, between entities’ contextual representations along with entities’ non-contextual and general descriptions and the textual query, and finally, between entities’ contextual representations and their relevance to the entities in the given query. We empirically illustrate that CoER is effective in retrieving and ranking entities across different benchmark datasets compared with state-of-the-art models. We also report ablation studies that investigate the impact of the contextual representation model and the ranking models on the final performance.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability and access
Code and results available at https://github.com/ParastooSJ/CER
Notes
Code and results available at https://github.com/ParastooSJ/CER
References
Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z (2007) Dbpedia: A nucleus for a web of open data. In: The semantic web, pages 722–735. Springer
Balog K, Bron M, De Rijke M (2011) Query modeling for entity search based on terms, categories, and examples. ACM Transactions on Information Systems (TOIS) 29(4):1–31
Blanco R, Mika P, Vigna S (2011) Effective and efficient entity search in rdf data. In: International semantic web conference, pages 83–97. Springer
Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 1247–1250
Chatterjee S, Dietz L (2021) Entity retrieval using fine-grained entity aspects. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1662–1666
Chatterjee S, Dietz L (2022) Bert-er: Query-specific bert entity representations for entity ranking
Chen J, Xiong C, Callan J (2016) An empirical study of learning to rank for entity search. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pages 737–740
Daza D, Cochez M, Groth P (2021) Inductive entity representations from text via link prediction. Proceedings of the Web Conference 2021:798–808
De Cao N, Izacard G, Riedel S, Petroni F (2020) Autoregressive entity retrieval. In: International Conference on Learning Representations
Dietz L (2019) Ent rank: Retrieving entities for topical information needs through entity-neighbor-text relations. In: Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval, pages 215–224
Dietz L, Foley J (2019) Trec car y3: Complex answer retrieval overview. In: Proceedings of Text REtrieval Conference (TREC)
Dietz L, Verma M, Radlinski F, Craswell N (2017) Trec complex answer retrieval overview. In: TREC
Dong L, Wei F, Zhou M, Xu K (2015) Question answering over freebase with multi-column convolutional neural networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 260–269
Ferragina P, Scaiella U (2010) Tagme: on-the-fly annotation of short text fragments (by wikipedia entities). In: Proceedings of the 19th ACM international conference on Information and knowledge management, pages 1625–1628
Gerritse EJ, Hasibi F, Vries APd (2020) Graph-embedding empowered entity retrieval. In: European Conference on Information Retrieval, pages 97–110. Springer
Gerritse EJ, Hasibi F, de Vries AP (2022) Entity-aware transformers for entity search. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1455–1465
Guo J, Xu G, Cheng X, Li H (2009) Named entity recognition in query. In: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pages 267–274
Hasibi F, Balog K, Bratsberg SE (2016) Exploiting entity linking in queries for entity retrieval. In: Proceedings of the 2016 acm international conference on the theory of information retrieval, pages 209–218
Hasibi F, Nikolaev F, Xiong C, Balog K, Bratsberg SE, Kotov A, Callan J (2017) Dbpedia-entity v2: a test collection for entity search. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1265–1268
Hu X, Duan J, Dang D (2021) Natural language question answering over knowledge graph: the marriage of sparql query and keyword search. Knowl Inf Syst 63(4):819–844
Jameel S, Bouraoui Z, Schockaert S (2017) Member: Max-margin based embeddings for entity retrieval. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 783–792
Kenton JDM-WC, Toutanova LK (2019) Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of naacL-HLT, volume 1, page 2
Liu X, Zheng W, Fang H (2013) An exploration of ranking models and feedback method for related entity finding. Inf Process Manag 49(5):995–1007
Lo P-C, Lim E-P (2023) Contextual path retrieval: A contextual entity relation embedding-based approach. ACM Trans Inf Syst 41(1):1–38
Maheshwari G, Trivedi P, Lukovnikov D, Chakraborty N, Fischer A, Lehmann J (2019) Learning to rank query graphs for complex question answering over knowledge graphs. In: International semantic web conference, pages 487–504. Springer
Meij E, Bron M, Hollink L, Huurnink B, de Rijke M (2011) Mapping queries to the linking open data cloud: A case study using dbpedia. Journal of Web Semantics 9(4):418–433
Meij E, Balog K, Odijk D (2014) Entity linking and retrieval for semantic search. WSDM 10:2556195–2556201
Metzler D, Croft WB (2005) A markov random field model for term dependencies. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 472–479
Neumayer R, Balog K, Nørvåg K (2012) On the modeling of entities for ad-hoc entity search in the web of data. In: European Conference on Information Retrieval, pages 133–145. Springer
Nikolaev F, Kotov A (2020) Joint word and entity embeddings for entity retrieval from a knowledge graph. In: European Conference on Information Retrieval, pages 141–155. Springer
Nikolaev F, Kotov A, Zhiltsov N (2016) Parameterized fielded term dependence models for ad-hoc entity retrieval from knowledge graph. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pages 435–444
Noraset T, Lowphansirikul L, Tuarob S (2021) Wabiqa: A wikipedia-based thai question-answering system. Information Process Manag 58(1):102431
Nozza D, Manchanda P, Fersini E, Palmonari M, Messina E (2021) Learningtoadapt with word embeddings: Domain adaptation of named entity recognition systems. Inf Process Manag 58(3):102537
Raviv H, Carmel D, Kurland O (2012) A ranking framework for entity oriented search using markov random fields. In: Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search, pages 1–6
Robertson SE, Walker S, Jones S, Hancock-Beaulieu MM, Gatford M et al (1995) Okapi at trec-3. Nist Special Publication Sp 109:109
Sarwar SM, Foley J, Allan J (2018) Term relevance feedback for contextual named entity retrieval. In: Proceedings of the 2018 Conference on Human Information Interaction & Retrieval, pages 301–304
Tonon A, Demartini G, Cudré-Mauroux P (2012) Combining inverted indices and structured search for ad-hoc object retrieval. In: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, pages 125–134
Vechtomova O, Robertson SE (2012) A domain-independent approach to finding related entities. Inf Process Manag 48(4):654–670
Vrandečić D (2012) Wikidata: A new platform for collaborative data collection. In: Proceedings of the 21st international conference on world wide web, pages 1063–1064
Wannous R (2014) Computational inference of conceptual trajectory model: considering domain temporal and spatial dimensions. PhD thesis, Université de La Rochelle
Xiong C, Power R, Callan J (2017) Explicit semantic ranking for academic search via knowledge graph embedding. In: Proceedings of the 26th international conference on world wide web, pages 1271–1279
Xiong H, Wang S, Tang M, Wang L, Lin X (2021) Knowledge graph question answering with semantic oriented fusion model. Knowledge-Based Systems 221:106954
Zhao C, Xiong C, Qian X, Boyd-Graber J (2020) Complex factoid question answering with a free-text knowledge graph. In: Proceedings of The Web Conference 2020, pages 1205–1216, 2020
Zheng W, Yu JX, Zou L, Cheng H (2018) Question answering over knowledge graphs: question understanding via template decomposition. Proceedings of the VLDB Endowment 11(11):1373–1386
Zhiltsov N, Kotov A, Nikolaev F (2015) Fielded sequential dependence model for ad-hoc entity retrieval in the web of data. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 253–262
Funding
The research leading to these results received funding from Natural Sciences and Engineering Research Council of Canada, under Grant No 140869.
Author information
Authors and Affiliations
Contributions
Parastoo Jafarzadeh: Conceptualization, Implementation, Investigation, and Writing. Zahra Amirmahani: Implementation. Faezeh Ensan: Conceptualization, Investigation, Writing, Supervision.
Corresponding author
Ethics declarations
Conflict of interest/Competing interests
(Check journal-specific guidelines for which heading to use): The authors have no competing interests as defined by Springer, or other interests that might be perceived to influence the results and/or discussion reported in this paper.
Ethical and informed consent for data used
The data that has been used is from publicly available sources: Wikipedia and DBpedia and publicly available datasets: DBpedia-Entity V2
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Jafarzadeh, P., Amirmahani, Z. & Ensan, F. Learning contextual representations for entity retrieval. Appl Intell 54, 8820–8840 (2024). https://doi.org/10.1007/s10489-024-05430-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-024-05430-0