Learning contextual representations for entity retrieval

Parastoo Jafarzadeh¹,
Zahra Amirmahani² &
Faezeh Ensan¹

152 Accesses
Explore all metrics

Abstract

In this paper, we introduce Contextual Entity Ranking (CoER) for the task of entity retrieval. CoER utilizes a textual knowledge graph to learn entities’ representations that are contextualized based on a given query. With these contextual representations and the query, CoER includes a set of models that learn to rank relevant entities. The introduced ranking models measure semantic relevance between entities’ contextual representations and the textual query, between entities’ contextual representations along with entities’ non-contextual and general descriptions and the textual query, and finally, between entities’ contextual representations and their relevance to the entities in the given query. We empirically illustrate that CoER is effective in retrieving and ranking entities across different benchmark datasets compared with state-of-the-art models. We also report ablation studies that investigate the impact of the contextual representation model and the ranking models on the final performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

LaQuE: Enabling Entity Search at Scale

DREQ: Document Re-ranking Using Entity-Based Query Understanding

NERank: Bringing Order to Named Entities from Texts

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availability and access

Code and results available at https://github.com/ParastooSJ/CER

Notes

Code and results available at https://github.com/ParastooSJ/CER
https://github.com/informagi/GEEER
https://github.com/teanalab/kewer

References

Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z (2007) Dbpedia: A nucleus for a web of open data. In: The semantic web, pages 722–735. Springer
Balog K, Bron M, De Rijke M (2011) Query modeling for entity search based on terms, categories, and examples. ACM Transactions on Information Systems (TOIS) 29(4):1–31
Article Google Scholar
Blanco R, Mika P, Vigna S (2011) Effective and efficient entity search in rdf data. In: International semantic web conference, pages 83–97. Springer
Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 1247–1250
Chatterjee S, Dietz L (2021) Entity retrieval using fine-grained entity aspects. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1662–1666
Chatterjee S, Dietz L (2022) Bert-er: Query-specific bert entity representations for entity ranking
Chen J, Xiong C, Callan J (2016) An empirical study of learning to rank for entity search. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pages 737–740
Daza D, Cochez M, Groth P (2021) Inductive entity representations from text via link prediction. Proceedings of the Web Conference 2021:798–808
Google Scholar
De Cao N, Izacard G, Riedel S, Petroni F (2020) Autoregressive entity retrieval. In: International Conference on Learning Representations
Dietz L (2019) Ent rank: Retrieving entities for topical information needs through entity-neighbor-text relations. In: Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval, pages 215–224
Dietz L, Foley J (2019) Trec car y3: Complex answer retrieval overview. In: Proceedings of Text REtrieval Conference (TREC)
Dietz L, Verma M, Radlinski F, Craswell N (2017) Trec complex answer retrieval overview. In: TREC
Dong L, Wei F, Zhou M, Xu K (2015) Question answering over freebase with multi-column convolutional neural networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 260–269
Ferragina P, Scaiella U (2010) Tagme: on-the-fly annotation of short text fragments (by wikipedia entities). In: Proceedings of the 19th ACM international conference on Information and knowledge management, pages 1625–1628
Gerritse EJ, Hasibi F, Vries APd (2020) Graph-embedding empowered entity retrieval. In: European Conference on Information Retrieval, pages 97–110. Springer
Gerritse EJ, Hasibi F, de Vries AP (2022) Entity-aware transformers for entity search. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1455–1465
Guo J, Xu G, Cheng X, Li H (2009) Named entity recognition in query. In: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pages 267–274
Hasibi F, Balog K, Bratsberg SE (2016) Exploiting entity linking in queries for entity retrieval. In: Proceedings of the 2016 acm international conference on the theory of information retrieval, pages 209–218
Hasibi F, Nikolaev F, Xiong C, Balog K, Bratsberg SE, Kotov A, Callan J (2017) Dbpedia-entity v2: a test collection for entity search. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1265–1268
Hu X, Duan J, Dang D (2021) Natural language question answering over knowledge graph: the marriage of sparql query and keyword search. Knowl Inf Syst 63(4):819–844
Article Google Scholar
Jameel S, Bouraoui Z, Schockaert S (2017) Member: Max-margin based embeddings for entity retrieval. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 783–792
Kenton JDM-WC, Toutanova LK (2019) Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of naacL-HLT, volume 1, page 2
Liu X, Zheng W, Fang H (2013) An exploration of ranking models and feedback method for related entity finding. Inf Process Manag 49(5):995–1007
Article Google Scholar
Lo P-C, Lim E-P (2023) Contextual path retrieval: A contextual entity relation embedding-based approach. ACM Trans Inf Syst 41(1):1–38
Article Google Scholar
Maheshwari G, Trivedi P, Lukovnikov D, Chakraborty N, Fischer A, Lehmann J (2019) Learning to rank query graphs for complex question answering over knowledge graphs. In: International semantic web conference, pages 487–504. Springer
Meij E, Bron M, Hollink L, Huurnink B, de Rijke M (2011) Mapping queries to the linking open data cloud: A case study using dbpedia. Journal of Web Semantics 9(4):418–433
Article Google Scholar
Meij E, Balog K, Odijk D (2014) Entity linking and retrieval for semantic search. WSDM 10:2556195–2556201
Google Scholar
Metzler D, Croft WB (2005) A markov random field model for term dependencies. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 472–479
Neumayer R, Balog K, Nørvåg K (2012) On the modeling of entities for ad-hoc entity search in the web of data. In: European Conference on Information Retrieval, pages 133–145. Springer
Nikolaev F, Kotov A (2020) Joint word and entity embeddings for entity retrieval from a knowledge graph. In: European Conference on Information Retrieval, pages 141–155. Springer
Nikolaev F, Kotov A, Zhiltsov N (2016) Parameterized fielded term dependence models for ad-hoc entity retrieval from knowledge graph. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pages 435–444
Noraset T, Lowphansirikul L, Tuarob S (2021) Wabiqa: A wikipedia-based thai question-answering system. Information Process Manag 58(1):102431
Article Google Scholar
Nozza D, Manchanda P, Fersini E, Palmonari M, Messina E (2021) Learningtoadapt with word embeddings: Domain adaptation of named entity recognition systems. Inf Process Manag 58(3):102537
Article Google Scholar
Raviv H, Carmel D, Kurland O (2012) A ranking framework for entity oriented search using markov random fields. In: Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search, pages 1–6
Robertson SE, Walker S, Jones S, Hancock-Beaulieu MM, Gatford M et al (1995) Okapi at trec-3. Nist Special Publication Sp 109:109
Google Scholar
Sarwar SM, Foley J, Allan J (2018) Term relevance feedback for contextual named entity retrieval. In: Proceedings of the 2018 Conference on Human Information Interaction & Retrieval, pages 301–304
Tonon A, Demartini G, Cudré-Mauroux P (2012) Combining inverted indices and structured search for ad-hoc object retrieval. In: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, pages 125–134
Vechtomova O, Robertson SE (2012) A domain-independent approach to finding related entities. Inf Process Manag 48(4):654–670
Article Google Scholar
Vrandečić D (2012) Wikidata: A new platform for collaborative data collection. In: Proceedings of the 21st international conference on world wide web, pages 1063–1064
Wannous R (2014) Computational inference of conceptual trajectory model: considering domain temporal and spatial dimensions. PhD thesis, Université de La Rochelle
Xiong C, Power R, Callan J (2017) Explicit semantic ranking for academic search via knowledge graph embedding. In: Proceedings of the 26th international conference on world wide web, pages 1271–1279
Xiong H, Wang S, Tang M, Wang L, Lin X (2021) Knowledge graph question answering with semantic oriented fusion model. Knowledge-Based Systems 221:106954
Article Google Scholar
Zhao C, Xiong C, Qian X, Boyd-Graber J (2020) Complex factoid question answering with a free-text knowledge graph. In: Proceedings of The Web Conference 2020, pages 1205–1216, 2020
Zheng W, Yu JX, Zou L, Cheng H (2018) Question answering over knowledge graphs: question understanding via template decomposition. Proceedings of the VLDB Endowment 11(11):1373–1386
Article Google Scholar
Zhiltsov N, Kotov A, Nikolaev F (2015) Fielded sequential dependence model for ad-hoc entity retrieval in the web of data. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 253–262

Download references

Funding

The research leading to these results received funding from Natural Sciences and Engineering Research Council of Canada, under Grant No 140869.

Author information

Authors and Affiliations

Department of Electrical, Computer, and Biomedical Engineering, Toronto Metropolitan University, Toronto, Canada
Parastoo Jafarzadeh & Faezeh Ensan
Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
Zahra Amirmahani

Authors

Parastoo Jafarzadeh
View author publications
You can also search for this author in PubMed Google Scholar
Zahra Amirmahani
View author publications
You can also search for this author in PubMed Google Scholar
Faezeh Ensan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Parastoo Jafarzadeh: Conceptualization, Implementation, Investigation, and Writing. Zahra Amirmahani: Implementation. Faezeh Ensan: Conceptualization, Investigation, Writing, Supervision.

Corresponding author

Correspondence to Faezeh Ensan.

Ethics declarations

Conflict of interest/Competing interests

(Check journal-specific guidelines for which heading to use): The authors have no competing interests as defined by Springer, or other interests that might be perceived to influence the results and/or discussion reported in this paper.

Ethical and informed consent for data used

The data that has been used is from publicly available sources: Wikipedia and DBpedia and publicly available datasets: DBpedia-Entity V2

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Jafarzadeh, P., Amirmahani, Z. & Ensan, F. Learning contextual representations for entity retrieval. Appl Intell 54, 8820–8840 (2024). https://doi.org/10.1007/s10489-024-05430-0

Download citation

Accepted: 27 March 2024
Published: 04 July 2024
Issue Date: October 2024
DOI: https://doi.org/10.1007/s10489-024-05430-0

Learning contextual representations for entity retrieval

Abstract

Access this article

Subscribe and save

Buy Now