Nothing Special   »   [go: up one dir, main page]

Skip to main content
Log in

Learning contextual representations for entity retrieval

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

In this paper, we introduce Contextual Entity Ranking (CoER) for the task of entity retrieval. CoER utilizes a textual knowledge graph to learn entities’ representations that are contextualized based on a given query. With these contextual representations and the query, CoER includes a set of models that learn to rank relevant entities. The introduced ranking models measure semantic relevance between entities’ contextual representations and the textual query, between entities’ contextual representations along with entities’ non-contextual and general descriptions and the textual query, and finally, between entities’ contextual representations and their relevance to the entities in the given query. We empirically illustrate that CoER is effective in retrieving and ranking entities across different benchmark datasets compared with state-of-the-art models. We also report ablation studies that investigate the impact of the contextual representation model and the ranking models on the final performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability and access

Code and results available at https://github.com/ParastooSJ/CER

Notes

  1. Code and results available at https://github.com/ParastooSJ/CER

  2. https://github.com/informagi/GEEER

  3. https://github.com/teanalab/kewer

References

  1. Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z (2007) Dbpedia: A nucleus for a web of open data. In: The semantic web, pages 722–735. Springer

  2. Balog K, Bron M, De Rijke M (2011) Query modeling for entity search based on terms, categories, and examples. ACM Transactions on Information Systems (TOIS) 29(4):1–31

    Article  Google Scholar 

  3. Blanco R, Mika P, Vigna S (2011) Effective and efficient entity search in rdf data. In: International semantic web conference, pages 83–97. Springer

  4. Bollacker K, Evans C, Paritosh P, Sturge T, Taylor J (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pages 1247–1250

  5. Chatterjee S, Dietz L (2021) Entity retrieval using fine-grained entity aspects. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1662–1666

  6. Chatterjee S, Dietz L (2022) Bert-er: Query-specific bert entity representations for entity ranking

  7. Chen J, Xiong C, Callan J (2016) An empirical study of learning to rank for entity search. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pages 737–740

  8. Daza D, Cochez M, Groth P (2021) Inductive entity representations from text via link prediction. Proceedings of the Web Conference 2021:798–808

    Google Scholar 

  9. De Cao N, Izacard G, Riedel S, Petroni F (2020) Autoregressive entity retrieval. In: International Conference on Learning Representations

  10. Dietz L (2019) Ent rank: Retrieving entities for topical information needs through entity-neighbor-text relations. In: Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval, pages 215–224

  11. Dietz L, Foley J (2019) Trec car y3: Complex answer retrieval overview. In: Proceedings of Text REtrieval Conference (TREC)

  12. Dietz L, Verma M, Radlinski F, Craswell N (2017) Trec complex answer retrieval overview. In: TREC

  13. Dong L, Wei F, Zhou M, Xu K (2015) Question answering over freebase with multi-column convolutional neural networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 260–269

  14. Ferragina P, Scaiella U (2010) Tagme: on-the-fly annotation of short text fragments (by wikipedia entities). In: Proceedings of the 19th ACM international conference on Information and knowledge management, pages 1625–1628

  15. Gerritse EJ, Hasibi F, Vries APd (2020) Graph-embedding empowered entity retrieval. In: European Conference on Information Retrieval, pages 97–110. Springer

  16. Gerritse EJ, Hasibi F, de Vries AP (2022) Entity-aware transformers for entity search. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1455–1465

  17. Guo J, Xu G, Cheng X, Li H (2009) Named entity recognition in query. In: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pages 267–274

  18. Hasibi F, Balog K, Bratsberg SE (2016) Exploiting entity linking in queries for entity retrieval. In: Proceedings of the 2016 acm international conference on the theory of information retrieval, pages 209–218

  19. Hasibi F, Nikolaev F, Xiong C, Balog K, Bratsberg SE, Kotov A, Callan J (2017) Dbpedia-entity v2: a test collection for entity search. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1265–1268

  20. Hu X, Duan J, Dang D (2021) Natural language question answering over knowledge graph: the marriage of sparql query and keyword search. Knowl Inf Syst 63(4):819–844

    Article  Google Scholar 

  21. Jameel S, Bouraoui Z, Schockaert S (2017) Member: Max-margin based embeddings for entity retrieval. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 783–792

  22. Kenton JDM-WC, Toutanova LK (2019) Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of naacL-HLT, volume 1, page 2

  23. Liu X, Zheng W, Fang H (2013) An exploration of ranking models and feedback method for related entity finding. Inf Process Manag 49(5):995–1007

    Article  Google Scholar 

  24. Lo P-C, Lim E-P (2023) Contextual path retrieval: A contextual entity relation embedding-based approach. ACM Trans Inf Syst 41(1):1–38

    Article  Google Scholar 

  25. Maheshwari G, Trivedi P, Lukovnikov D, Chakraborty N, Fischer A, Lehmann J (2019) Learning to rank query graphs for complex question answering over knowledge graphs. In: International semantic web conference, pages 487–504. Springer

  26. Meij E, Bron M, Hollink L, Huurnink B, de Rijke M (2011) Mapping queries to the linking open data cloud: A case study using dbpedia. Journal of Web Semantics 9(4):418–433

    Article  Google Scholar 

  27. Meij E, Balog K, Odijk D (2014) Entity linking and retrieval for semantic search. WSDM 10:2556195–2556201

    Google Scholar 

  28. Metzler D, Croft WB (2005) A markov random field model for term dependencies. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pages 472–479

  29. Neumayer R, Balog K, Nørvåg K (2012) On the modeling of entities for ad-hoc entity search in the web of data. In: European Conference on Information Retrieval, pages 133–145. Springer

  30. Nikolaev F, Kotov A (2020) Joint word and entity embeddings for entity retrieval from a knowledge graph. In: European Conference on Information Retrieval, pages 141–155. Springer

  31. Nikolaev F, Kotov A, Zhiltsov N (2016) Parameterized fielded term dependence models for ad-hoc entity retrieval from knowledge graph. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pages 435–444

  32. Noraset T, Lowphansirikul L, Tuarob S (2021) Wabiqa: A wikipedia-based thai question-answering system. Information Process Manag 58(1):102431

    Article  Google Scholar 

  33. Nozza D, Manchanda P, Fersini E, Palmonari M, Messina E (2021) Learningtoadapt with word embeddings: Domain adaptation of named entity recognition systems. Inf Process Manag 58(3):102537

    Article  Google Scholar 

  34. Raviv H, Carmel D, Kurland O (2012) A ranking framework for entity oriented search using markov random fields. In: Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search, pages 1–6

  35. Robertson SE, Walker S, Jones S, Hancock-Beaulieu MM, Gatford M et al (1995) Okapi at trec-3. Nist Special Publication Sp 109:109

    Google Scholar 

  36. Sarwar SM, Foley J, Allan J (2018) Term relevance feedback for contextual named entity retrieval. In: Proceedings of the 2018 Conference on Human Information Interaction & Retrieval, pages 301–304

  37. Tonon A, Demartini G, Cudré-Mauroux P (2012) Combining inverted indices and structured search for ad-hoc object retrieval. In: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, pages 125–134

  38. Vechtomova O, Robertson SE (2012) A domain-independent approach to finding related entities. Inf Process Manag 48(4):654–670

    Article  Google Scholar 

  39. Vrandečić D (2012) Wikidata: A new platform for collaborative data collection. In: Proceedings of the 21st international conference on world wide web, pages 1063–1064

  40. Wannous R (2014) Computational inference of conceptual trajectory model: considering domain temporal and spatial dimensions. PhD thesis, Université de La Rochelle

  41. Xiong C, Power R, Callan J (2017) Explicit semantic ranking for academic search via knowledge graph embedding. In: Proceedings of the 26th international conference on world wide web, pages 1271–1279

  42. Xiong H, Wang S, Tang M, Wang L, Lin X (2021) Knowledge graph question answering with semantic oriented fusion model. Knowledge-Based Systems 221:106954

    Article  Google Scholar 

  43. Zhao C, Xiong C, Qian X, Boyd-Graber J (2020) Complex factoid question answering with a free-text knowledge graph. In: Proceedings of The Web Conference 2020, pages 1205–1216, 2020

  44. Zheng W, Yu JX, Zou L, Cheng H (2018) Question answering over knowledge graphs: question understanding via template decomposition. Proceedings of the VLDB Endowment 11(11):1373–1386

    Article  Google Scholar 

  45. Zhiltsov N, Kotov A, Nikolaev F (2015) Fielded sequential dependence model for ad-hoc entity retrieval in the web of data. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 253–262

Download references

Funding

The research leading to these results received funding from Natural Sciences and Engineering Research Council of Canada, under Grant No 140869.

Author information

Authors and Affiliations

Authors

Contributions

Parastoo Jafarzadeh: Conceptualization, Implementation, Investigation, and Writing. Zahra Amirmahani: Implementation. Faezeh Ensan: Conceptualization, Investigation, Writing, Supervision.

Corresponding author

Correspondence to Faezeh Ensan.

Ethics declarations

Conflict of interest/Competing interests

(Check journal-specific guidelines for which heading to use): The authors have no competing interests as defined by Springer, or other interests that might be perceived to influence the results and/or discussion reported in this paper.

Ethical and informed consent for data used

The data that has been used is from publicly available sources: Wikipedia and DBpedia and publicly available datasets: DBpedia-Entity V2

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jafarzadeh, P., Amirmahani, Z. & Ensan, F. Learning contextual representations for entity retrieval. Appl Intell 54, 8820–8840 (2024). https://doi.org/10.1007/s10489-024-05430-0

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-024-05430-0

Keywords

Navigation