Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2877799.2877801guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Exploring an ontology via text similarity: an experimental study

Published: 20 October 2014 Publication History

Abstract

In this paper we consider the problem of retrieving the concepts of an ontology that are most relevant to a given textual query. In our setting the concepts are associated with textual fragments, such as labels, descriptions, and links to other relevant concepts. The main task to be solved is the definition of a similarity measure between the single text of the query and the set of texts associated with an ontology concept. We experimentally study this problem on a particular scenario with a socio-pedagogic domain ontology and Italian language texts. We investigate how the basic cosine similarity measure on the bag-of-words text representations can be improved in three distinct ways by (i) taking into account the context of the ontology nodes, (ii) using the linear combination of various measures, and (iii) exploiting semantic resources. The experimental evaluation confirms the improvement of the presented methods upon the baseline. Beside discussing some issues to consider in applying these methods, we point out some directions for further improvement.

References

[1]
Legge-quadro n. 104 per l'assistenza, l'integrazione sociale e i diritti delle persone handicappate, 5 February 1992.
[2]
Daniel Bär, Torsten Zesch, and Iryna Gurevych. Dkpro similarity: An open source framework for text similarity. In ACL, 2013.
[3]
L. Bentivogli and E. Pianta. Exploiting parallel texts in the creation of multilingual semantically annotated resources: The multisemcor corpus. Nat. Lang. Eng., 11(3):247-261, September 2005.
[4]
Alexander Budanitsky and Graeme Hirst. Evaluating WordNet-based measures of lexical semantic relatedness. Comput. Linguist., 32(1):13-47, March 2006.
[5]
Evgeniy Gabrilovich and Shaul Markovitch. Computing semantic relatedness using Wikipedia-based explicit semantic analysis. IJCAI, 2007.
[6]
Sébastien Harispe, Sylvie Ranwez, Stefan Janaqi, and Jacky Montmain. Semantic measures for the comparison of units of language, concepts or entities from text and knowledge base analysis. CoRR, abs/1310.1285, 2013.
[7]
P. Jaccard. The distribution of the flora in the alpine zone. New Phytologist, 1912.
[8]
Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani. An introduction to statistical learning. Springer, 2013.
[9]
Youngjoong Ko. A study of term weighting schemes using class information for text classification. SIGIR. ACM, 2012.
[10]
Rada Mihalcea, Courtney Corley, and Carlo Strapparava. Corpus-based and knowledge-based measures of text semantic similarity. AAAI. AAAI Press, 2006.
[11]
Amit Singhal. Modern information retrieval: A brief overview. IEEE Data Eng. Bull., 24(4):35-43, 2001.
[12]
S. K. M. Wong, Wojciech Ziarko, and Patrick C. N. Wong. Generalized vector spaces model in information retrieval. SIGIR '85. ACM, 1985.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
IESD'14: Proceedings of the 3rd International Conference on Intelligent Exploration of Semantic Data - Volume 1279
October 2014
99 pages

Publisher

CEUR-WS.org

Aachen, Germany

Publication History

Published: 20 October 2014

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media