Abstract
In this paper, we present the effect of the semantic indexing using WordNet senses on the Information Retrieval (IR) and Text Categorization (TC) tasks. The documents have been sense-tagged using a Word Sense Disambiguation (WSD) system based on Specialized Hidden Markov Models (SHMMs). The preliminary results showed that a small improvement of the performance was obtained only in the TC task.
This work was supported by the Spanish Research Projects CICYT TIC2000-0664-C02 and TIC2003-07158-C04-03. We are grateful to E. Ferretti for sense-tagging the data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ferretti, E., Lafuente, J., Rosso, P.: Semantics Text Categorization using the K Nearest Neighbours Method. In: Proc. of the 1st Indian International Conference on Artificial Intelligence (2003)
Gonzalo, J., Verdejo, F., Chugur, I., Chigarrán, J.: Indexing with WordNet Synsets can improve Text Retrieval. In: Proc. of the Workshop on Usage of WordNet for NLP (1998)
Jiménez, D., Ferretti, E., Vidal, V., Rosso, P., Enguix, C.F.: The Influence of Semantics in IR using LSI and K-Means Clustering Techniques. In: Proc. of the Workshop on Conceptual Information Retrieval and Clustering of Documents, ACM Int. Conf. on Information and Communication Technologies (2003)
Jiménez, D., Vidal, V., Enguix, C.F.: A Comparison of Experiments with the Bisecting-Spherical K-Means Clustering and SVD Algorithms. In: Proc. of JOTRI (2002)
Molina, A., Pla, F., Segarra, E.: A Hidden Markov Model Approach to Word Sense Disambiguation. In: Proc. of VIII Conf. Iberoamericana de Inteligencia Artificial (IBERAMIA2), Sevilla, Spain (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rosso, P., Molina, A., Pla, F., Jiménez, D., Vidal, V. (2004). Information Retrieval and Text Categorization with Semantic Indexing. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2004. Lecture Notes in Computer Science, vol 2945. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24630-5_73
Download citation
DOI: https://doi.org/10.1007/978-3-540-24630-5_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21006-1
Online ISBN: 978-3-540-24630-5
eBook Packages: Springer Book Archive