Abstract
In this work we present a manually marked up corpus of Old Galician language (460 documents, 5,601,290 running words) and a diachronic dictionary extracted from it, as well as its potential applications, whose implementation is a topic of future work.
This work was partially funded by CICYT (TEL99-0335-C04-02)
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Brisaboa, N. R., Callón, C., López, Juan-Ramón, Places, A. S., Sanmartín, G. Stemming Galician Texts. Lecture Notes in Computer Science (LNCS 2476),. Springer-Verlag (SPIRE’2002), pp. 91–97. Lisboa, Portugal, 2002.
Gelbukh, A., Sidorov, G. Morphological Analysis of Inflective Languages Through Generation. J. Procesamiento de Lenguaje Natural, No 29, 2002.
Gelbukh, A., Sidorov, G. Approach to Construction of Automatic Morphological Analysis Systems for Inflective Languages with Little Effort. Computational Liguistics and Intelligent Text Processing. Lecture Notes in Computer Science N 2588, Springer-Verlag, 2003.
Honrado, A., Leon, R., O’Donnell, R. and Sinclair, D. A Word Stemming Algorithm for the Spanish Language. In Proceedings of the 7th International Symposium on String Processing and Information Retrieval (SPIRE’2000)-IEEE Comp. Society., pp.139–145, Espana, 2000.
Kraaij, W., Pohlmann, R. Porter’s stemming algorithm for Dutch. In L.G.M. Noordman and W.A.M. de Vroomen, editors, Informatiewetenschap 1994: Wetenschappelijke bijdragen aan de derde STINFON Conferentie, pp. 167–180, Tilburg, 1994.
López, J.R., Iglesias, E.L., Brisaboa, N.R., Paramá, J.R., Penabad, M.R. Base de datos documental para el estudio del español antiguo. In Proceedings of the X Simposio Internacional en Aplicaciones de Informática (INFONOR’97), pp. 2–8.Chile, 1997.
Moreira, V., Huyck, C. A Stemming Algorithm for the Portuguese Language. In Proceedings of the 8th International Symposium on String Processing and Information Retrieval (SPIRE’2001)-IEEE Computer Society, pp.186–193, Chile, 2001.
Wechsler, M., Sheridan, P., Schäuble, P. Multi-Language Text Indexing for Internet Retrieval. In the Proceedings of the 5th RIAO Conference Computer Assisted Information Searching on the Internet. Montreal, Canada, 1997.
A. Gelbukh, G. Sidorov, L. Chanona-Hernández. Compilation of a Spanish representative corpus. Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science, N 2276, Springer-Verlag, 2002, pp. 285–288.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Brisaboa, N.R., López, JR., Penabad, M.R., Places, Á.S. (2003). Diachronic Stemmed Corpus and Dictionary of Galician Language. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2003. Lecture Notes in Computer Science, vol 2588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36456-0_43
Download citation
DOI: https://doi.org/10.1007/3-540-36456-0_43
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00532-2
Online ISBN: 978-3-540-36456-6
eBook Packages: Springer Book Archive