Abstract
In this paper we present the conversion of two treebanks (Cat3LB for Catalan and Cast3LB for Spanish) from their original constituent format into dependencies. The conversion was performed automatically, driven by a manually written head table and function table. The process was also used to improve the quality of the original annotation and to modify the annotation for further extensions of the treebanks. Treebanks in both formats are freely available for research purposes.
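To make the role of the two tables concrete, the following is a minimal sketch of a table-driven constituent-to-dependency conversion. It is not the authors' implementation: the labels (S, NP, VP, DET, N, V), the table contents, the relation names, the rightmost-child fallback, and every function name are hypothetical and chosen only for illustration. They do not reflect the actual Cat3LB or Cast3LB tagsets.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    label: str                   # constituent label, or POS tag on a leaf
    word: Optional[str] = None   # set only on leaves
    index: int = 0               # 1-based token position, leaves only
    children: List["Node"] = field(default_factory=list)


# Hypothetical head table: for each parent label, the child labels to try,
# in priority order, when choosing the head child.
HEAD_TABLE: Dict[str, List[str]] = {
    "S": ["VP", "NP"],
    "VP": ["V"],
    "NP": ["N"],
}

# Hypothetical function table: (parent label, dependent label) -> relation.
FUNCTION_TABLE: Dict[Tuple[str, str], str] = {
    ("S", "NP"): "SUBJ",
    ("VP", "NP"): "OBJ",
    ("NP", "DET"): "SPEC",
}

Arc = Tuple[int, int, str]  # (dependent index, head index, relation)


def head_child(node: Node) -> Node:
    """Pick the head child using HEAD_TABLE."""
    for wanted in HEAD_TABLE.get(node.label, []):
        for child in node.children:
            if child.label == wanted:
                return child
    # Assumed fallback when no rule matches: take the rightmost child.
    return node.children[-1]


def to_dependencies(node: Node, arcs: List[Arc]) -> Node:
    """Return the lexical head (a leaf) of `node`, appending arcs on the way."""
    if not node.children:
        return node
    head = head_child(node)
    head_leaf = to_dependencies(head, arcs)
    for child in node.children:
        if child is head:
            continue
        dep_leaf = to_dependencies(child, arcs)
        # Unlisted (parent, dependent) pairs get a generic "DEP" relation.
        relation = FUNCTION_TABLE.get((node.label, child.label), "DEP")
        arcs.append((dep_leaf.index, head_leaf.index, relation))
    return head_leaf


def convert(root: Node) -> List[Arc]:
    """Convert a constituent tree into a sorted list of dependency arcs."""
    arcs: List[Arc] = []
    root_leaf = to_dependencies(root, arcs)
    arcs.append((root_leaf.index, 0, "ROOT"))  # index 0 is the artificial root
    return sorted(arcs)


# Toy Catalan sentence: "el gat menja peix" ("the cat eats fish").
tree = Node("S", children=[
    Node("NP", children=[Node("DET", "el", 1), Node("N", "gat", 2)]),
    Node("VP", children=[
        Node("V", "menja", 3),
        Node("NP", children=[Node("N", "peix", 4)]),
    ]),
])

for dependent, head, relation in convert(tree):
    print(dependent, head, relation)
```

In this sketch the head table decides which child passes its lexical head up to the parent, and the function table labels the arc from every non-head child to that head. Keeping both as plain data is what allows the conversion itself to stay fully automatic while the linguistic decisions are written by hand.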
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Civit, M., Martí, M.A., Bufí, N. (2006). Cat3LB and Cast3LB: From Constituents to Dependencies. In: Salakoski, T., Ginter, F., Pyysalo, S., Pahikkala, T. (eds) Advances in Natural Language Processing. FinTAL 2006. Lecture Notes in Computer Science, vol 4139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11816508_16
DOI: https://doi.org/10.1007/11816508_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37334-6
Online ISBN: 978-3-540-37336-0
eBook Packages: Computer Science, Computer Science (R0)