Abstract
The paper presents the module for automatic morphological annotation within a text synthesizer for Hebrew, based on an efficient combination of two approaches. The first approach includes the selection of lexemes from appropriate lexica, while the other approach involves automatic morphological analysis of text input using a complex expert algorithm relying on a set of transformational rules and using 6 types of scoring procedures. The module operates on a set of 30 part-of-speech tags with more than 3000 corresponding morphological categories. The paper discusses the advantages of the proposed method in the context of an extremely morphologically complex language such as Hebrew, with particular emphasis given to the relative importance of individual scoring procedures. When all 6 scoring procedures are applied, the accuracy of 99.6% is achieved on a corpus of 3093 sentences (55046 words).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Manning, C., Schütze, H.: Foundations of statistical natural language processing. MIT Press, Cambridge (2000)
Aronoff, M., Rees-Miller, J.: Morphophonemics of modern Hebrew. Wiley-Blackwell, San Francisco (2003)
Fellman, J.: Concerning the “revival” of the Hebrew language. Anthropol. Linguist. 15(5), 250–257 (1973)
Lembersky, G., Shacham, D., Wintner, S.: Morphological disambiguation of Hebrew: A case study in classifier combination. Nat. Lang. Eng. Available on CJO 2012 (2012)
Wintner, S.: Hebrew computational linguistics: Past and Future. Artif. Intell. Rev. 21(2), 113–138 (2004)
Bar-Haim, R., Sima’an, K., Winter, Y.: Part-of-speech tagging of modern Hebrew text. Nat. Lang. Eng. 14(2), 223–251 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Popović, B., Sečujski, M., Delić, V., Janev, M., Stanković, I. (2013). Automatic Morphological Annotation in a Text-to-Speech System for Hebrew. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science(), vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-01931-4_11
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01930-7
Online ISBN: 978-3-319-01931-4
eBook Packages: Computer ScienceComputer Science (R0)