Abstract
In this study, we present a hybrid named entity recognizer for Turkish, which is based on a previously proposed rule based recognizer. Since rule based systems for speci¯c domains require their knowledge sources to be manually revised when ported to other domains, we turn the rule based recognizer into a hybrid one so that it learns from annotated data and improves its knowledge sources accordingly. Both the hybrid recognizer and its predecessor are evaluated on the same corpora and the hybrid recognizer achieves comparably better results. The current study is signi¯cant since it presents the¯rst hybrid {manually engineered and learning{ named entity recognizer for Turkish texts
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvistica Investigationes 30, 1 (2007) 3–26
Cucerzan, S., Yarowsky, D.: Language independent named entity recognition com- bining morphological and contextual evidence. In: Proceedings of the Joint SIG- DAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. (1999)
Tür, G., Hakkani-Tür, D., Oflazer, K.: A statistical information extraction system for Turkish. Natural Language Engineering 9, 2 (2003) 181–210
Bayraktar, Ö., Taşkaya-Temizel, T.: Person name extraction from Turkish financial news text using local grammar based approach. In: Proceedings of the International Symposium on Computer and Information Sciences (ISCIS). (2008)
Küç ük, D., Yazici, A.: Identification of coreferential chains in video texts for se- mantic annotation of news videos. In: Proceedings of the International Symposium on Computer and Information Sciences (ISCIS). (2008)
Küç ük, D., Yazici, A.: Named entity recognition experiments on Turkish texts. In: Proceedings of the International Conference on Flexible Query Answering Systems (FQAS). (2009)
Adai, S., Sönmez, A.C., Göktörk, M.: An integrated architecture for process- ing business documents in Turkish. In: Proceedings of the Conference on Text Processing and Computational Linguistics (CICLing). (2009)
Freitag, D.: Machine Learning for Information Extraction in Informal Domains. PhD thesis, Computer Science Department, Carnegie Mellon University (1998)
Say, B., Zeyrek, D., Oflazer, K., Özge, U.: Development of a corpus and a tree- bank for present-day written Turkish. In: Proceedings of the 11th International Conference of Turkish Linguistics (ICTL). (2002)
Maynard, D., Tablan, V., Ursu, C., Cunningham, H., Wilks, Y.: Named entity recognition from diverse text types. In: Proceedings of the Conference on Recent Advances in Natural Language Processing. (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media B.V.
About this paper
Cite this paper
Küçük, D., Yazici, A. (2011). A Hybrid Named Entity Recognizer for Turkish with Applications to Different Text Genres. In: Gelenbe, E., Lent, R., Sakellari, G., Sacan, A., Toroslu, H., Yazici, A. (eds) Computer and Information Sciences. Lecture Notes in Electrical Engineering, vol 62. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9794-1_23
Download citation
DOI: https://doi.org/10.1007/978-90-481-9794-1_23
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-9793-4
Online ISBN: 978-90-481-9794-1
eBook Packages: EngineeringEngineering (R0)