Abstract
This paper proposes a new corpus-based approach for deriving syntactic structures and generating parse trees of natural language sentences. The parts of speech (word categories) of the words in a sentence play the key role in this process. The grammar formalism used is more general than those of most grammar induction methods proposed in the literature. The approach was tested on Turkish using a corpus of more than 5,000 sentences, and successful results were obtained.
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Güngör, T. (2004). Generation of Sentence Parse Trees Using Parts of Speech. In: Biundo, S., Frühwirth, T., Palm, G. (eds) KI 2004: Advances in Artificial Intelligence. KI 2004. Lecture Notes in Computer Science, vol 3238. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30221-6_6
DOI: https://doi.org/10.1007/978-3-540-30221-6_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23166-0
Online ISBN: 978-3-540-30221-6
eBook Packages: Springer Book Archive