Abstract
The Stanford statistical parser for Arabic is currently considered the best-performing parsing tool compared to other parsers. Its performance, however, is not stable and may vary with the corpus. A more detailed evaluation method could help users identify the causes of a performance loss. For this reason, we propose to evaluate the Stanford Parser by verifying whether syntactic constraints (called properties) are satisfied in the analysis results for the corpus. These properties can be obtained from a reference Arabic property grammar. In the process, we enriched the simple representation of the parsing result with syntactic properties. This makes explicit several pieces of implicit information, namely the relations between syntactic units. We thus obtained both a detailed method for evaluating parsers and a more syntactically informative representation of the analysis. The results are detailed and encouraging.
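To make the idea of property verification concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a constituent is reduced to the ordered list of its children's category labels, and it checks four property types commonly used in property grammars (linearity, uniqueness, requirement, exclusion). The labels `DET`, `NOUN`, `ADJ`, `PRON`, the toy property set, and the `satisfaction_ratio` score are all hypothetical illustrations. They are not taken from the reference Arabic property grammar or from the evaluation measure used in the paper.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Property:
    kind: str                             # property type, e.g. "linearity"
    description: str                      # human-readable form of the constraint
    check: Callable[[List[str]], bool]    # True if the children satisfy it


def linearity(a: str, b: str) -> Property:
    """Every a must precede every b when both occur."""
    def check(children: List[str]) -> bool:
        if a not in children or b not in children:
            return True  # nothing to order
        last_a = max(i for i, c in enumerate(children) if c == a)
        first_b = min(i for i, c in enumerate(children) if c == b)
        return last_a < first_b
    return Property("linearity", f"{a} < {b}", check)


def uniqueness(a: str) -> Property:
    """Category a occurs at most once."""
    return Property("uniqueness", f"unique({a})", lambda ch: ch.count(a) <= 1)


def requirement(a: str, b: str) -> Property:
    """If a occurs, b must occur as well."""
    return Property("requirement", f"{a} => {b}", lambda ch: a not in ch or b in ch)


def exclusion(a: str, b: str) -> Property:
    """a and b must not occur together."""
    return Property("exclusion", f"{a} excludes {b}",
                    lambda ch: not (a in ch and b in ch))


def evaluate(children: List[str],
             properties: List[Property]) -> Tuple[List[str], List[str]]:
    """Split the properties into satisfied and violated ones."""
    satisfied = [p.description for p in properties if p.check(children)]
    violated = [p.description for p in properties if not p.check(children)]
    return satisfied, violated


def satisfaction_ratio(children: List[str], properties: List[Property]) -> float:
    """Hypothetical score: share of properties the constituent satisfies."""
    satisfied, _ = evaluate(children, properties)
    return len(satisfied) / len(properties) if properties else 1.0


# Toy property set for a noun-phrase-like constituent (illustrative only).
TOY_NP_PROPERTIES = [
    uniqueness("DET"),
    linearity("DET", "NOUN"),
    requirement("ADJ", "NOUN"),
    exclusion("PRON", "DET"),
]

if __name__ == "__main__":
    for children in (["DET", "NOUN", "ADJ"], ["NOUN", "DET", "DET"]):
        ok, bad = evaluate(children, TOY_NP_PROPERTIES)
        print(children, "satisfied:", ok, "violated:", bad)
```

Under these assumptions, attaching the satisfied and violated lists to each constituent of a parser's output gives both a fine-grained diagnostic (which constraints fail, and where) and an enriched representation of the parse.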
Notes
1. http://nlp.cs.nyu.edu/evalb/ (Sekine, S. and Collins, M., 2006).
2. http://www.informatics.susx.ac.uk/re-search/nlp/carroll/greval.html (Carroll, J., 2006).
3. From the Arabic book “زهرة بابنج للعصفورة” (A Chamomile Flower for the Bird) by Talal Hassan: http://www.awu-dam.org/book/02/child02/105-t-h/105-t-h.zip.
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Bahloul, R.B., Kadri, N., Haddar, K., Blache, P. (2018). Evaluation and Enrichment of Stanford Parser Using an Arabic Property Grammar. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. CICLing 2017. Lecture Notes in Computer Science, vol 10761. Springer, Cham. https://doi.org/10.1007/978-3-319-77113-7_14
DOI: https://doi.org/10.1007/978-3-319-77113-7_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77112-0
Online ISBN: 978-3-319-77113-7
eBook Packages: Computer Science, Computer Science (R0)