Abstract
In this paper, we provide for the linguists a method to facilitate the creation of a standard Arabic historical dictionary in order to save the lost period and to be up to date with other languages. In this method, we propose a platform of Automatic Natural Language Processing (ANLP) tools which permits the automatic indexing and research from a corpus of Arabic texts. The indexation is applied after some pretreatments: segmentation, normalization, and filtering, morphological analysis. The prototype that we’ve developed for the generation of standard Arabic historical dictionary permits to extract contexts from the entered corpus and to assign meaning from the user. The evaluation of our system shows that the results are reliable.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
McEnery, T., Wilson, A.: Corpus Linguistics: An Introduction, 2nd edn. Edinburgh University Press, Edinburgh (2001)
Wehr, H.: Arabic English dictionary. In: Cowan, M. (ed.) The Hans Wehr Dictionary of Modern Written Arabic, 4th edn. Spoken Language Services, Inc., Urbana (1994)
AlSaid. M.: A Corpus-based historical Arabic dictionary. Ph.D. thesis in linguistics, Cairo University (2010)
Kadri, Y., Nie, J.Y.: Effective stemming for Arabic information retrieval. In: Proceedings of the Challenge of Arabic for NLP/MT Conference, Londres, United Kingdom (2006)
Trench, R.: On some deficiencies in our English dictionaries: being the substance of two papers read before the philological society, London (1857)
Larkey, S.L., Ballesteros, L.: Light stemming for Arabic information retrieval. In: Arabic Computational Morphology, pp. 221–243 (2007)
Larkey, L., Ballesteros, S., Connell, E.M.: Improving stemming for Arabic information retrieval light stemming and co-occurrence analysis. In: Proceedings of the 25th Annual International Conference on Research and Development in Information Retrieval (SIGIR 2002), Tampere, Finland, August 11–15, pp. 275–282 (2005)
Larkey, S.L., Ballesteros, L., Connell, E.M.: Light stemming for Arabic information retrieval. In: Arabic Computational Morphology: Knowledge-Based and Empirical Methods (2005)
Souteh, Y., Bouzoubaa, K.: SAFAR platform and its morphological layer. In: Proceeding of the Eleventh Conference on Language Engineering ESOLEC’2011, 14–15 December, Cairo, Egypt (2011)
Habash, N.: Arabic natural language processing: words. In: Summer School on Human Language Technology Johns Hopkins University, Baltimore, 6 July 2005
Larkey, L., Ballesteros, S., Connell, E.M.: Light stemming for Arabic information retrieval, Arabic computational morphology: knowledge-based and empirical methods (2005)
Larkey, L., Ballesteros, S., Connell, E.M.: Improving stemming for Arabic information retrieval light stemming and co-occurrence analysis. In: Proceedings of the 25th Annual International Conference on Research and Development in Information Retrieval (SIGIR 2002), Tampere, Finland, pp. 275–282, 11–15 August 2002
Khalifa, I., Feki, Z., Farawila, A.: Arabic discourse segmentation based on rhetorical methods. Int. J. Electr. Comput. Sci. IJECS-IJENS 11, 10–15 (2011)
Semmar, N., Elkateb-Gara, F., Fluhr, C.: Using a stemmer in a natural language processing system to treat Arabic for cross-language information retrieval. In: International Conference on Machine Intelligence, Tozeur, Tunisia (2005)
Jacquemin, C., Daille, B., Royante, J., Polanco, X.: In vitro evaluation of a program for machine-aided indexing. Inf. Process. Manag. 38(6), 765–792 (2002)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Khalfallah, F., Msadak, H., Aloulou, C., Belguith, L.H. (2016). A Platform Based ANLP Tools for the Construction of an Arabic Historical Dictionary. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2016. Lecture Notes in Computer Science(), vol 9612. Springer, Cham. https://doi.org/10.1007/978-3-319-41754-7_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-41754-7_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41753-0
Online ISBN: 978-3-319-41754-7
eBook Packages: Computer ScienceComputer Science (R0)