research-article

Offline Arabic handwritten text recognition: A Survey

Published: 12 March 2013

Abstract

Research in offline Arabic handwriting recognition has increased considerably in the past few years. This is evident from the numerous research results published recently in major journals and conferences in the area of handwriting recognition. The feature and classification techniques used in recent research have diversified noticeably compared to the past. Moreover, more effort has been directed, in the last few years, to constructing different databases for Arabic handwriting recognition. This article provides a comprehensive survey of recent developments in Arabic handwriting recognition. The article starts with a summary of the characteristics of Arabic text, followed by a general model for an Arabic text recognition system. The databases used for Arabic text recognition are then discussed. Research on the preprocessing phase is presented, including text representation, baseline detection, and line, word, character, and subcharacter segmentation algorithms. Different feature extraction techniques used in Arabic handwriting recognition are identified and discussed. Different classification approaches, such as HMM, ANN, SVM, k-NN, and syntactical methods, are discussed in the context of Arabic handwriting recognition. Work on Arabic lexicon construction and spell checking is presented as part of the postprocessing phase. Several summary tables of published research are provided, covering the Arabic text databases used and the reported results on Arabic character, word, numeral, and text recognition. These tables summarize the features, classifiers, data, and reported recognition accuracy for each technique. Finally, we discuss some future research directions in Arabic handwriting recognition.

References

[1]
Abandah, G. A., Younis, K. S., and Khedher, M. Z. 2008. Handwritten arabic character recognition using multiple classifiers based on letter form. In Proceedings of the 5th IASTED International Conference on Signal Processing, Pattern Recognition, and Applications (SPPRA). 128--133.
[2]
Abandah, G. and Anssari, N. 2009. Novel moment features extraction for recognizing handwritten arabic letters. J. Comput. Sci. 5, 3, 226--232.
[3]
Abbes, R. and Hassoun, J. D. 2004. The architecture of a standard arabic lexical database, some figures, ratios and categories from the DIINAR.1 source program. In Proceedings of the Workshop on Computational Approaches to Arabic Script--Based Languages. 15--22.
[4]
Abd, M. A. and Paschos, G. 2007. Effective arabic character recognition using support vector machines. Innov. Adv. Tech. Comput. Inf. Sci. Engin. 7--11.
[5]
Abdelazeem, S. and El-Sherif, E. 2008. Arabic handwritten digit recognition. Int. J. Doc. Anal. Recog. 11, 3, 127--141.
[6]
Abdelazeem, S. 2009. Comparing arabic and latin handwritten digits recognition problems. World Acad. Sci. Engin. Technol. 54, 451--455.
[7]
Abdulkadr, A. 2006. Two-Tier approach for arabic offline handwriting recognition. In Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition (IWFHR). 161--166.
[8]
Abdullah, S., Al-Nassiri, A., and Abdul Salam, R. 2008. Off-Line arabic handwritten word segmentation using rotational invariant segments features. Int. Arab J. Inf. Technol. 5, 2, 200--208.
[9]
Abuhaiba, I. S. I., Mahmoud, S. A., and Green, R. J. 1994. Recognition of handwritten cursive arabic characters. IEEE Trans. Pattern Anal. Mach. Intell. 16, 9, 664--672.
[10]
Abuleil, S. and Evens, M. 2002. Extracting an arabic lexicon from arabic newspaper text. Comput. Humanit. 36, 2, 191--221.
[11]
Alaei, A., Pal, U., and Nagabhushan, P. 2009a. Using modified contour features and SVM based classifier for the recognition of persian/arabic handwritten numerals. In Proceedings of the 7th International Conference on Advances in Pattern Recognition (ICAPR). 391--394.
[12]
Alaei, A., Nagabhushan, P., and Pal, U. 2009b. Fine classification of unconstrained handwritten persian/arabic numerals by removing confusion amongst similar classes. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 601--605.
[13]
Alamri, H., Sadri, J., Suen, C., and Nobile, N. 2008. A novel comprehensive database for arabic off-line handwriting recognition. In Proceedings of the 11th International Conference on Frontiers in Handwriting Recognition (ICFHR). 664--669.
[14]
Alamri, H., He, C. L., and Suen, C. Y. 2009. A new approach for segmentation and recognition of arabic handwritten touching numeral pairs. In Proceedings of the International Conference Computer Analysis of Images and Patterns (CAIP). Lecture Notes in Computer Science, vol. 5702, Springer, 165--172.
[15]
Al-Badr, B. and Mahmoud, S. A. 1995. Survey and bibliography of arabic optical text recognition. Signal Process. 41, 1, 49--77.
[16]
Al-Hajj, R., Mokbel, C., and Likforman-Sulem, L. 2007. Combination of HMM-based classifiers for the recognition of arabic handwritten words. In Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR). 959--963.
[17]
Al-Hajj, R., Mokbel, C., and Likforman-Sulem, L. 2008. Recognition of arabic handwritten words using contextual character models. In Proceedings of the 20th IS&T/SPIE Annual Symposium on Electronic Imaging, Document Recognition and Retrieval XV. Vol. 6815. SPIE.
[18]
Al Hamad, H. A. and Abu Zitar, R. 2010. Development of an efficient neural--based segmentation technique for arabic handwriting recognition. Pattern Recogn. 43, 8, 2773--2798.
[19]
Ali, M. A. 2008. Arabic handwritten characters classification using learning vector quantization algorithm. In Image and Signal Processing, Lecture Notes in Computer Science, vol. 5099, Springer, 463--470.
[20]
Al-Jarrah, O., Al-Kiswany, S., Al-Gharaibeh, B., Fraiwan, M., and Khasawneh, H. A. 2006. New algorithm for arabic optical character recognition. In Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases. 211--224.
[21]
Alkhateeb, J. H., Jinchang Ren Ipson, S. S., and Jianmin J. 2008. Knowledge-Based baseline detection and optimal thresholding for words segmentation in efficient pre-processing of handwritten arabic text. InProceedings of the 5th International Conference on Information Technology: New Generations (ITNG'08). 1158--1159.
[22]
Akhateeb, J. H., Jiang, J., Ren, J., Khelifi, F., and Ipson, S. S. 2009. Multiclass classification of unconstrained handwritten arabic words using machine learning approaches. The Open Signal Process. J. 2, 21--28.
[23]
Alma'adeed, S., Elliman, D., and Higgins, C. A. A. 2002a. A data base for arabic handwritten text recognition research. In Proceedings of the 8th International Workshop on Frontiers in Handwriting Recognition (IWFHR'02). 485--489.
[24]
Alma'adeed, S., Higgens, C., and Elliman, D. 2002b. Recognition of off-line handwritten Arabic words using hidden markov model approach. In Proceedings of the 16th International Conference on Pattern Recognition (ICPR'02).
[25]
Alma'adeed, S., Higgens, C., and Elliman, D. 2004. Off-Line recognition of handwritten arabic words using multiple hidden markov models. Knowl. Based Syst. 17, 75--79.
[26]
Alma'adeed, S. 2006. Recognition of off-line handwritten arabic words using neural network. In Proceedings of the IEEE Conference on Geometric Modeling and Imaging-New Trends.
[27]
Almuallim, H. and Yamaguchi, S. 1987. A method of recognition of arabic cursive handwriting. IEEE Trans. Pattern Anal. Mach. Intell. 9, 5, 715--722.
[28]
Al-Muhtaseb, H. A., Mahmoud, S. A., and Qahwaji, R. S. 2008. Recognition of off-line printed arabic text using hidden markov models. Signal Processing 88, 12, 2902--2912.
[29]
Al-Ohali, Y., Cheriet, M., and Suen, C. 2003. Databases for recognition of handwritten arabic cheques. Pattern Recogn. 36, 1, 111--121.
[30]
Al-Omari, F. A. and Al-Jarrah, O. 2004. Handwritten indian numerals recognition system using probabilistic neural networks. Adv. Engin. Inf. 18, 9--16.
[31]
Al-Shatnawi, A. and Omar, K. 2008. Methods of arabic language baseline detection -- The state of art. Int. J. Comput. Sci. Netw. Secur. 8, 10, 137--143.
[32]
Al-Shatnawi, A. and Omar, K. 2009a. A comparative study between methods of arabic baseline detection. In Proceedings of the International Conference on Electrical Engineering and Informatics. 73--77.
[33]
Al-Shatnawi, A. and Omar, K. 2009b. Detecting arabic handwritten word baseline using voronoi diagram. In Proceedings of the International Conference on Electrical Engineering and Informatics. 18--22.
[34]
Al-Shatnawi, A. and Omar, K. 2009c. Skew detection and correction technique for arabic document images based on centre of gravity. J. Comput. Sci. 5, 5, 363--368.
[35]
Amin, A. 2003. Recognition of hand-printed characters based on structural description and inductive logic programming. Pattern Recogn. Lett. 24, 16, 3187--3196.
[36]
Applied Media Analysis. 2007. Arabic-Handwritten-1.0 database. http://appliedmediaanalysis.com/Datasets. htm#Arabic.
[37]
Awaidah, S. and Mahmoud, S. A. 2009. A multiple feature/resolution scheme to arabic (indian) numerals recognition using hidden markov models. Signal Process. 89, 6, 1176--1184.
[38]
Azizi, N., Farah, N., Khadir, M. T., and Sellami, M. 2009. Arabic handwritten word recognition using classifiers selection and features extraction/selection. In Recent Advances in Intelligent Information Systems. 735--742.
[39]
Azizi, N., Farah, N., Sellami, M., and Ennaji, A. 2010. Using diversity in classifier set selection for arabic handwritten recognition. In Multiple Classifier Systems, Lecture Notes in Computer Science, vol. 5997, Springer. 235--244.
[40]
Bahlmann, C. 2006. Directional features in online handwriting recognition. Pattern Recogn. 39, 1, 115--125.
[41]
Bazzi, I., Lapre, C., Makhoul, J., and Schwartz, R. 1997. Omnifont and unlimited vocabulary OCR for english and arabic. In Proceedings of the 5th International Conference on Document Analysis and Recognition (ICDAR). 842--846.
[42]
Bazzi, I., Schwartz, R., and Makhoul, J. 1999. An omnifont open-vocabulary OCR system for english and arabic. IEEE Trans. Pattern Anal. Mach. Intell. 21, 6, 495--504.
[43]
Ben Amor, N. and Ben Amara, N. E. 2006. A hybrid approach for multifont arabic characters recognition. In Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases. 194--198.
[44]
Ben Cheikh, I., Belaïd, A., and Kacem, A. 2008. A novel approach for the recognition of a wide arabic handwritten word lexicon. In Proceedings of the 19th International Conference on Pattern Recognition (ICPR).
[45]
Benjelil, M., Kanoun, S., Mullot, R., and Alimi, A. M. 2009. Arabic and latin script identification in printed and handwritten types based on steerable pyramid features. In Proceedings of the Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 591--595.
[46]
Ben Moussa, S., Frissard, Q., Zahour, A., Benabdelhafid, A., and Alimi, A. M. 2010. New features using fractal multi-dimensions for generalized arabic font recognition. Pattern Recogn. Lett. 31, 5, 361--371.
[47]
Benouareth, A., Ennaji, A., and Sellami, M. 2006a. HMMs with explicit state duration applied to handwritten arabic word recognition. In Proceedings of the 18th International Conference on Pattern Recognition (ICPR). 897--900.
[48]
Benouareth, A., Ennaji, A., and Sellami, M. 2006b. Semi-Continuous HMMs with explicit state duration applied to arabic handwritten word recognition. In Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition (IWFHR).
[49]
Benouareth, A., Ennaji, A., and Sellami, M. 2008a. Arabic handwritten word recognition using HMMs with explicit state duration. J. Advances Signal Process. 1.
[50]
Benouareth, A., Ennaji, A., and Sellami, M. 2008b. Semi-Continuous HMMs with explicit state duration for unconstrained arabic word modeling and recognition. Pattern Recogn. Lett. 29, 12, 1742--1752.
[51]
Benouareth, A., Ennaji, A., and Sellami, M. 2008c. Arabic handwritten word recognition using HMMs with explicit state duration. EURASIP J. Adv. Signal Process.
[52]
Biadsy, F., El-Sana, J., and Habash, N. 2006. Online arabic handwriting recognition using hidden markov models. In Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition (IWFHR).
[53]
Blumenstein, M., Liu, X. Y., and Verma, B. 2007. An investigation of the modified direction feature for cursive character recognition. Pattern Recogn. 40, 2, 376--388.
[54]
Boubaker, H., Kherallah, M., and Alimi, A. M. 2009. New algorithm of straight or curved baseline detection for short arabic handwritten writing. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 778--782.
[55]
Broumandnia, A., Shanbehzadeh, J., and Nourani, M. 2007. Handwritten farsi/arabic word recognition. In Proceedings of the IEEE/ACS International Conference on Computer Systems and Applications. 767--771.
[56]
Broumandnia, A., Shanbehzadeh, J., and Varnoosfaderani, M. R. 2008. Persian/Arabic handwritten word recognition using m-band packet wavelet transform. Image Vis. Comput. 26, 6, 829--842.
[57]
Bunke, H., Bengio, S., and Vinciarelli, A. 2004. Off-Line recognition of unconstrained handwritten texts using HMMs and statistical language models. IEEE Trans. Pattern Anal. Mach. Intell. 26, 6, 709--720.
[58]
Cheriet, M. 2008. Visual recognition of Arabic handwriting: challenges and new directions. In Arabic and Chinese Handwriting Recognition, Lecture Notes in Computer Science, vol. 4768, Springer, 1--21.
[59]
Chen, J., Cao, H., Prasad, R., Bhardwaj, A., and Natarajan, P. 2010. Gabor features for offline arabic handwriting recognition. In Proceedings of the 9th IAPR International Workshop on Document Analysis Systems (DAS). 53--58.
[60]
Clocksin, W. F. and Fernando, P. P. J. 2003. Towards automatic transcription of syriac handwriting. In Proceedings of the International Conference on Image Analysis and Processing. 664--669.
[61]
Dehghani, A., Shabini, F., and Nava, P. 2001. Off-line recognition of isolated persian handwritten characters using multiple hidden markov models. In Proceedings of the International Conference on Information Technology: Coding and Computing. 506--510.
[62]
Dichy, J. and Fargaly, A. 2003. Roots & patterns vs. stems plus grammar-lexis specifications: on what basis should a multilingual lexical database centred on arabic be built? In Proceedings of the IXth Machine Translation Summit in the Workshop on Machine Translation for Semitic Languages: Issues and Approaches. 1--8.
[63]
Ding, X. and Liu, H. 2008. Segmentation-Driven offline handwritten chinese and arabic script recognition. In Arabic and Chinese Handwriting Recognition. Lecture Notes in Computer Science, vol. 4768. Springer, 196--217.
[64]
Dreuw, P., Jonas, S., and Ney, H. 2008. White-Space models for offline arabic handwriting recognition. In 19th International Conference on Pattern Recognition (ICPR).
[65]
Dreuw, P., Rybach, D., Gollan, C., and Ney, H. 2009a. Writer adaptive training and writing variant model refinement for offline arabic handwriting recognition. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 21--25.
[66]
Dreuw, P., Heigold, G., and Ney, H. 2009b. Confidence-Based discriminative training for model adaptation in offline arabic handwriting recognition. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 596--600.
[67]
El Abed, H. and Märgner, V. 2007a. The IFN/ENIT-database-a tool to develop arabic handwriting recognition systems. In IEEE International Symposium on Signal Processing and its Applications (ISSPA).
[68]
El Abed, H. and Märgner, V. 2007b. Comparison of different preprocessing and feature extraction methods for offline recognition of handwritten arabic words. In Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR). 974--978.
[69]
El Abed, H. and Märgner, V. 2008a. Arabic text recognition systems - State of the art and future trends. In 5th International Conference on Innovations in Information Technology (Innovations'08).
[70]
El Abed, H. and Märgner, V. 2008b. Reject rules and combination methods to improve arabic handwritten word recognizers. In Proceedings of the 11th International Conference on Frontiers in Handwriting Recognition (ICFHR). 180--185.
[71]
El Abed, H. and Märgner, V. 2009a. How to improve a handwriting recognition system. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 1181--1185.
[72]
El Abed, H. and Märgner, V. 2009b. Improvement of arabic handwriting recognition systems: combination and/or reject? Proc. SPIE 7247, 1--10.
[73]
El Abed, H. and Margner, V. 2010. A framework for the combination of different arabic handwritten word recognition systems. In Proceedings of the 20th International Conference of Pattern Recognition (ICPR). 1904--1907.
[74]
Elarian, Y. and Mahmoud, S. A. 2008. An adaptive line segmentation algorithm (alsa) for arabic. In Proceedings of the International Conference on Image Processing, Computer Vision, and Pattern Recognition (IPCV). 735--739.
[75]
Elbaati, A., Boubaker, H., Kherallah, M., Alimi, A. M., Ennaji, A., and El Abed, H. 2009. Arabic handwriting recognition using restored stroke chronology. InProceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 411--415.
[76]
El-Hajj, R., Likforman-Sulem, L., and Mokbel, C. 2005. Arabic handwriting recognition using baseline dependent features and hidden markov modeling. In Proceedings of the International Conference on Document Analysis and Recognition. 893--897.
[77]
El-Hajj, R., Mokbel, C., and Likforman-Sulem, L. 2008. Recognition of arabic handwritten words using contextual character models. Proc. SPIE 6815.
[78]
El-Sherif, E. and Abdelazeem, S. 2007. A two-stage system for arabic handwritten digit recognition tested on a new large database. In Proceedings of the International Conference on Artificial Intelligence and Pattern Recognition (AIPR'07). 237--242.
[79]
Farghaly, A. and Senellart, J. 2003. Intuitive coding of the arabic lexicon. In Proceedings of the IXth Machine Translation Summit.
[80]
Farooq, F., Govindaraju, V., and Perrone, M. 2005. Pre-Processing methods for handwritten arabic documents. In Proceedings of the 8th International Conference on Document Analysis and Recognition. 267--271.
[81]
Farah, N., Souici, L., and Sellami, M. 2006. Classifiers combination and syntax analysis for arabic literal amount recognition. Engin. Appl. Artif. Intell. 19, 1, 29--39.
[82]
Freeman, H. 1961. On the encoding of arbitrary geometric configurations. IRE Trans. Electron. Comput. EC- 10, 260--268.
[83]
Fujisawa, H. 2008. Forty years of research in character and document recognition - an industrial perspective. Pattern Recogn. 41, 8, 2435--2446.
[84]
Graves, A. and Schmidhuber, J. 2009. Offline handwriting recognition with multidimensional recurrent neural networks. In Proceedings of the Conference on Advances in Neural Information Processing Systems (NIPS'09). 545--552.
[85]
Haboubi, S., Maddouri, S., Ellouze, N., and El-Abed, H. 2009. Invariant primitives for handwritten arabic script: A contrastive study of four feature sets. In Proceedings of the 10th International Conference on Document Analysis and Recognition. 691--697.
[86]
Haddad B. and Yaseen M. 2007. Detection and correction of non-words in arabic: A hybrid approach. Int. J. Comput. Process. Oriental Lang. 20, 4, 237--257.
[87]
Hamdani, M., El Abed, H., Kherallah, M., and Alimi, A. M. 2009. Combining multiple HMMs using on-line and off-line features for off-line arabic handwriting recognition. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 201--205.
[88]
Harifi, A. and Aghagolzade, A. 2004. A new pattern for handwritten persian/arabic digit recognition. Int. J. Inf. Technol. 1, 4, 293--296.
[89]
Hassin, A., Tang, X., Liu, J., and Zhao, W. 2004. Printed arabic character recognition using HMM. J. Comput. Sci. Technol. 19, 4, 538--543.
[90]
Hu, J., Lim, S., and Brown, M. 2000. Writer independent on-line handwriting recognition using an HMM approach. Pattern Recogn. 33, 1, 133--147.
[91]
Kanoun, S., Slimane, F., Guesmi, H., Ingold, R., Alimi, A. M., and Hennebert, J. 2009. Affixal approach versus analytical approach for off--line arabic decomposable vocabulary recognition. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 661--665.
[92]
Kessentini, Y., Paquet, T., and Ben Hamadou, A. 2009. A multi-lingual recognition system for arabic and latin handwriting. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 1196--1200.
[93]
Kessentini, Y., Paquet, T., and Ben Hamadou, A. 2010. Off-Line handwritten word recognition using multi-stream hidden markov models. Pattern Recogn. Lett. 31, 1, 60--70.
[94]
Khan, T. K., Azam, S. M., and Mohsin, S. 2007. An improvement over template matching using k-means algorithm for printed cursive script recognition. In Proceedings of the 4th IASTED International Conference on Signal Processing, Pattern Recognition, and Applications. 209--214.
[95]
Kharma, N., Ahmed, M., and Ward, R. 1999. A new comprehensive database of handwritten arabic words, numbers, and signatures used for OCR testing. In Proceedings of the Canadian Conference on Electrical and Computer Engineering. 766--768.
[96]
Khedher, M. and Abandah, G. 2002. Arabic character recognition using approximate stroke sequence. In Proceedings of the Arabic Language Resources and Evaluation - Status and Prospects Workshop, 3rd International Conference on Language Resources and Evaluation (LREC'02).
[97]
Kherallah, M., Elbaati, A., El Abed, H., and Alimi, A. M. 2008a. The on/off (LMCA) dual arabic handwriting database. In Proceedings of the 11th International Conference on Frontiers in Handwriting Recognition (ICFHR).
[98]
Kherallah, M., Haddad, L., Alimi, A. M., and Mitiche, A. 2008b. On-Line handwritten digit recognition based on trajectory and velocity modeling. Pattern Recogn. Lett. 29, 5, 580--594.
[99]
Kherallah, M., Bouri, F., and Alimi, A. M. 2009. On-line Arabic handwriting recognition system based on visual encoding and genetic algorithm. Engin. Appl. Artif. Intell. 22, 1, 153--170.
[100]
Khorsheed M. S. 2000. Automatic recognition of words in arabic manuscripts. PhD thesis, University of Cambridge.
[101]
Khorsheed M. S. 2003. Recognising handwritten arabic manuscripts using a single hidden markov model. Pattern Recog. Lett. 24, 14, 2235--2242.
[102]
Khorsheed, M. S. 2006. Mono-Font cursive arabic text recognition using speech recognition system. In Structural, Syntactic, and Statistical Pattern Recognition, Lecture Notes in Computer Science, vol. 4109, Springer, 755--763.
[103]
Khorsheed, M. S. 2007a. Offline recognition of omnifont arabic text using the HMM toolkit (HTK). Pattern Recogn. Lett. 28, 12, 1563--1571.
[104]
Khorsheed, M. S. 2007b. HMM-Based system for recognizing words in historical arabic manuscript. Int. J. Robot. Autom. 22, 4, 294--303.
[105]
Khosravi, H. and Kabir, E. 2007. Introducing a very large dataset of handwritten farsi digits and a study on the variety of handwriting styles. Pattern Recogn. Lett. 28, 10, 1133--1141.
[106]
Kukich, K. 1992. Techniques for automatically correcting words in text. ACM Comput. Surv. 24, 4, 377--440.
[107]
Kumar, J., Abd-Almageed, W., Kang, L., and Doermann, D. 2010. Handwritten arabic text line segmentation using affinity propagation. In Proceedings of the 9th IAPR International Workshop on Document Analysis Systems (DAS). 135--142.
[108]
Kundu, A., Hines, T., Phillips, J., Huyck, B., and Van Guilder, L. 2007. Arabic handwriting recognition using variable duration HMM. In Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR). 644--648.
[109]
Lawal, I. A., Abdel-Aal, R. E., and Mahmoud, S. A. 2010. Recognition of handwritten arabic (indian) numerals using freeman's chain codes and abductive network classifiers. In Proceedings of the 20th International Conference on Pattern Recognition (ICPR). 1884--1887.
[110]
Liu, C. L. and Suen, C. Y. 2009. A new benchmark on the recognition of handwritten bangla and farsi numeral characters. Pattern Recogn. 42, 12, 3287--3295.
[111]
Lorigo, L. and Govindaraju, V. 2005. Segmentation and pre-recognition of arabic handwriting. In Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR). 605--609.
[112]
Lorigo, L. and Govindaraju, V. 2006. Offline arabic handwriting recognition: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 28, 5, 712--724.
[113]
Mahmoud, S. A., Abuhaiba I., and Green, R. J. 1991. Skeletonization of arabic characters using clustering based skeletonization algorithm (CBSA). Pattern Recogn. 24, 5, 453--464.
[114]
Mahmoud, S. A. 2008a. Recognition of writer-independent off-line handwritten arabic (indian) numerals using hidden markov models. Signal Process. 88, 4, 844--857.
[115]
Mahmoud, S. A. 2008b. Arabic (indian) handwritten digits recognition using gabor-based features. In Proceedings of the Conference Innovations in Information Technology (Innovations'08).
[116]
Mahmoud, S. A. 2009. Recognition of arabic (indian) check digits using spatial gabor filters. In Proceedings of the 5th IEEE-GCC Conference on Computing and Information Technology.
[117]
Mahmoud, S. A. and Olatunji, S. O. 2009. Automatic recognition of off-line handwritten arabic (indian) numerals using support vector and extreme learning machines. Int. J. Imaging 2, A09.
[118]
Mahmoud, S. A. and Owaidah, S. 2009. Recognition of off-line handwritten arabic (indian) numerals using multi-scale features and support vector machines. Arab. J. Sci. Engin. 34, 2B, 429--444.
[119]
Mahmoud, S. A. and Abu-Amara, M. H. 2010. The use of radon transform in handwritten arabic (indian) numerals recognition. WSEAS Trans. Comput. 9, 3.
[120]
Mahmoud, S. A. and Al-Khatib, W. A. 2010. Recognition of arabic (indian) bank cheque digits using log-gabor filters. Appl. Intell. J.
[121]
Märgner, V., El Abed, H., and Pechwitz, M. 2006. Offline handwritten arabic word recognition using HMM - A character based approach without explicit segmentation. In Proceedings of the 9th Colloque International Francophone sur l'Ecrit et le Document (CIFED).
[122]
Märgner, V. and El Abed, H. 2008. Databases and competitions: strategies to improve arabic recognition systems. In Arabic and Chinese Handwriting Recognition, Lecture Notes in Computer Science, vol. 4768, Springer, 82--103.
[123]
Märgner, V. and El Abed, H. 2009. Arabic handwriting recognition competition. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR).
[124]
Mezghani, N. and Mitiche, A. 2008. A gibbsian kohonen network for online arabic character recognition. In Advances in Visual Computing, Lecture Notes in Computer Science, vol. 5359, Springer, 493--500.
[125]
Menasri, F., Vincent, N., Cheriet, M., and Augustin, E. 2007. Shape-Based alphabet for off-line arabic handwriting recognition. In Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR). 969--973.
[126]
Mohamad, R. A., Likforman-Sulem, L., and Mokbel, C. 2009. Combining slanted-frame classifiers for improved HMM-based arabic handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31, 7, 1165--1177.
[127]
Mostafa, K. and Darwish, A. M. 1999. Robust base-line independent algorithms for segmentation and reconstruction of arabic handwritten cursive script. In Proceedings of the IS&T/SPIE Conference on Document Recognition and Retrieval VI. Vol. 3651, 73--83.
[128]
Motawa, D., Amin, A., and Sabourin, R. 1997. Segmentation of arabic cursive script. In Proceedings of the International Conference on Document Analysis and Recognition. 625--628.
[129]
Mozaffari, S., Faez, K., and Ziaratban, M. 2005. Structural decomposition and statistical description of farsi/arabic handwritten numeric characters. In Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR). 237--241.
[130]
Mozaffari, S., Faez, K., Faradji, F., Ziaratban, M., and Golzan, S. M. 2006. A comprehensive isolated farsi/arabic character database for handwritten ocr research. In Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition (IWFHR). 385--389.
[131]
Mozaffari, S., Faez, K., Märgner, V., and El-Abed, H. 2007. Strategies for large handwritten farsi/arabic lexicon reduction. In Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR'07). 98--102.
[132]
Mozaffari, S., El Abed, H., Margner, V., Faez, K., and Amirshahi, A. 2008a. IfN/Farsi-database: A database of farsi handwritten city names. In Proceedings of the 11th International Conference on Frontiers in Handwriting Recognition (ICFHR).
[133]
Mozaffari, S., Faez, K., Märgner, V., and El-Abed, H. 2008b. Lexicon reduction using dots for off-line farsi/arabic handwritten word recognition. Pattern Recogn. Lett. 29, 6, 724--734.
[134]
Nasrudin, M. F., Omar, K., Liong, C-Y., and Zakaria, M. S. 2009. Invariant features from the trace transform for jawi character recognition. In Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living, Part II., Lecture Notes in Computer Science, vol. 5518, Springer, 256--263.
[135]
Natarajan, P., Saleem, S., Prasad, R., Macrostie, E., and Krishna, S. 2008. Multi-Lingual offline handwriting recognition using hidden markov models: A script-independent approach. In Arabic and Chinese Handwriting Recognition., Lecture Notes in Computer Science, vol. 4768, Springer, 231--250.
[136]
Natarajan, P., Subramanian, K., Bhardwaj, A., and Prasad, R. 2009. Stochastic segment modeling for offline handwriting recognition. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 971--975.
[137]
Nazif, A. 1975. A system for the recognition of the printed arabic characters. Master's thesis, Faculty of Engineering, Cairo University.
[138]
Olivier, G., Miled, H., Romeo, K., and Lecourtier, Y. 1996. Segmentation and coding of Arabic handwritten words. In Proceedings of the 13th International Conference on Pattern Recognition (ICPR). 264--268.
[139]
Parvez M. T. and Mahmoud, S. A. 2010. Arabic handwritten alphanumeric character recognition using fuzzy attributed turning functions. In Proceedings of the Workshop in Frontiers in Arabic Handwriting Recognition, 20th International Conference in Pattern Recognition (ICPR). 9--14.
[140]
Pechwitz, M., Snoussi Maddouri, S., Märgner, V., Ellouze, N., and Amiri, H. 2002. IFN/ENIT-Database of handwritten arabic words. In Proceedings of the 7th Colloque International Francophone sur l'Ecrit et le Document (CIFED'02). 127--136.
[141]
Pechwitz, M. and Märgner, V. 2002. Baseline estimation for arabic handwritten words. In Proceedings of the 8th International Workshop on Frontiers in Handwriting Recognition (IWFHR'02). 479--484.
[142]
Pechwitz, M. and Märgner, V. 2003. HMM based approach for handwritten arabic word recognition using the IFN/ENIT- database. In Proceedings of the 7th International Conference on Document Analysis and Recognition. 890--894.
[143]
Pechwitz, M., Märgner, V., and El Abed, H. 2006. Comparison of two different feature sets for offline recognition of handwritten arabic words. In Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition (IWFHR).
[144]
Prasad, R., Saleem, S., Kamali, M., Meermeier, R., and Natarajan, P. 2008. Improvements in hidden markov model based arabic ocr. In Proceedings of the 19th International Conference on Pattern Recognition (ICPR).
[145]
Riseman, E. M. and Hanson, A. R. 1974. A contextual post processing system for error correction using binary n-grams. IEEE Trans. Comput. C-23, 5, 480--493.
[146]
Romeo-Parker, K. R. K., Miled, H., and Lecourtier, Y. 1995. A new approach for latin/arabic character segmentation. In Proceedings of the 3rd International Conference on Document Analysis and Recognition (ICDAR). 874--877.
[147]
Saabni, R. and El-Sana, J. 2009. Hierarchical on-line arabic handwriting recognition. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 867--871.
[148]
Sadri, J., Suen, C., and Bui, T. 2003. Application of support vector machines for recognition of handwritten arabic/persian digits. In Proceedings of the 2nd Conference on Machine Vision and Image Processing & Applications (MVIP'03). 300--307.
[149]
Saeed, K. and Albakoor, M. 2009. Region growing based segmentation algorithm for typewritten and handwritten text recognition. Appl. Soft Comput. 9, 2, 608--617.
[150]
Safabakhsh, R. and Adibi, P. 2005. Nastaaligh handwritten word recognition using a continuous-density variable duration HMM. Arab. J. Sci. Engin. 30, 1B, 95--118.
[151]
Said, F. N., Yacoub, R. A., and Suen, C. Y. 1999. Recognition of english and arabic numerals using a dynamic number of hidden neurons. In Proceedings of the 5th International Conference on Document Analysis and Recognition (ICDAR). 237--240.
[152]
Saleem, S., Cao, H., Subramanian, K., Kamali, M., Prasad, R., and Natarajan, P. 2009. Improvements in bbn's HMM-based offline arabic handwriting recognition system. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 773--777.
[153]
Sari, T., Souici, L., and Sellami, M. 2002. Off-Line handwritten arabic character segmentation algorithm: ACSA. In Proceedings of the 8th International Workshop on Frontiers in Handwriting Recognition (IWFHR). 452--457.
[154]
Sarfraz, M., Mahmoud, S. A., and Rasheed, Z. 2007. On skew estimation and correction of text. In Proceedings of the Conference on Computer Graphics Imaging and Visualization (CGIV'07). 308--313.
[155]
Schambach, M., Rottland, J., and Alary, T. 2008. How to convert a latin handwriting recognition system to arabic. In Proceedings of the 11th International Conference on Frontiers in Handwriting Recognition (ICFHR).
[156]
Shaalan, K., Allam, A., and Gohah, A. 2003. Towards automatic spell checking for arabic. In Proceedings of the Conference on Language Engineering (ELSE). 240--247.
[157]
Shirali-Shahreza, M., Faez, K., and Khotanzad, A. 1995. Recognition of handwritten persian/arabic numerals by shadow coding and an edited probabilistic neural network. In Proceedings of the International Conference on Image Processing. 436--439.
[158]
Shirali-Shahreza, M. and Shirali-Shahreza, S. 2006. Persian/Arabic text font estimation using dots. In Proceedings of the 6th IEEE International Symposium on Signal Processing and Information Technology. 420--425.
[159]
Shi, Z., Setlur, S., and Govindaraju, V. 2009. A steerable directional local profile technique for extraction of handwritten arabic text lines. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 176--180.
[160]
Slimane, F., Ingold, R., Kanoun, S., Alimi, A. M., and Hennebert, J. 2008. Duration models for arabic text recognition using hidden markov models. In Proceedings of the International Conferences on Computational Intelligence for Modelling, Control and Automation, Intelligent Agents, Web Technologies and Internet Commerce and Innovation in Software Engineering (CIMCA). 838--843.
[161]
Slimane, F., Ingold, R., Kanoun, S., Alimi, A. M., and Hennebert, J. 2009. A new arabic printed text image database and evaluation protocols. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 946--950.
[162]
Srihari, R., Shetty, S., and Srihari, S. 2007. Use of language models in handwriting recognition. Tech. rep. TR-06-07, Center of Excellence for Document Analysis and Recognition (CEDAR).
[163]
Srihari, S. N., Ball, G. R., and Srinivasan, H. 2008. Versatile search of scanned arabic handwriting. In Arabic and Chinese Handwriting Recognition, Lecture Notes in Computer Science, vol. 4768, Springer, 57--69.
[164]
Sternby, J., Morwing, J., Andersson, J., and Friberg, C. 2009. On-Line arabic handwriting recognition with templates. Pattern Recogn., New Frontiers Handwrit. Recogn. 42, 12, 3278--3286.
[165]
Taghva, K. and Stofsky, E. 2001. OCRSpell: An interactive spelling correction system for ocr errors in text. Int. J. Doc. Anal. Recogn. 3, 3, 125--137.
[166]
Touj, S. M., Ben Amara, N. E., and Amiri, H. 2007. A hybrid approach for off-line arabic handwriting recognition based on a planar hidden markov modeling. In Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR'07). 964--968.
[167]
Wshah, S., Shi, Z., and Govindaraju, V. 2009. Segmentation of arabic handwriting based on both contour and skeleton segmentation. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 793--797.
[168]
Wshah, S., Govindaraju, V., Cheng, Y., and Li, H. 2010. A novel lexicon reduction method for arabic handwriting recognition. In Proceedings of the 20th International Conference on Pattern Recognition (ICPR). 2865--2868.
[169]
Xiu, P., Peng, L., Ding, X., and Wang, H. 2006a. Offline handwritten arabic character segmentation with probabilistic model. In Document Analysis Systems VII, Lecture Notes in Computer Science, vol. 3872, Springer, 402--412.
[170]
Xiu, P., Peng, L., and Ding, X. 2006b. Multi-Queue merging scheme and its application in arabic script segmentation. In Proceedings of the 2nd International Conference on Document Image Analysis for Libraries (DIAL). 24--29.
[171]
Zavorin, I., Borovikov, E., Davis, E., Borovikov, A., and Summers, K. 2008. Combining different classification approaches to improve off-line arabic handwritten word recognition. Proc. SPIE 6815.
[172]
Zeki, A. M., Zakaria, M. S., and Liong, C.-Y. 2007. Isolation of dots for arabic ocr using voronoi diagrams. In Proceedings of the International Conference on Electrical Engineering and Informatics. 199--202.
[173]
Ziaratban, M., Faez, K., and Faradji, F. 2007. Language-Based feature extraction using template-matching in farsi/arabic handwritten numeral recognition. In Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR). 297--301.
[174]
Ziaratban, M. and Faez, K. 2008. A novel two-stage algorithm for baseline estimation and correction in farsi and arabic handwritten text line. In Proceedings of the 19th International Conference on Pattern Recognition (ICPR). 1--5.
[175]
Ziaratban, M. and Faez, K. 2009. Non-Uniform slant estimation and correction for farsi/arabic handwritten words. Int. J. Doc. Anal. Recogn. 12, 4, 249--267.
[176]
Ziaratban, M., Faez, K., and Bagheri, F. 2009. FHT: An unconstraint farsi handwritten text database. In Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR). 281--285.
[177]
Zimmermann, M. and Bunke, H. 2004. N-Gram language models for offline handwritten text recognition. In Proceedings of the 9th International Workshop on Frontiers in Handwriting Recognition (IWFHR). 203--208.

    Published In

    ACM Computing Surveys  Volume 45, Issue 2
    February 2013
    417 pages
    ISSN:0360-0300
    EISSN:1557-7341
    DOI:10.1145/2431211

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 March 2013
    Accepted: 01 October 2011
    Revised: 01 October 2011
    Received: 01 July 2011
    Published in CSUR Volume 45, Issue 2

    Author Tags

    1. Arabic text recognition
    2. Handwriting recognition
    3. classifiers
    4. features
    5. optical character recognition

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • KACST NSTIP project 08-INF99-4 “Automatic Recognition of Handwritten Arabic Text (ARHAT)”
