Abstract
Digitizing different formats of digits has multiple applications like door number detection, license plate detection, credit card number detection, etc. Specifically, handwritten digit recognition has gained so much popularity because of the vast applications such as recognizing ZIP codes in postal documents, amount entered in check leafs, etc. The handwritten digits are not always of the similar size, width, orientation, as they differ because of different writing styles of the persons, different writing instruments, etc. This makes the recognition of handwritten digits a tough and tricky task. The main problem occurs during the classification of the digits of similarity such as 1 and 7, 5 and 6, 3 and 8, etc. Recognizing digits from unconstrained natural images are also relatively difficult because of its large appearance variability. Printed digit recognition has been virtually solved by machine learning researchers. This work does not focus on printed digit recognition, but aims to learn the features from printed digits to recognize handwritten and natural image digits better. In this work, we are proposing DIGI-Net, a deep convolutional network, which has the ability to learn common features from three different formats (handwritten, natural images, printed font) of digits and to recognize them. The experimentation is done on MNIST, CVL single digit dataset, digits of Chars74K dataset and our proposed DIGI-Net achieved an accuracy of 99.11%, 93.29% and 97.60% respectively.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Arivazhagan S, Priyadharshini RA, Sangeetha L (2017) Automatic target recognition in SAR images using quaternion wavelet transform and principal component analysis. Int J Comput Vis Robot 7(3):314–334
Parveen Nusrat, Zaidi Sadaf, Danish Mohammad (2017) Support vector regression prediction and analysis of the copper (II) biosorption efficiency. Indian Chem Eng 59(4):295–311. https://doi.org/10.1080/00194506.2016.1270778
Parveen Nusrat, Zaidi Sadaf, Danish Mohammad (2017) Development of SVR-based model and comparative analysis with MLR and ANN models for predicting the sorption capacity of Cr(VI). Process Saf Environ Prot 107:428–437. https://doi.org/10.1016/j.psep.2017.03.007
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the 2014 IEEE conference on computer vision and pattern recognition, pp 580–587
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th international conference on neural information processing systems, NIPS’12, vol 1. pp 1097–1105
Ouyang W, Luo P, Zeng X, Qiu S, Tian Y, Li H, Yang S, Wang Z, Xiong Y, Qian C, Zhu Z, Wang R, Loy CC, Wang X, Tang X (2014) Deepid-net: multistage and deformable deep convolutional neural networks for object detection. arXiv preprint arXiv:1409.3505
Sun Y, Chen Y, Wang X, Tang X (2014) Deep learning face representation by joint identification-verification. In: Proceedings of the 27th international conference on neural information processing systems, vol 2. pp 1988–1996
Kim KI, Lee KM (2018) Context-aware information provisioning for vessel traffic service using rule-based and deep learning techniques. Int J Fuzzy Log Intell Syst 18(1):13–19. https://doi.org/10.5391/IJFIS.2018.18.1.13
Lee HW, Kim NR, Lee JH (2017) Deep neural network self-training based on unsupervised learning and dropout. Int J Fuzzy Log Intell Syst 17(1):1–9. https://doi.org/10.5391/IJFIS.2017.17.1.1
Ahila Priyadharshini R, Arivazhagan S, Arun M, Mirnalini A (2019) Maize leaf disease classification using deep convolutional neural networks. Neural Comput Appl. https://doi.org/10.1007/s00521-019-04228-3
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 779–788
Yang Y, Lijia X, Chen C (2011) English character recognition based on feature combination. Proc Eng 24:159–164
Yin F, Wang QF, Zhang XY, Liu CL (2013) ICDAR 2013 Chinese handwriting recognition competition. In: 2013 12th international conference on document analysis and recognition (ICDAR), pp 1464–1470
Zhu B, Zhou XD, Liu CL, Nakagawa M (2010) A robust model for on-line handwritten Japanese text recognition. IJDAR 13(2):121–131
Chaudhuri BB, Pal U (1998) A complete printed Bangla OCR system. Pattern Recogn 31:531–549
Fukushima K (1980) Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36(4):193–202
Le Cun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Bai J, Chen Z, Feng B, Xu B (2014) Image character recognition using deep convolutional neural network learned from different languages. In: IEEE international conference on image processing (ICIP), pp 2560–2564
Gatto BB, dos Santos EM, Fukui K (2017) Subspace-based convolutional network for handwritten character recognition. In: 2017 14th IAPR international conference on document analysis and recognition (ICDAR)
Fanany MI (2017) Handwriting recognition on form document using convolutional neural network and support vector machines. In: 2017 5th international conference on information and communication technology (ICoIC7)
Yang W, Jin L, Tao D, Xie Z, Feng Z (2016) DropSample: a new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten Chinese character recognition. Pattern Recogn 58:190–203. https://doi.org/10.1016/j.patcog.2016.04.007
Majumder HNK, Al Behadili (2016) Classification algorithms for determining handwritten digit. Iraq J Electr Electron Eng 12(1):96–102
Chen F, Chen N, Mao H, Hu H (2018) Assessing four neural networks on handwritten digit recognition dataset (MNIST). arXiv:1811.08278
Majumder S, von der Malsburg C, Richhariya A, Bhanot S (2018) Handwritten digit recognition by elastic matching. J Comput 13(9):1067–1074
https://www.cs.toronto.edu/~tijmen/csc321/slides/lecture_slides_lec6.pdf
MNIST dataset. http://yann.lecun.com/exdb/mnist/
CVL single digit dataset. http://caa.tuwien.ac.at/cvl/research/icdar2013-hdrc/
Chars74K dataset. http://www.ee.surrey.ac.uk/CVSSP/demos/chars74k/
Courbariaux M, Bengio Y, David JP (2015) BinaryConnect: training deep neural networks with binary weights during propagations. arXiv:1511.00363
Dundar A, Jin J, Culurciello E (2015) Convolutional clustering for unsupervised learning. arXiv preprint arXiv:1511.06241
Salakhutdinov R, Hinton G (2009) Deep Boltzmann machines. In: Proceedings of international conference on artificial intelligence and statistics, pp 448–455
Diem M, Fiel S, Garz S, Keglevic M, Kleber F, Sablatnig R (2013) ICDAR 2013 competition on handwritten digit recognition (HDRC 2013). In: 2013 12th international conference on document analysis and recognition
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Madakannu, A., Selvaraj, A. DIGI-Net: a deep convolutional neural network for multi-format digit recognition. Neural Comput & Applic 32, 11373–11383 (2020). https://doi.org/10.1007/s00521-019-04632-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-019-04632-9