Nothing Special   »   [go: up one dir, main page]

Skip to main content

Advertisement

Log in

DIGI-Net: a deep convolutional neural network for multi-format digit recognition

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Digitizing different formats of digits has multiple applications like door number detection, license plate detection, credit card number detection, etc. Specifically, handwritten digit recognition has gained so much popularity because of the vast applications such as recognizing ZIP codes in postal documents, amount entered in check leafs, etc. The handwritten digits are not always of the similar size, width, orientation, as they differ because of different writing styles of the persons, different writing instruments, etc. This makes the recognition of handwritten digits a tough and tricky task. The main problem occurs during the classification of the digits of similarity such as 1 and 7, 5 and 6, 3 and 8, etc. Recognizing digits from unconstrained natural images are also relatively difficult because of its large appearance variability. Printed digit recognition has been virtually solved by machine learning researchers. This work does not focus on printed digit recognition, but aims to learn the features from printed digits to recognize handwritten and natural image digits better. In this work, we are proposing DIGI-Net, a deep convolutional network, which has the ability to learn common features from three different formats (handwritten, natural images, printed font) of digits and to recognize them. The experimentation is done on MNIST, CVL single digit dataset, digits of Chars74K dataset and our proposed DIGI-Net achieved an accuracy of 99.11%, 93.29% and 97.60% respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Arivazhagan S, Priyadharshini RA, Sangeetha L (2017) Automatic target recognition in SAR images using quaternion wavelet transform and principal component analysis. Int J Comput Vis Robot 7(3):314–334

    Article  Google Scholar 

  2. Parveen Nusrat, Zaidi Sadaf, Danish Mohammad (2017) Support vector regression prediction and analysis of the copper (II) biosorption efficiency. Indian Chem Eng 59(4):295–311. https://doi.org/10.1080/00194506.2016.1270778

    Article  Google Scholar 

  3. Parveen Nusrat, Zaidi Sadaf, Danish Mohammad (2017) Development of SVR-based model and comparative analysis with MLR and ANN models for predicting the sorption capacity of Cr(VI). Process Saf Environ Prot 107:428–437. https://doi.org/10.1016/j.psep.2017.03.007

    Article  Google Scholar 

  4. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the 2014 IEEE conference on computer vision and pattern recognition, pp 580–587

  5. Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of the 25th international conference on neural information processing systems, NIPS’12, vol 1. pp 1097–1105

  6. Ouyang W, Luo P, Zeng X, Qiu S, Tian Y, Li H, Yang S, Wang Z, Xiong Y, Qian C, Zhu Z, Wang R, Loy CC, Wang X, Tang X (2014) Deepid-net: multistage and deformable deep convolutional neural networks for object detection. arXiv preprint arXiv:1409.3505

  7. Sun Y, Chen Y, Wang X, Tang X (2014) Deep learning face representation by joint identification-verification. In: Proceedings of the 27th international conference on neural information processing systems, vol 2. pp 1988–1996

  8. Kim KI, Lee KM (2018) Context-aware information provisioning for vessel traffic service using rule-based and deep learning techniques. Int J Fuzzy Log Intell Syst 18(1):13–19. https://doi.org/10.5391/IJFIS.2018.18.1.13

    Article  Google Scholar 

  9. Lee HW, Kim NR, Lee JH (2017) Deep neural network self-training based on unsupervised learning and dropout. Int J Fuzzy Log Intell Syst 17(1):1–9. https://doi.org/10.5391/IJFIS.2017.17.1.1

    Article  Google Scholar 

  10. Ahila Priyadharshini R, Arivazhagan S, Arun M, Mirnalini A (2019) Maize leaf disease classification using deep convolutional neural networks. Neural Comput Appl. https://doi.org/10.1007/s00521-019-04228-3

    Article  Google Scholar 

  11. Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 779–788

  12. Yang Y, Lijia X, Chen C (2011) English character recognition based on feature combination. Proc Eng 24:159–164

    Article  Google Scholar 

  13. Yin F, Wang QF, Zhang XY, Liu CL (2013) ICDAR 2013 Chinese handwriting recognition competition. In: 2013 12th international conference on document analysis and recognition (ICDAR), pp 1464–1470

  14. Zhu B, Zhou XD, Liu CL, Nakagawa M (2010) A robust model for on-line handwritten Japanese text recognition. IJDAR 13(2):121–131

    Article  Google Scholar 

  15. Chaudhuri BB, Pal U (1998) A complete printed Bangla OCR system. Pattern Recogn 31:531–549

    Article  Google Scholar 

  16. Fukushima K (1980) Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36(4):193–202

    Article  Google Scholar 

  17. Le Cun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324

    Article  Google Scholar 

  18. Bai J, Chen Z, Feng B, Xu B (2014) Image character recognition using deep convolutional neural network learned from different languages. In: IEEE international conference on image processing (ICIP), pp 2560–2564

  19. Gatto BB, dos Santos EM, Fukui K (2017) Subspace-based convolutional network for handwritten character recognition. In: 2017 14th IAPR international conference on document analysis and recognition (ICDAR)

  20. Fanany MI (2017) Handwriting recognition on form document using convolutional neural network and support vector machines. In: 2017 5th international conference on information and communication technology (ICoIC7)

  21. Yang W, Jin L, Tao D, Xie Z, Feng Z (2016) DropSample: a new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten Chinese character recognition. Pattern Recogn 58:190–203. https://doi.org/10.1016/j.patcog.2016.04.007

    Article  Google Scholar 

  22. Majumder HNK, Al Behadili (2016) Classification algorithms for determining handwritten digit. Iraq J Electr Electron Eng 12(1):96–102

    Article  Google Scholar 

  23. Chen F, Chen N, Mao H, Hu H (2018) Assessing four neural networks on handwritten digit recognition dataset (MNIST). arXiv:1811.08278

  24. Majumder S, von der Malsburg C, Richhariya A, Bhanot S (2018) Handwritten digit recognition by elastic matching. J Comput 13(9):1067–1074

    Article  Google Scholar 

  25. https://www.cs.toronto.edu/~tijmen/csc321/slides/lecture_slides_lec6.pdf

  26. MNIST dataset. http://yann.lecun.com/exdb/mnist/

  27. CVL single digit dataset. http://caa.tuwien.ac.at/cvl/research/icdar2013-hdrc/

  28. Chars74K dataset. http://www.ee.surrey.ac.uk/CVSSP/demos/chars74k/

  29. Courbariaux M, Bengio Y, David JP (2015) BinaryConnect: training deep neural networks with binary weights during propagations. arXiv:1511.00363

  30. Dundar A, Jin J, Culurciello E (2015) Convolutional clustering for unsupervised learning. arXiv preprint arXiv:1511.06241

  31. Salakhutdinov R, Hinton G (2009) Deep Boltzmann machines. In: Proceedings of international conference on artificial intelligence and statistics, pp 448–455

  32. Diem M, Fiel S, Garz S, Keglevic M, Kleber F, Sablatnig R (2013) ICDAR 2013 competition on handwritten digit recognition (HDRC 2013). In: 2013 12th international conference on document analysis and recognition

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Arun Madakannu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Madakannu, A., Selvaraj, A. DIGI-Net: a deep convolutional neural network for multi-format digit recognition. Neural Comput & Applic 32, 11373–11383 (2020). https://doi.org/10.1007/s00521-019-04632-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-019-04632-9

Keywords

Navigation