Abstract
We present our current work on building a deep learning architecture for the offline handwritten character recognition problem. The proposed system is based on training a deep Convolutional Neural Network (CNN) to recognize handwritten characters, using a new synthetic character database derived from UNIPEN dataset. The presented approach is inspired in some successfully-used neural architectures for image classification, specially the VGG-CNN. Our system reads each word with the help of a sliding window in a similar way to how humans do. An innovative feature of our proposal is using a synthetic character database specifically built, in a optimized way, to identify the characters as component elements of the words. Experiments with this new training synthetic dataset produced recognition rates of 98.4% for uppercase and 96.3% for lowercase, respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bengio, Y., LeCun, Y., Nohl, C., Burges, C.: A NN/HMM hybrid for on-line handwriting recognition. Neural Comput. 7(6), 1289–1303 (1995)
Ciresan, D.C., Meier, U., Gambardella, L.M., Schmidhuber, J.: Convolutional neural network committees for handwritten character classification. In: 11th International Conference on Document Analysis and Recognition (ICDAR), pp. 1250–1254 (2011)
Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Conference on Computer Vision and Pattern Recognition (CVPR) (2012)
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
Deng, L.: The MNIST database of handwirtten digit images for machine learning research. IEEE Sig. Process. Mag. 29(6), 141–142 (2012)
Graves, A., Fernandez, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: 23rd International Conference on Machine Learning (ICML), pp. 369–376 (2006)
Graves, A., Schmidhuber, J.: Offline handwriting recognition with multidimensional recurrent neural networks. In: 21th Advances in Neural Information Processing Systems (NIPS), pp. 545–552 (2008)
Grother, P.J.: NIST Special Database 19 - Handprinted Forms and Characters Database, 2nd edn. National Institute of Standards and Technology, Gaithersburg (2016). User’s Guide
Guyon, I., Schomaker, L., Plamondon, R., Liberman, M., Janet, S.: Unipen project of on-line data exchange and recognizer benchmarks. In: 12th Internacional Conference on Pattern Recognition, vol. 2, pp. 29–33. IEEE (1994)
Kaltenmeier, A., Caesar, T., Gloger, J.M., Mandler, E.: Sophisticated topology of hidden Markov models for cursive script recognition. In: 2nd International Conference on Document Analysis and Recognition (ICDAR), pp. 139–142. IEEE (1993)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: 26th Advances in Neural Information Processing Systems (NIPS), pp. 1097–1105 (2012)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Nesterov, Y.: A method of solving a convex programming problem with convergence rate O(1/sqr(k)). Sov. Math. Dokl. 27, 372–376 (1983)
Sayre, K.M.: Machine recognition of handwritten words: a project report. Pattern Recogn. 5(3), 213–228 (1973)
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)
Symonian, K., Zisserman, A.: Very deep convolutional networks for large scale image recognition. In: 3rd International Conference on Learning Representations (ICLR) (2015)
Van der Maaten, L.J.P. : A new benchmark dataset for handwritten character recognition. Tilburg University Technical Report, TiCC TR 2009-002 (2009)
Kim, Y., Jernite, Y., Sontag, D., Rush, A.M.: Character-Aware Neural Language Models, CoRR (2015)
Yuan, A., Bai, G., Jiao, L., Liu, Y.: Offline handwritten English character recognition based on convolutional neural network. In: International Workshop Document Analysis Systems, pp. 125–129 (2012)
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: 29th Advances in Neural Information Processing Systems (NIPS), vol. 28 (2015)
Acknowledgements
This work was funded by the Spanish Ministry of Economy and Competitiveness project number TIN2014-57458-R and by the URJC-Banco de Santander Excellence Research Groups grant number 30VCPIGI09.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Sueiras, J., Ruiz, V., Sánchez, Á., Vélez, J.F. (2017). Using a Synthetic Character Database for Training Deep Learning Models Applied to Offline Handwritten Recognition. In: Madureira, A., Abraham, A., Gamboa, D., Novais, P. (eds) Intelligent Systems Design and Applications. ISDA 2016. Advances in Intelligent Systems and Computing, vol 557. Springer, Cham. https://doi.org/10.1007/978-3-319-53480-0_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-53480-0_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-53479-4
Online ISBN: 978-3-319-53480-0
eBook Packages: EngineeringEngineering (R0)