Handwriting Recognition with Novelty

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12824))

Included in the following conference series:

International Conference on Document Analysis and Recognition

3163 Accesses
2 Citations
4 Altmetric

Abstract

This paper introduces an agent-centric approach to handle novelty in the visual recognition domain of handwriting recognition (HWR). An ideal transcription agent would rival or surpass human perception, being able to recognize known and new characters in an image, and detect any stylistic changes that may occur within or across documents. A key confound is the presence of novelty, which has continued to stymie even the best machine learning-based algorithms for these tasks. In handwritten documents, novelty can be a change in writer, character attributes, writing attributes, or overall document appearance, among other things. Instead of looking at each aspect independently, we suggest that an integrated agent that can process known characters and novelties simultaneously is a better strategy. This paper formalizes the domain of handwriting recognition with novelty, describes a baseline agent, introduces an evaluation protocol with benchmark data, and provides experimentation to set the state-of-the-art. Results show feasibility for the agent-centric approach, but more work is needed to approach human-levels of reading ability, giving the HWR community a formal basis to build upon as they solve this challenging problem.

This research was sponsored by the Defense Advanced Research Projects Agency (DARPA) and the Army Research Office (ARO) under multiple contracts/agreements including HR001120C0055, W911NF-20-2-0005,W911NF-20-2- 0004,HQ0034-19-D-0001, W911NF2020009. The views contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the DARPA or ARO, or the U.S. Government.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Enhancing handwritten text recognition accuracy with gated mechanisms

Article Open access 22 July 2024

Spatio-Temporal Handwriting Imitation

Adaptive Text Recognition Through Visual Matching

Notes

1.
The code for this paper will be made publicly available after publication at https://github.com/prijatelj/handwriting_recognition_with_novelty.
2.
The Supplemental Material is publicly available at https://arxiv.org/abs/2105.06582.

References

Augustin, E., Carré, M., Grosicki, E., Brodin, J.M., Geoffrois, E., Prêteux, F.: RIMES evaluation campaign for handwritten mail processing. In: IWFHR (2006)
Google Scholar
Bendale, A., Boult, T.E.: Towards open set deep networks. In: IEEE CVPR (2016)
Google Scholar
Boult, T.E., et al.: A Unifying Framework for Formal Theories of Novelty:Framework, Examples and Discussion. arXiv:2012.04226 [cs], December 2020. http://arxiv.org/abs/2012.04226
Boult, T.E., et al.: A unifying framework for formal theories of novelty: framework, examples and discussion. In: AAAI (2021)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE CVPR (2005)
Google Scholar
DARPA: Teaching AI systems to adapt to dynamic environments (2019). https://www.darpa.mil/news-events/2019-02-14. Accessed 11 Jan 2021
Fei, G., Liu, B.: Breaking the closed world assumption in text classification. In: NAACL-HLT (2016)
Google Scholar
Fiel, S., Kleber, F., Diem, M., Christlein, V., Louloudis, G., Nikos, S., Gatos, B.: ICDAR2017 competition on historical document writer identification (Historical-WI). In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). vol. 01, pp. 1377–1382 (November 2017). https://doi.org/10.1109/ICDAR.2017.225. iSSN: 2379-2140
Graves, A., Fernández, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: ICML (2006)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE CVPR, pp. 770–778 (2016)
Google Scholar
Langley, P.: Open-world learning for radically autonomous agents. In: AAAI (2020)
Google Scholar
van Lit, L.: Paleography: between erudition and computation. In: Among Digitized Manuscripts. Philology, Codicology, Paleography in a Digital World, pp. 102–131. Brill (2019)
Google Scholar
Lorigo, L.M., Govindaraju, V.: Offline Arabic handwriting recognition: a survey. IEEE T-PAMI 28(5), 712–724 (2006)
Article Google Scholar
Marti, U.V., Bunke, H.: The IAM-database: an English sentence database for offline handwriting recognition. Int. J. Doc. Anal. Recognit. 5(1), 39–46 (2002)
Article Google Scholar
Ontario Ministry of Education: A Guide to Effective Literacy Instruction, Grades 4 to 6: A Multivolume Resource from the Ministry of Education. Volume one, Foundations of literacy instruction for the junior learner, p. 37. Ontario Ministry of Education (2006). https://books.google.com/books?id=Y8F4oAEACAAJ
Rayner, K., Pollatsek, A., Ashby, J., Clifton Jr, C.: Psychology of Reading. Psychology Press, London (2012)
Google Scholar
Rudd, E.M., Jain, L.P., Scheirer, W.J., Boult, T.E.: The extreme value machine. IEEE T-PAMI 40(3), 762–768 (2017)
Article Google Scholar
Ruff, L., et al.: A unifying review of deep and shallow anomaly detection. arXiv preprint arXiv:2009.11732 (2020)
Russakovsky, O.: Imagenet large scale visual recognition challenge. IJCV 115(3), 211–252 (2015)
Article MathSciNet Google Scholar
Sanchez, J.A., Romero, V., Toselli, A.H., Vidal, E.: ICFHR2016 competition on handwritten text recognition on the read dataset. In: ICFHR (2016)
Google Scholar
Scheirer, W.J., Jain, L.P., Boult, T.E.: Probability models for open set recognition. IEEE Trans. Pattern Anal. Mach. Intell. 36(11), 2317–2324 (2014)
Article Google Scholar
Scheirer, W.J., de Rezende Rocha, A., Sapkota, A., Boult, T.E.: Toward open set recognition. IEEE T-PAMI 35(7), 1757–1772 (2012)
Article Google Scholar
Shi, B., Bai, X., Yao, C.: An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE T-PAMI 39(11), 2298–2304 (2017)
Google Scholar
Smith, D.A., Cordell, R.: A research agenda for historical and multilingual opticalcharacter recognition. Technical report, Northeastern University (2018)
Google Scholar
Studer, L., et al.: A comprehensive study of imagenet pre-training for historical document image analysis. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 720–725 (September 2019). https://doi.org/10.1109/ICDAR.2019.00120. iSSN: 2379-2140
Stutzmann, D.: Clustering of medieval scripts through computer image analysis: towards an evaluation protocol. Digit. Mediev. 10 (2016). https://doi.org/10.16995/dm.61. http://journal.digitalmedievalist.org//articles/10.16995/dm.61/. ISSN: 1715-0736
Wigington, C., Stewart, S., Davis, B., Barrett, B., Price, B., Cohen, S.: Data augmentation for recognition of handwritten words and lines using a CNN-LSTM network. In: ICDAR (2017)
Google Scholar
Yampolskiy, R.V., Govindaraju, V.: Behavioural biometrics: a survey and classification. Int. J. Biom. 1(1), 81–113 (2008)
Google Scholar
Zhang, H., Patel, V.M.: Sparse representation-based open set recognition. IEEE T-PAMI 39(8), 1690–1696 (2016)
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Notre Dame, Notre Dame, IN, 46556, USA
Derek S. Prijatelj, Samuel Grieggs & Walter J. Scheirer
PAR Government, 421 Ridge St, Rome, NY, 13440, USA
Futoshi Yumoto & Eric Robertson

Authors

Derek S. Prijatelj
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Grieggs
View author publications
You can also search for this author in PubMed Google Scholar
Futoshi Yumoto
View author publications
You can also search for this author in PubMed Google Scholar
Eric Robertson
View author publications
You can also search for this author in PubMed Google Scholar
Walter J. Scheirer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Walter J. Scheirer .

Editor information

Editors and Affiliations

Universitat Autònoma de Barcelona, Barcelona, Spain
Josep Lladós
Lehigh University, Bethlehem, PA, USA
Daniel Lopresti
Kyushu University, Fukuoka-shi, Japan
Seiichi Uchida

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 556 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Prijatelj, D.S., Grieggs, S., Yumoto, F., Robertson, E., Scheirer, W.J. (2021). Handwriting Recognition with Novelty. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12824. Springer, Cham. https://doi.org/10.1007/978-3-030-86337-1_33

Download citation

DOI: https://doi.org/10.1007/978-3-030-86337-1_33
Published: 02 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86336-4
Online ISBN: 978-3-030-86337-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Handwriting Recognition with Novelty

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others