Abstract.
We describe the design of on-line handwritten Japanese character pattern databases, software tools for pattern collection and verification, and analyses of collected patterns. Two databases containing over 3 million patterns were compiled: one with 120 people contributing 12,000 patterns each and another with 163 participants contributing 10,000 patterns each. Patterns were collected mostly in their sentential contexts and verified by machine and human inspection. Their analyses reveal greater variations in stroke count for characters having many strokes, with people generally using fewer strokes; they additionally reveal that stroke order variations are generally caused by common habits and added strokes.
Similar content being viewed by others
References
Guyon I, Schomaker L, Plamondon R, Liberman M, Janet S (1994) UNIPEN project of on-line data exchange and recognizer benchmarks. In: Proceedings of the 12th ICPR, 2:29-33
Hull JJ (1994) A database for handwritten text recognition research IEEE Trans PAMI 16(5):550-554
Jaeger S, Nakagawa M (2001) Two on-line Japanese character databases in Unipen format In: Proceedings of the 6th ICDAR, pp 566-570
Matsumoto K, Fukushima T, Nakagawa M (2001) Collection and analysis of on-line handwritten Japanese character patterns. In: Proceedings of the 6th ICDAR, pp 496-500
Nakagawa M (1990) Non-keyboard input of Japanese text - on-line recognition of handwritten characters as the most hopeful approach. J Inf Process 13(1):15-34
Nakagawa M, Akiyama K, Tu LV, Homma A, Higashiyama T (1996) Robust and highly customizable recognition of on-line handwritten Japanese characters. In: Proceedings of the 13th ICPR, 3:269-273
Nakagawa M, Tu LV (1996) Structural learning of character patterns for on-line recognition of handwritten Japanese characters. In: Perner P(eds) Advances in structural and syntactic pattern recognition. Lecture notes in computer science, vol 1121. Springer, Berlin Heidelberg New York, pp 180-188
Nakagawa M, Higashiyama T, Yamanaka Y, Sawada S, Higashigawa L, Akiyama K (1997) On-line handwritten character pattern database sampled in a sequence of sentences without any writing instructions. In: Proceedings of the 4th ICDAR, pp 376-381
Saito T, Yamada H, Yamamoto K (1985) On the data base ETL9 of handprinted characters in JIS Chinese characters and its analysis (in Japanese). Trans IECE Jpn J68-D(4):757-764
Smith SJ, Bourgoin MO, Sims K, Voorhees HL (1994) Handwritten character classification using nearest neighbor in large databases. IEEE Trans PAMI 16(9):915-919
Tanaka H, Nakajima K, Ishigaki K, Akiyama K, Nakagawa M (1999) Hybrid pen-input character recognition system based on integration of on-line-off-line recognition. In: Proceedings of the 5th ICDAR, pp 209-212
Tappert TC, Suen CY, Wakahara T (1990) The state of the art in on-line handwriting recognition IEEE Trans PAMI 12(8):787-808
Velek O, Jaeger S, Nakagawa M (2002) A new warping technique for normalizing likelihood of multiple classifiers and its effectiveness in combined on-line/off-line Japanese character recognition. In: Proceedings of the 8th IWFHR, pp 177-182
Viard-Gaudin C, Lallican PM, Knerr S, Binter P (1999) The ireste on-off (Ironoff) handwritten image database. In: Proceedings of the 5th ICDAR, pp 455-458
Yokota T, Kuzunuki S, Gunji K, Hamada N (2001) User adaptation in handwriting recognition by an automatic learning algorithm. In: Proceedings of HCI International 2001, 1:455-459
Author information
Authors and Affiliations
Additional information
Received: 14 December 2002, Accepted: 26 October 2003, Published online: 22 April 2004
Rights and permissions
About this article
Cite this article
Nakagawa, M., Matsumoto, K. Collection of on-line handwritten Japanese character pattern databases and their analyses. IJDAR 7, 69–81 (2004). https://doi.org/10.1007/s10032-004-0125-4
Issue Date:
DOI: https://doi.org/10.1007/s10032-004-0125-4