Abstract
The National Archives of Singapore holds a large number of double-sided handwritten archival documents. Over long periods of storage, ink has seeped through the pages of these documents, so that handwriting from the back of a page appears as interfering images on the front. This paper addresses the problem of segmenting handwriting from both sides of a document by means of a wavelet approach. We first match both sides of a document page so that each interfering stroke is mapped to the corresponding stroke originating from the reverse side. This allows the foreground and interfering strokes to be identified. A wavelet reconstruction process then iteratively enhances the foreground strokes and smears the interfering strokes, strengthening the ability of an improved Canny edge detector to discriminate against the interfering strokes. Experimental results confirm the validity of the wavelet approach.
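The two stages described in the abstract (matching the two sides, then reweighting wavelet detail) can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the two scans are already registered, equal in shape, with even dimensions and dark ink on a light background. It substitutes a single-level Haar transform for the paper's wavelet, performs one pass rather than an iterative reconstruction, and omits the edge detection step. The function names, `ink_threshold`, `boost`, and `damp` are all hypothetical.

```python
import numpy as np

def classify_strokes(front, back, ink_threshold=128):
    """Split front-side ink into foreground and interfering masks.

    Assumes registered grayscale scans of equal shape (hypothetical setup).
    """
    mirrored_back = np.fliplr(back)  # the reverse side is seen mirrored
    front_ink = front < ink_threshold
    back_ink = mirrored_back < ink_threshold
    # Ink that is darker on the reverse side is treated as seepage.
    interfering = front_ink & back_ink & (mirrored_back < front)
    foreground = front_ink & ~interfering
    return foreground, interfering

def haar_forward(x):
    """One-level 2D Haar transform (averaging form) of an even-sized array."""
    lo = (x[:, 0::2] + x[:, 1::2]) / 2
    hi = (x[:, 0::2] - x[:, 1::2]) / 2
    ll = (lo[0::2] + lo[1::2]) / 2
    lh = (lo[0::2] - lo[1::2]) / 2
    hl = (hi[0::2] + hi[1::2]) / 2
    hh = (hi[0::2] - hi[1::2]) / 2
    return ll, lh, hl, hh

def haar_inverse(ll, lh, hl, hh):
    """Exact inverse of haar_forward."""
    h, w = ll.shape
    lo = np.empty((2 * h, w))
    hi = np.empty((2 * h, w))
    lo[0::2], lo[1::2] = ll + lh, ll - lh
    hi[0::2], hi[1::2] = hl + hh, hl - hh
    x = np.empty((2 * h, 2 * w))
    x[:, 0::2], x[:, 1::2] = lo + hi, lo - hi
    return x

def pool(mask):
    """Reduce a pixel mask to coefficient resolution (any pixel in a 2x2 block)."""
    return mask[0::2, 0::2] | mask[0::2, 1::2] | mask[1::2, 0::2] | mask[1::2, 1::2]

def reweight(front, foreground, interfering, boost=1.5, damp=0.5):
    """Amplify detail near foreground strokes and attenuate it near seepage."""
    ll, lh, hl, hh = haar_forward(front.astype(float))
    gain = np.ones_like(ll)
    gain[pool(interfering)] = damp
    gain[pool(foreground)] = boost  # foreground wins in mixed blocks
    out = haar_inverse(ll, lh * gain, hl * gain, hh * gain)
    return np.clip(out, 0, 255)
```

Scaling the detail coefficients while leaving the approximation band untouched changes local contrast only: strokes flagged as interfering are pulled toward the local mean, while foreground strokes are pushed away from it. An edge detector applied afterwards would then see weaker edges along the seepage.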
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tan, C.L., Cao, R., Shen, P. (2001). Wavelet Applications in Segmentation of Handwriting in Archival Documents. In: Tang, Y.Y., Yuen, P.C., Li, Ch., Wickerhauser, V. (eds) Wavelet Analysis and Its Applications. WAA 2001. Lecture Notes in Computer Science, vol 2251. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45333-4_23
DOI: https://doi.org/10.1007/3-540-45333-4_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43034-6
Online ISBN: 978-3-540-45333-8
eBook Packages: Springer Book Archive