research-article

An approach for Bangla and Devanagari video text recognition

Authors:

Purnendu Banerjee,

B. B. ChaudhuriAuthors Info & Claims

MOCR '13: Proceedings of the 4th International Workshop on Multilingual OCR

Article No.: 8, Pages 1 - 5

https://doi.org/10.1145/2505377.2505389

Published: 24 August 2013 Publication History

Abstract

Extraction and recognition of Bangla text from video frame images is challenging due to fonts type and style variation, complex color background, low-resolution, low contrast etc. In this paper, we propose an algorithm for extraction and recognition of Bangla and Devanagari text form video frames with complex background. Here, a two-step approach has been proposed. After text localization, the text line is segmented into words using information based on line contours. First order gradient values of the text blocks are used to find the word gap. Next, an Adaptive SIS binarization technique is applied on each word. Next this binarized text block is sent to a state of the art OCR for recognition.

References

[1]

K. Jung, K. I. Kim, and A. K. Jain, "Text information extraction in images and video: a survey", Pattern Recognition, Vol. 37, No. 5, pp. 977--997, 2004.

[2]

D. Chen and J. M. Odobez, "Video text recognition using sequential Monte Carlo and error voting method", Pattern Recognition Letters, pp. 1386--1403, 2005.

Digital Library

[3]

A. K. Jain and B. Yu., "Automatic Text Location in Images and Video Frames", Pattern Recognition, vol. 31, pp. 2055--2076, 1998.

[4]

Q. Ye., Q. Huang, W. Gao and D. Zhao., "Fast and robust text detection in images and video frames", Image and Vision Computing, vol. 23, pp. 565--576, 2005.

Digital Library

[5]

K. Jung and J. H. Han, "Hybrid Approach to Efficient Text Extraction in Complex Color Images", Pattern Recognition Letters, vol. 25, pp. 679--699, 2004.

Digital Library

[6]

C. Liu, C. Wang and R. Dai., "Text Detection in Images Based on Unsupervised Classification of Edge-based Features", Proc. ICDAR, pp. 610--614, 2005.

Digital Library

[7]

P. Shivakumara, W. Huang and C. L. Tan., "An Efficient Edge based Technique for Text Detection in Video Frames", Proc. DAS, pp. 307--314, 2008.

Digital Library

[8]

M. Cai, J. Song and M. R. Lyu, "A New Approach for Video Text Detection", Proc. ICIP, pp. 117--120, 2002.

[9]

E. K. Wong and M. Chen., "A new robust algorithm for video text extraction", Pattern Recognition, vol. 36, pp. 1397--1406, 2003.

[10]

T. Q. Phan, P. Shivakumara and C. L Tan, "A Laplacian Method for Video Text Detection", Proc. ICDAR, pp. 66--70, 2009.

Digital Library

[11]

N. Sharma, P. Shivakumara, U. Pal, M. Blumenstein and C. L. Tan, "A New Method for Arbitrarily-Oriented Text Detection in Video", Proc. DAS, pp. 74--78, 2012.

Digital Library

[12]

J. Zang and R. Kasturi, "Extraction of Text Objects in Video Documents: Recent Progress", Proc. DAS, pp. 5--17, 2008.

Digital Library

[13]

H. Li, D. Doermann and O. Kia, "Automatic Text Detection and Tracking in Digital Video", IEEE Transactions on Image Processing, vol. 9, pp. 147--156, 2000.

Digital Library

[14]

P. Shivakumara, T. Q. Phanand C. L Tan, "A Robust Wavelet Transform Based Technique for Video Text Detection", Proc. ICDAR, pp. 1285--1289, 2009.

Digital Library

[15]

R. Lienhart and A. Wernicke, "Localizing and segmenting text in images and videos", IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, No. 4, pp. 256--268, 2002.

Digital Library

[16]

J. Lim, J. Park, and G. G. Medioni, "Text segmentation in color images using tensor voting", Image and Vision Computing, vol.25, pp. 671--685, 2007.

Digital Library

[17]

M. Su. Cho, Jae-Hyun Seok, S. Lee, and J. H. Kim, "Scene Text Extraction by Superpixel CRFs Combining Multiple Character Features", Proc. ICDAR, pp.1034--1038, 2011.

Digital Library

[18]

H. Zhang, C. Liu, C. Yang, X. Ding, and K. Wang, "An Improved Scene Text Extraction Method Using Conditional Random Field and Optical Character Recognition", Proc. ICDAR, pp.708--712, 2011.

Digital Library

[19]

P. Shivakumara, S. Bhowmick, B. Su, C. L. Tan, U. Pal, "A New Gradient based character segmentation Method for Video text Recognition", Proc. ICDAR, pp.-126--130, 2011.

Digital Library

[20]

T. Q. Phan, P. Shivakumara, B. Su, and C. L. Tan, "A Gradient Vector Flow-Based Method for Video Character Segmentation", Proc. ICDAR, pp.1024--1028, 2011.

Digital Library

[21]

D. Rajendran, P. Shivakumara, B. Su, S. Lu, and C. L. Tan, "A new Fourier-Moments based Video Word and Character Extraction Method for recognition", Proc. ICDAR, pp.1165--1169, 2011.

Digital Library

[22]

Z. Zhou, L. Li, C. L. Tan, "Edge based Binarization for video text images", Proc. ICPR, pp.133--136, 2010.

Digital Library

[23]

Toru Wakahara and Kohei Kita, "Binarization of Color Character Strings in Scene Images Using K-Means Clustering and Support Vector Machines", Proc. ICDAR, pp.274--278, 2011.

Digital Library

[24]

K. Ntirogiannis, B. Gatos, and I. Pratikakis "Binarization of Textual Content in Video Frames", Proc. ICDAR, pp.673--677, 2011.

Digital Library

[25]

J. Kittler, J. Illingworth, J. Föglein, "Threshold selection based on a simple image statistic", Computer Vision, Graphics, and Image Processing, Vol. 30, No. 2, pp. 125--147, 1985.

[26]

S. Ghosh, P. K. Bora, S. Das and B. B. Chaudhuri, "Development of an Assamese OCR using Bangla OCR", Proc. DAR (http://mile.ee.iisc.ernet.in/dar2012/), pp. 68--73, 2012.

Digital Library

[27]

P. Banerjee and B. B. Chaudhuri, "Video Text Localization using Wavelet and Shearlet Transforms", Available at: http://arxiv.org/abs/1307.4990.

Cited By

Hossain MRahman T(2023)A crowdsource based framework for Bengali scene text data collection and detectionComputers and Electrical Engineering10.1016/j.compeleceng.2023.109025112(109025)Online publication date: Dec-2023
https://doi.org/10.1016/j.compeleceng.2023.109025
Halder MKundu SHasan M(2023)An Improved Method to Recognize Bengali Handwritten Characters Using CNNProceedings of International Conference on Data Science and Applications10.1007/978-981-19-6634-7_43(611-624)Online publication date: 7-Feb-2023
https://doi.org/10.1007/978-981-19-6634-7_43
Akhter SRege P(2021)Multi-task learning for pre-processing of printed Devanagari document images with hyper-parameter optimization of the deep architecture using Taguchi methodSādhanā10.1007/s12046-021-01664-746:3Online publication date: 26-Jul-2021
https://doi.org/10.1007/s12046-021-01664-7
Show More Cited By

Recommendations

A hierarchical approach to recognition of handwritten Bangla characters

A novel hierarchical approach is presented here for optical character recognition (OCR) of handwritten Bangla words. Instead of dealing with isolated characters as found in selected works [T.K. Bhowmik, U. Bhattacharya, S.K. Parui, Recognition of Bangla ...
Multi-oriented Bangla and Devnagari text recognition

There are printed complex documents where text lines of a single page may have different orientations or the text lines may be curved in shape. As a result, it is difficult to detect the skew of such documents and hence character segmentation and ...
Offline recognition of handwritten Bangla characters: an efficient two-stage approach

The present work deals with recognition of handwritten characters of Bangla, a major script of the Indian sub-continent. The main contributions presented here are (a) generation of a database of handwritten basic characters of Bangla and (b) development ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

MOCR '13: Proceedings of the 4th International Workshop on Multilingual OCR

August 2013

99 pages

ISBN:9781450321143

DOI:10.1145/2505377

General Chairs:
Venu Govindaraju
University at Buffalo
,
Prem Natarajan
Information Sciences Institute
,
Santanu Chaudhury
IIT Delhi, India
,
Daniel Lopresti
Lehigh University
,
Program Chairs:
Srirangaraj Setlur
University at Buffalo
,
Huaigu Cao
Raytheon BBN Technologies

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

BBN Technologies

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2013

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

DIT
Govt. of India
Society for Natural Language Technology Research, Kolkata

Conference

MOCR '13

Sponsor:

MOCR '13: 4th International Workshop on Multilingual OCR

August 24, 2013

D.C., Washington, USA

Acceptance Rates

MOCR '13 Paper Acceptance Rate 17 of 34 submissions, 50%;

Overall Acceptance Rate 17 of 34 submissions, 50%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

11
Total Citations
View Citations
114
Total Downloads

Downloads (Last 12 months)3
Downloads (Last 6 weeks)2

Reflects downloads up to 21 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Hossain MRahman T(2023)A crowdsource based framework for Bengali scene text data collection and detectionComputers and Electrical Engineering10.1016/j.compeleceng.2023.109025112(109025)Online publication date: Dec-2023
https://doi.org/10.1016/j.compeleceng.2023.109025
Halder MKundu SHasan M(2023)An Improved Method to Recognize Bengali Handwritten Characters Using CNNProceedings of International Conference on Data Science and Applications10.1007/978-981-19-6634-7_43(611-624)Online publication date: 7-Feb-2023
https://doi.org/10.1007/978-981-19-6634-7_43
Akhter SRege P(2021)Multi-task learning for pre-processing of printed Devanagari document images with hyper-parameter optimization of the deep architecture using Taguchi methodSādhanā10.1007/s12046-021-01664-746:3Online publication date: 26-Jul-2021
https://doi.org/10.1007/s12046-021-01664-7
Roy PBhunia ABhattacharyya APal U(2019)Word searching in scene image and video frame in multi-script scenario using dynamic shape codingMultimedia Tools and Applications10.1007/s11042-018-6484-578:6(7767-7801)Online publication date: 17-May-2019
https://dl.acm.org/doi/10.1007/s11042-018-6484-5
Banerjee PDas SSeraogi BMajumder HMukkamala SRoy RChaudhuri B(2018)A System for Automatic Elevation Datum Detection and Hyperlinking of AEC Drawing DocumentsGraphics Recognition. Current Trends and Evolutions10.1007/978-3-030-02284-6_3(30-42)Online publication date: 23-Nov-2018
https://doi.org/10.1007/978-3-030-02284-6_3
Maity SSeraogi BDas SBanerjee PMajumder HMukkamala SRoy RChaudhuri B(2018)An Approach for Detecting Circular Callouts in Architectural, Engineering and Constructional Drawing DocumentsGraphics Recognition. Current Trends and Evolutions10.1007/978-3-030-02284-6_2(17-29)Online publication date: 23-Nov-2018
https://doi.org/10.1007/978-3-030-02284-6_2
Maity SSeraogi BBanerjee PDas SMajumdar HMukkamala SRoy RChaudhuri B(2017)A Novel Approach for Detecting Circular Callouts in AEC Drawing Documents2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2017.273(51-52)Online publication date: Nov-2017
https://doi.org/10.1109/ICDAR.2017.273
Seraogi BDas SBanerjee PMajumdar HMukkamala SRoy RChaudhuri B(2017)Automatic Orientation Correction of AEC Drawing Documents2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2017.252(9-10)Online publication date: Nov-2017
https://doi.org/10.1109/ICDAR.2017.252
Banerjee PChoudhary SDas SMajumder HMukkamala SRoy RChaudhuri B(2017)A System for Creating Automatic Navigation among Architectural and Construction Documents2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2017.116(677-682)Online publication date: Nov-2017
https://doi.org/10.1109/ICDAR.2017.116
Banerjee PChoudhary SDas SMajumdar HRoy RChaudhuri B(2016)Automatic Hyperlinking of Engineering Drawing Documents2016 12th IAPR Workshop on Document Analysis Systems (DAS)10.1109/DAS.2016.76(102-107)Online publication date: Apr-2016
https://doi.org/10.1109/DAS.2016.76
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents