Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2505377.2505389acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmocrConference Proceedingsconference-collections
research-article

An approach for Bangla and Devanagari video text recognition

Published: 24 August 2013 Publication History

Abstract

Extraction and recognition of Bangla text from video frame images is challenging due to fonts type and style variation, complex color background, low-resolution, low contrast etc. In this paper, we propose an algorithm for extraction and recognition of Bangla and Devanagari text form video frames with complex background. Here, a two-step approach has been proposed. After text localization, the text line is segmented into words using information based on line contours. First order gradient values of the text blocks are used to find the word gap. Next, an Adaptive SIS binarization technique is applied on each word. Next this binarized text block is sent to a state of the art OCR for recognition.

References

[1]
K. Jung, K. I. Kim, and A. K. Jain, "Text information extraction in images and video: a survey", Pattern Recognition, Vol. 37, No. 5, pp. 977--997, 2004.
[2]
D. Chen and J. M. Odobez, "Video text recognition using sequential Monte Carlo and error voting method", Pattern Recognition Letters, pp. 1386--1403, 2005.
[3]
A. K. Jain and B. Yu., "Automatic Text Location in Images and Video Frames", Pattern Recognition, vol. 31, pp. 2055--2076, 1998.
[4]
Q. Ye., Q. Huang, W. Gao and D. Zhao., "Fast and robust text detection in images and video frames", Image and Vision Computing, vol. 23, pp. 565--576, 2005.
[5]
K. Jung and J. H. Han, "Hybrid Approach to Efficient Text Extraction in Complex Color Images", Pattern Recognition Letters, vol. 25, pp. 679--699, 2004.
[6]
C. Liu, C. Wang and R. Dai., "Text Detection in Images Based on Unsupervised Classification of Edge-based Features", Proc. ICDAR, pp. 610--614, 2005.
[7]
P. Shivakumara, W. Huang and C. L. Tan., "An Efficient Edge based Technique for Text Detection in Video Frames", Proc. DAS, pp. 307--314, 2008.
[8]
M. Cai, J. Song and M. R. Lyu, "A New Approach for Video Text Detection", Proc. ICIP, pp. 117--120, 2002.
[9]
E. K. Wong and M. Chen., "A new robust algorithm for video text extraction", Pattern Recognition, vol. 36, pp. 1397--1406, 2003.
[10]
T. Q. Phan, P. Shivakumara and C. L Tan, "A Laplacian Method for Video Text Detection", Proc. ICDAR, pp. 66--70, 2009.
[11]
N. Sharma, P. Shivakumara, U. Pal, M. Blumenstein and C. L. Tan, "A New Method for Arbitrarily-Oriented Text Detection in Video", Proc. DAS, pp. 74--78, 2012.
[12]
J. Zang and R. Kasturi, "Extraction of Text Objects in Video Documents: Recent Progress", Proc. DAS, pp. 5--17, 2008.
[13]
H. Li, D. Doermann and O. Kia, "Automatic Text Detection and Tracking in Digital Video", IEEE Transactions on Image Processing, vol. 9, pp. 147--156, 2000.
[14]
P. Shivakumara, T. Q. Phanand C. L Tan, "A Robust Wavelet Transform Based Technique for Video Text Detection", Proc. ICDAR, pp. 1285--1289, 2009.
[15]
R. Lienhart and A. Wernicke, "Localizing and segmenting text in images and videos", IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, No. 4, pp. 256--268, 2002.
[16]
J. Lim, J. Park, and G. G. Medioni, "Text segmentation in color images using tensor voting", Image and Vision Computing, vol.25, pp. 671--685, 2007.
[17]
M. Su. Cho, Jae-Hyun Seok, S. Lee, and J. H. Kim, "Scene Text Extraction by Superpixel CRFs Combining Multiple Character Features", Proc. ICDAR, pp.1034--1038, 2011.
[18]
H. Zhang, C. Liu, C. Yang, X. Ding, and K. Wang, "An Improved Scene Text Extraction Method Using Conditional Random Field and Optical Character Recognition", Proc. ICDAR, pp.708--712, 2011.
[19]
P. Shivakumara, S. Bhowmick, B. Su, C. L. Tan, U. Pal, "A New Gradient based character segmentation Method for Video text Recognition", Proc. ICDAR, pp.-126--130, 2011.
[20]
T. Q. Phan, P. Shivakumara, B. Su, and C. L. Tan, "A Gradient Vector Flow-Based Method for Video Character Segmentation", Proc. ICDAR, pp.1024--1028, 2011.
[21]
D. Rajendran, P. Shivakumara, B. Su, S. Lu, and C. L. Tan, "A new Fourier-Moments based Video Word and Character Extraction Method for recognition", Proc. ICDAR, pp.1165--1169, 2011.
[22]
Z. Zhou, L. Li, C. L. Tan, "Edge based Binarization for video text images", Proc. ICPR, pp.133--136, 2010.
[23]
Toru Wakahara and Kohei Kita, "Binarization of Color Character Strings in Scene Images Using K-Means Clustering and Support Vector Machines", Proc. ICDAR, pp.274--278, 2011.
[24]
K. Ntirogiannis, B. Gatos, and I. Pratikakis "Binarization of Textual Content in Video Frames", Proc. ICDAR, pp.673--677, 2011.
[25]
J. Kittler, J. Illingworth, J. Föglein, "Threshold selection based on a simple image statistic", Computer Vision, Graphics, and Image Processing, Vol. 30, No. 2, pp. 125--147, 1985.
[26]
S. Ghosh, P. K. Bora, S. Das and B. B. Chaudhuri, "Development of an Assamese OCR using Bangla OCR", Proc. DAR (http://mile.ee.iisc.ernet.in/dar2012/), pp. 68--73, 2012.
[27]
P. Banerjee and B. B. Chaudhuri, "Video Text Localization using Wavelet and Shearlet Transforms", Available at: http://arxiv.org/abs/1307.4990.

Cited By

View all
  • (2023)A crowdsource based framework for Bengali scene text data collection and detectionComputers and Electrical Engineering10.1016/j.compeleceng.2023.109025112(109025)Online publication date: Dec-2023
  • (2023)An Improved Method to Recognize Bengali Handwritten Characters Using CNNProceedings of International Conference on Data Science and Applications10.1007/978-981-19-6634-7_43(611-624)Online publication date: 7-Feb-2023
  • (2021)Multi-task learning for pre-processing of printed Devanagari document images with hyper-parameter optimization of the deep architecture using Taguchi methodSādhanā10.1007/s12046-021-01664-746:3Online publication date: 26-Jul-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
MOCR '13: Proceedings of the 4th International Workshop on Multilingual OCR
August 2013
99 pages
ISBN:9781450321143
DOI:10.1145/2505377
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • BBN Technologies

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Bangla and Devanagari video text processing
  2. video text OCR
  3. video text localization
  4. video word segmentation

Qualifiers

  • Research-article

Funding Sources

  • DIT
  • Govt. of India
  • Society for Natural Language Technology Research, Kolkata

Conference

MOCR '13
Sponsor:

Acceptance Rates

MOCR '13 Paper Acceptance Rate 17 of 34 submissions, 50%;
Overall Acceptance Rate 17 of 34 submissions, 50%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)2
Reflects downloads up to 21 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)A crowdsource based framework for Bengali scene text data collection and detectionComputers and Electrical Engineering10.1016/j.compeleceng.2023.109025112(109025)Online publication date: Dec-2023
  • (2023)An Improved Method to Recognize Bengali Handwritten Characters Using CNNProceedings of International Conference on Data Science and Applications10.1007/978-981-19-6634-7_43(611-624)Online publication date: 7-Feb-2023
  • (2021)Multi-task learning for pre-processing of printed Devanagari document images with hyper-parameter optimization of the deep architecture using Taguchi methodSādhanā10.1007/s12046-021-01664-746:3Online publication date: 26-Jul-2021
  • (2019)Word searching in scene image and video frame in multi-script scenario using dynamic shape codingMultimedia Tools and Applications10.1007/s11042-018-6484-578:6(7767-7801)Online publication date: 17-May-2019
  • (2018)A System for Automatic Elevation Datum Detection and Hyperlinking of AEC Drawing DocumentsGraphics Recognition. Current Trends and Evolutions10.1007/978-3-030-02284-6_3(30-42)Online publication date: 23-Nov-2018
  • (2018)An Approach for Detecting Circular Callouts in Architectural, Engineering and Constructional Drawing DocumentsGraphics Recognition. Current Trends and Evolutions10.1007/978-3-030-02284-6_2(17-29)Online publication date: 23-Nov-2018
  • (2017)A Novel Approach for Detecting Circular Callouts in AEC Drawing Documents2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2017.273(51-52)Online publication date: Nov-2017
  • (2017)Automatic Orientation Correction of AEC Drawing Documents2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2017.252(9-10)Online publication date: Nov-2017
  • (2017)A System for Creating Automatic Navigation among Architectural and Construction Documents2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2017.116(677-682)Online publication date: Nov-2017
  • (2016)Automatic Hyperlinking of Engineering Drawing Documents2016 12th IAPR Workshop on Document Analysis Systems (DAS)10.1109/DAS.2016.76(102-107)Online publication date: Apr-2016
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media