Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1815330.1815332acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

IBN SINA: a database for research on processing and understanding of Arabic manuscripts images

Published: 09 June 2010 Publication History

Abstract

This paper describes the steps that have been undertaken in order to develop the IBN SINA database, which is designed to apply learning techniques in the processing and understanding of document images. The description of the preparation process, including preprocessing, feature extraction and labeling, is provided. The database has been evaluated using classification techniques, such as the SVM classifiers. In order to make the database compatible with these classifiers, the labels of the shapes have been translated into a set of bi-class problems. Promising results with the SVM classifiers have been obtained.

References

[1]
M. M. Adankon and M. Cheriet. Encyclopedia of Biometrics, chapter Support Vector Machine, pages 1303--1308. Springer, 2009.
[2]
Y. Al-Ohali, M. Cheriet, and C. Suen. Databases for recognition of handwritten arabic cheques. Pattern Recognition, 36(1):111--121, Jan. 2003.
[3]
H. Alamri, J. Sadri, C. Suen, and N. Nobile. A novel comprehensive database for Arabic off-line handwriting recognition. In ICFHR'08, 2008.
[4]
G. Farin. Curves and surfaces for computer aided geometric design (5th ed.): a practical guide. Academic Press Professional, Inc., 2001.
[5]
R. Farrahi Moghaddam and M. Cheriet. A robust word spotting system for historical arabic manuscripts based on skeleton features. IEEE Trans. on Systems, Man, and Cybernetics, Part B, Submitted.
[6]
R. Farrahi Moghaddam and M. Cheriet. Application of multi-level classifiers and clustering for automatic word-spotting in historical document images. In ICDAR'09, pages 511--515, Barcelona, Spain, July 26--29 2009.
[7]
R. Farrahi Moghaddam and M. Cheriet. RSLDI: Restoration of single-sided low-quality document images. Pattern Recognition, 42:3355--3364, 2009.
[8]
A. Gacek. Arabic Manuscripts: A Vademecum for Readers. Handbook of Oriental Studies. Section 1 The Near and Middle East, 98. Leiden; Boston: Brill, 2009. ISBN-10: 90 04 17036 7.
[9]
J. Hull. A database for handwritten text recognition research. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(5):550--554, 1994.
[10]
S. Johansson, G. Leech, and H. Goodluck. Lancaster-oslo/bergen corpus, http://khnt.hit.uib.no/icame/manuals/lob/index.htm. Department of English, University of Oslo, Oslo, 1978.
[11]
L. Lorigo and V. Govindaraju. Offline arabic handwriting recognition: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(5):712--724, 2006.
[12]
U.-V. Marti and H. Bunke. A full english sentence database for off-line handwriting recognition. In ICDAR'99, pages 705--708, 1999.
[13]
U.-V. Marti and H. Bunke. The iam-database: an english sentence database for offline handwriting recognition. IJDAR, 5(1):39--46, Nov. 2002.
[14]
K. Morik, P. Brockhausen, and T. Joachims. Combining statistical learning with a knowledge-based approach -- a case study in intensive care monitoring. In ICML'99, 1999.
[15]
V. Vapnik. Statistical Learning Theory. John Wiley & Sons, New York, 1998.
[16]
R. Wisnovsky. Philosophy, Science and Exegesis in Greek, Arabic and Latin Commentaries, volume 2, chapter The nature and scope of Arabic philosophical commentary in post-classical (ca. 1100--1900 AD) Islamic intellectual history: Some preliminary observations, pages 149--191. Institute of Classical Studies, London, 2004.

Cited By

View all
  • (2024)Word Spotting in Historical Arabic Documents Using Deep Learning2024 6th International Conference on Computing and Informatics (ICCI)10.1109/ICCI61671.2024.10485040(499-505)Online publication date: 6-Mar-2024
  • (2024)A novel word recognition system in Persian/Arabic handwritten words using stacking ensemble classifier of deep learningMultimedia Tools and Applications10.1007/s11042-024-20467-6Online publication date: 25-Nov-2024
  • (2024)New Transformer Approach to the Recognition of Mediaeval Arabic Historical ManuscriptsArtificial Intelligence and Its Practical Applications in the Digital Economy10.1007/978-3-031-71429-0_20(271-283)Online publication date: 18-Dec-2024
  • Show More Cited By

Index Terms

  1. IBN SINA: a database for research on processing and understanding of Arabic manuscripts images

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
        June 2010
        490 pages
        ISBN:9781605587738
        DOI:10.1145/1815330
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 09 June 2010

        Permissions

        Request permissions for this article.

        Check for updates

        Qualifiers

        • Research-article

        Conference

        DAS '10

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)3
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 09 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Word Spotting in Historical Arabic Documents Using Deep Learning2024 6th International Conference on Computing and Informatics (ICCI)10.1109/ICCI61671.2024.10485040(499-505)Online publication date: 6-Mar-2024
        • (2024)A novel word recognition system in Persian/Arabic handwritten words using stacking ensemble classifier of deep learningMultimedia Tools and Applications10.1007/s11042-024-20467-6Online publication date: 25-Nov-2024
        • (2024)New Transformer Approach to the Recognition of Mediaeval Arabic Historical ManuscriptsArtificial Intelligence and Its Practical Applications in the Digital Economy10.1007/978-3-031-71429-0_20(271-283)Online publication date: 18-Dec-2024
        • (2023)Bagging: An Ensemble Approach for Recognition of Handwritten Place Names in Gurumukhi ScriptACM Transactions on Asian and Low-Resource Language Information Processing10.1145/359302422:7(1-25)Online publication date: 25-Jul-2023
        • (2023)Analysis of Cursive Text Recognition Systems: A Systematic Literature ReviewACM Transactions on Asian and Low-Resource Language Information Processing10.1145/359260022:7(1-30)Online publication date: 20-Jul-2023
        • (2022)MOJ-DBPattern Recognition Letters10.1016/j.patrec.2022.04.040159:C(54-60)Online publication date: 1-Jul-2022
        • (2021)Spatial Distribution of Ink at Keypoints (SDIK): A Novel Feature for Word Spotting in Arabic DocumentsInternational Journal of Image and Graphics10.1142/S021946782250035822:04Online publication date: 12-Jul-2021
        • (2021)Histogram of Marked Background (HMB) Feature Extraction Method for Arabic Handwriting RecognitionInternational Journal of Image and Graphics10.1142/S021946782250015222:02Online publication date: 24-Apr-2021
        • (2021)Practical Active Learning with Model Selection for Small Data2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)10.1109/ICMLA52953.2021.00263(1647-1653)Online publication date: Dec-2021
        • (2021)Holistic word descriptor for lexicon reduction in handwritten arabic documentsPattern Recognition10.1016/j.patcog.2021.108072119:COnline publication date: 1-Nov-2021
        • Show More Cited By

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media