DOI: 10.1145/1815330.1815376
research-article

Latent Dirichlet allocation based writer identification in offline handwriting

Published: 09 June 2010

Abstract

In this paper, we describe a novel approach to writer identification in offline handwriting using Latent Dirichlet Allocation (LDA). State-of-the-art methods for writer identification employ the traditional feature-classification paradigm, which does not provide enough information about handwriting attributes, such as writing style, that are key components of any forensic analysis of handwriting. The problem is compounded by the lack of efficient rules for defining a writing style that captures writer-specific characteristics over a large dataset. We propose to address this issue with a generative model, LDA, that automatically infers writing styles from a collection of handwritten documents without any predefined set of rules. This information is then used to represent each writer as a distribution over multiple writing styles, which serves to classify any unknown writer sample. We describe our approach on two different feature sets: contour angle features, and structural and concavity features. Our experimental results show performance comparable to baseline systems and demonstrate the efficacy of LDA for learning multiple handwriting styles.
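
The idea of representing each writer as a distribution over inferred writing styles can be illustrated with a small sketch. It is not the authors' implementation, and several pieces are assumptions made only for illustration: each handwriting sample is reduced to a list of hypothetical integer codes (stand-ins for quantized features such as contour angles), LDA is fitted with a basic collapsed Gibbs sampler, the unknown sample is fitted together with the known ones, and the final decision uses a nearest-neighbour rule with Hellinger distance in place of whatever classifier the paper actually uses. The names `fit_style_mixtures`, `hellinger`, and `nearest_writer` are hypothetical.

```python
import math
import random


def fit_style_mixtures(docs, n_styles, vocab_size, alpha=0.1, beta=0.01,
                       iters=200, seed=0):
    """Fit LDA by collapsed Gibbs sampling (illustrative, not optimized).

    docs: list of samples, each a list of integer codes in [0, vocab_size).
    Returns one smoothed distribution over styles (topics) per sample.
    """
    rng = random.Random(seed)
    doc_style = [[0] * n_styles for _ in docs]                 # counts per sample
    style_code = [[0] * vocab_size for _ in range(n_styles)]   # counts per style
    style_total = [0] * n_styles
    assign = []
    # Start from a random style assignment for every code occurrence.
    for d, doc in enumerate(docs):
        assign.append([])
        for w in doc:
            k = rng.randrange(n_styles)
            assign[d].append(k)
            doc_style[d][k] += 1
            style_code[k][w] += 1
            style_total[k] += 1
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current assignment from the counts.
                k = assign[d][i]
                doc_style[d][k] -= 1
                style_code[k][w] -= 1
                style_total[k] -= 1
                # Resample the style from its conditional distribution.
                weights = [
                    (doc_style[d][j] + alpha)
                    * (style_code[j][w] + beta)
                    / (style_total[j] + vocab_size * beta)
                    for j in range(n_styles)
                ]
                k = rng.choices(range(n_styles), weights=weights)[0]
                assign[d][i] = k
                doc_style[d][k] += 1
                style_code[k][w] += 1
                style_total[k] += 1
    return [
        [(c + alpha) / (len(doc) + n_styles * alpha) for c in counts]
        for doc, counts in zip(docs, doc_style)
    ]


def hellinger(p, q):
    """Hellinger distance between two discrete distributions (0 to 1)."""
    return math.sqrt(0.5 * sum((math.sqrt(a) - math.sqrt(b)) ** 2
                               for a, b in zip(p, q)))


def nearest_writer(writer_mixtures, query):
    """Return the writer whose style mixture is closest to the query."""
    return min(writer_mixtures, key=lambda w: hellinger(writer_mixtures[w], query))


# Toy data (made up): writer_A mostly produces codes 0-2, writer_B codes 3-5.
known = {"writer_A": [0, 1, 2] * 10, "writer_B": [3, 4, 5] * 10}
unknown = [2, 0, 1, 0, 2, 1] * 3

names = list(known)
mixtures = fit_style_mixtures([known[n] for n in names] + [unknown],
                              n_styles=2, vocab_size=6)
writer_mixtures = dict(zip(names, mixtures[:-1]))
predicted = nearest_writer(writer_mixtures, mixtures[-1])
print(writer_mixtures, predicted)
```

In this sketch the inferred mixtures play the role of the per-writer style distributions described in the abstract. A practical system would infer the mixture of an unseen sample against an already trained model rather than refitting, and could feed the mixtures to any standard classifier.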


    Published In

    DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
    June 2010
    490 pages
    ISBN:9781605587738
    DOI:10.1145/1815330
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. handwriting analysis
    2. latent Dirichlet allocation
    3. topic models
    4. writer identification

    Qualifiers

    • Research-article

    Conference

    DAS '10

    Cited By

    • (2021) A clustering method for graphical handwriting components and statistical writership analysis. Statistical Analysis and Data Mining 14:1 (41-60). DOI: 10.1002/sam.11488. Online publication date: 20-Jan-2021
    • (2014) Data Sufficiency for Online Writer Identification. Proceedings of the 2014 22nd International Conference on Pattern Recognition (3121-3125). DOI: 10.1109/ICPR.2014.538. Online publication date: 24-Aug-2014
    • (2014) A Hierarchical Framework for Accent Based Writer Identification. 2014 11th IAPR International Workshop on Document Analysis Systems (21-25). DOI: 10.1109/DAS.2014.69. Online publication date: Apr-2014
    • (2013) A Bayesian Framework for Modeling Accents in Handwriting. Proceedings of the 2013 12th International Conference on Document Analysis and Recognition (917-921). DOI: 10.1109/ICDAR.2013.187. Online publication date: 25-Aug-2013
    • (2013) Semi-supervised framework for writer identification using structural learning. IET Biometrics 2:4 (208-215). DOI: 10.1049/iet-bmt.2013.0018. Online publication date: Dec-2013
    • (2013) A hierarchical Bayesian approach to online writer identification. IET Biometrics 2:4 (191-198). DOI: 10.1049/iet-bmt.2013.0017. Online publication date: 1-Dec-2013
    • (2012) Modeling Writing Styles for Online Writer Identification. Proceedings of the 2012 International Conference on Frontiers in Handwriting Recognition (387-392). DOI: 10.1109/ICFHR.2012.235. Online publication date: 18-Sep-2012
    • (2012) Accent Detection in Handwriting Based on Writing Styles. Proceedings of the 2012 10th IAPR International Workshop on Document Analysis Systems (312-316). DOI: 10.1109/DAS.2012.13. Online publication date: 27-Mar-2012
    • (2010) Retrieving Handwriting Styles. Proceedings of the 2010 12th International Conference on Frontiers in Handwriting Recognition (265-270). DOI: 10.1109/ICFHR.2010.48. Online publication date: 16-Nov-2010
    • (2010) Writer recognition of Arabic text by generative local features. 2010 Fourth IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS) (1-7). DOI: 10.1109/BTAS.2010.5634495. Online publication date: Sep-2010
