Combined Document/Business Card Detector for Proactive Document-Based Services on the Smartphone

Yong-Joong Kim¹⁷,
Yonghyun Kim¹⁷,
Bong-Nam Kang¹⁸ &
…
Daijin Kim^17,18

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9492))

Included in the following conference series:

International Conference on Neural Information Processing

2352 Accesses

Abstract

In this paper, we present a novel combined detector of document and business card. To detect document or business card, our method firstly extracts a document object region from a given image, and then classifies it into positive or negative class. In the step of extracting the document object region, a block-based processing is exploited to efficiently find the line segment candidates of its boundary, and RANSAC-like method under three constraints is used to search its real boundary. In classification step, after performing image normalization on the extracted region, the Fisher vector is extracted to represent the document object, then it is classified by linear-SVM. For evaluating the proposed method, we carry out some experiments by using the collected images, and show that our method has achieved about 94 % accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Machine Learning Techniques for Identity Document Verification in Uncontrolled Environments: A Case Study

An Android Business Card Reader Based on Google Vision: Design and Evaluation

Online Document Identification and Verification Using Machine Learning Model

References

Abbyy mobile ocr engine. http://www.abbyy.com/mobile-ocr
camcard. http://www.camcard.com
Opencv. http://www.opencv.org
Samsung galaxy s4. http://www.samsung.com/global/microsite/galaxys4/
Vlfeat. http://www.vlfeat.org
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Article MathSciNet MATH Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 886–893 (2005)
Google Scholar
Duda, R.O., Hart, P.E.: Use of the hough transformation to detect lines and curves in pictures. Commun. ACM 15(1), 11–15 (1972)
Article MATH Google Scholar
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Article MathSciNet Google Scholar
Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Improving neural networks by preventing co-adaptation of feature detectors. CoRR abs/1207.0580 (2012)
Google Scholar
Jaakkola, T.S., Haussler, D.: Exploiting generative models in discriminative classifiers. In: Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems II, pp. 487–493 (1999)
Google Scholar
Kang, L., Kumar, J., Ye, P., Li, Y., Doermann, D.: Convolutional neural networks for document image classification. In: Proceeding of 22nd International Conference on Pattern Recognition, pp. 3168–3172 (2014)
Google Scholar
Kumar, J., Ye, P., Doermann, D.: Structural similarity for document image classification and retrieval. Pattern Recogn. Lett. 43(1), 119–126 (2014)
Article Google Scholar
Matas, J., Galambos, C., Kittler, J.: Robust detection of lines using the progressive probabilistic hough transform. Comput. Vis. Image Underst. 78(1), 119–137 (2000)
Article Google Scholar
Nair, V., Hinton, G.E.: Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning (ICML 2010), pp. 807–814 (2010)
Google Scholar
Perronnin, F., Dance, C.: Fisher kernels on visual vocabularies for image categorization. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2007)
Google Scholar
Rosten, E., Drummond, T.W.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part I. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)
Chapter Google Scholar
Sánchez, J., Perronnin, F., Mensink, T., Verbeek, J.: Image classification with the fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgments

The research was supported by the Implementation of Technologies for Identification, Behavior, and Location of Human based on Sensor Network Fusion Program through the Ministry of Trade, Industry and Energy (Grant Number: 10041629), and also by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. 2015R1A2A2A01004282).

Author information

Authors and Affiliations

Department of Computer Science & Engineering, Pohang University of Science and Technology, 77 Cheongam-ro, Nam-gu, Pohang, Gyeongbuk, 37673, Republic of Korea
Yong-Joong Kim, Yonghyun Kim & Daijin Kim
Department of Creative IT Engineering, Pohang University of Science and Technology, 77 Cheongam-ro, Nam-gu, Pohang, Gyeongbuk, 37673, Republic of Korea
Bong-Nam Kang & Daijin Kim

Authors

Yong-Joong Kim
View author publications
You can also search for this author in PubMed Google Scholar
Yonghyun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Bong-Nam Kang
View author publications
You can also search for this author in PubMed Google Scholar
Daijin Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daijin Kim .

Editor information

Editors and Affiliations

University of Istanbul, Istanbul, Turkey
Sabri Arik
University at Qatar, Doha, Qatar
Tingwen Huang
Tunku Abdul Rahman University College, Kuala Lumpur, Malaysia
Weng Kin Lai
University of Science Technology, Wuhan, China
Qingshan Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, YJ., Kim, Y., Kang, BN., Kim, D. (2015). Combined Document/Business Card Detector for Proactive Document-Based Services on the Smartphone. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9492. Springer, Cham. https://doi.org/10.1007/978-3-319-26561-2_47

Download citation

DOI: https://doi.org/10.1007/978-3-319-26561-2_47
Published: 18 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26560-5
Online ISBN: 978-3-319-26561-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Combined Document/Business Card Detector for Proactive Document-Based Services on the Smartphone

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Machine Learning Techniques for Identity Document Verification in Uncontrolled Environments: A Case Study

An Android Business Card Reader Based on Google Vision: Design and Evaluation

Online Document Identification and Verification Using Machine Learning Model

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Combined Document/Business Card Detector for Proactive Document-Based Services on the Smartphone

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Machine Learning Techniques for Identity Document Verification in Uncontrolled Environments: A Case Study

An Android Business Card Reader Based on Google Vision: Design and Evaluation

Online Document Identification and Verification Using Machine Learning Model

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation