Conservative Visual Learning for Object Detection with Minimal Hand Labeling Effort

Peter Roth, Helmut Grabner, Danijel Skočaj, Horst Bischof & Aleš Leonardis

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 3663)

Included in the following conference series:

Joint Pattern Recognition Symposium

Abstract

We present a novel framework for unsupervised training of an object detection system. The basic idea is (1) to exploit a huge amount of unlabeled video data by being very conservative in selecting training examples, and (2) to start with a very simple object detection system and use generative and discriminative classifiers in an iterative co-training fashion to arrive at increasingly better object detectors. We demonstrate the framework on a surveillance task in which we learn a person detector. We start with a simple moving-object classifier and proceed with robust PCA (on shape and appearance) as a generative classifier, which in turn generates a training set for a discriminative AdaBoost classifier. The results obtained by AdaBoost are again filtered by PCA, which produces an even better training set. We demonstrate that this approach avoids hand labeling of training data and still achieves a state-of-the-art detection rate.
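
The loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: a mean vector with squared distance stands in for robust PCA and its reconstruction error, a nearest-class-mean rule stands in for AdaBoost, and all function names, thresholds, and the 2-D toy data are assumptions made for illustration. It only shows the conservative idea that ambiguous samples are discarded rather than labeled.

```python
import random
from statistics import mean


def fit_prototype(samples):
    """Mean vector of the samples (toy stand-in for a robust PCA subspace)."""
    return [mean(column) for column in zip(*samples)]


def generative_error(x, prototype):
    """Squared distance to the prototype (stand-in for reconstruction error)."""
    return sum((a - b) ** 2 for a, b in zip(x, prototype))


def conservative_select(candidates, prototype, accept_below, reject_above):
    """Label only confident samples; anything in between is dropped."""
    positives, negatives = [], []
    for x in candidates:
        err = generative_error(x, prototype)
        if err < accept_below:
            positives.append(x)
        elif err > reject_above:
            negatives.append(x)
    return positives, negatives


def fit_discriminative(positives, negatives):
    """Nearest-class-mean classifier (toy stand-in for AdaBoost)."""
    pos_mean = fit_prototype(positives)
    neg_mean = fit_prototype(negatives)

    def classify(x):
        return generative_error(x, pos_mean) < generative_error(x, neg_mean)

    return classify


def co_train(patches, seed_positives, rounds=3, accept_below=1.0, reject_above=9.0):
    """Alternate generative filtering and discriminative training."""
    # seed_positives plays the role of the simple initial detector's output.
    prototype = fit_prototype(seed_positives)
    classify = None
    for _ in range(rounds):
        pos, neg = conservative_select(patches, prototype, accept_below, reject_above)
        if not pos or not neg:
            break
        classify = fit_discriminative(pos, neg)
        detections = [x for x in patches if classify(x)]
        # Filter the detections again with the generative model.
        verified, _ = conservative_select(detections, prototype, accept_below, reject_above)
        if not verified:
            break
        prototype = fit_prototype(verified)
    return classify


# Hypothetical 2-D "patches": one cluster of persons, one of background clutter.
random.seed(0)
people = [(random.gauss(0, 0.3), random.gauss(0, 0.3)) for _ in range(50)]
clutter = [(random.gauss(5, 0.3), random.gauss(5, 0.3)) for _ in range(50)]
detector = co_train(people + clutter, seed_positives=people[:5])
```

In this toy setting the two thresholds leave a wide band between accepted and rejected samples, so a noisy patch is excluded from training instead of being mislabeled; how wide that band should be in a real system is a design choice this sketch does not address.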

This work has been supported by the Austrian Joint Research Project Cognitive Vision under projects S9103-N04 and S9104-N04, by the Federal Ministry for Education, Science and Culture of Austria under the CONEX program, by the SI-A project, by the Federal Ministry of Transport, Innovation and Technology under P-Nr. I2-2-26p Vitus2, by the Research program Computer Vision P2-0214 (RS), by EU FP6-004250-IP project CoSy and by EU FP6-511051-2 project MOBVIS.

Author information

Authors and Affiliations

Inst. for Computer Graphics and Vision, Graz University of Technology, Austria
Peter Roth, Helmut Grabner & Horst Bischof
Faculty of Computer and Information Science, University of Ljubljana, Slovenia
Danijel Skočaj & Aleš Leonardis

Editor information

Editors and Affiliations

PRIP, Vienna University of Technology, Austria
Walter G. Kropatsch
Vienna University of Technology, Vienna, Austria
Robert Sablatnig
Pattern Recognition and Image Processing Group, Institute of Computer-Aided Automation, Vienna University of Technology, Favoritenstraße 9/1832, A-1040, Vienna, Austria
Allan Hanbury

Cite this paper

Roth, P., Grabner, H., Skočaj, D., Bischof, H., Leonardis, A. (2005). Conservative Visual Learning for Object Detection with Minimal Hand Labeling Effort. In: Kropatsch, W.G., Sablatnig, R., Hanbury, A. (eds) Pattern Recognition. DAGM 2005. Lecture Notes in Computer Science, vol 3663. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11550518_37

DOI: https://doi.org/10.1007/11550518_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28703-2
Online ISBN: 978-3-540-31942-9
eBook Packages: Computer Science (R0)
