Image representations for visual learning

Tomaso Poggio¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1206))

Included in the following conference series:

International Conference on Audio- and Video-Based Biometric Person Authentication

2425 Accesses

Abstract

The last few years have seen new successful approaches to object recognition and to computer graphics based directly on images without the use of intermediate 3D models. I will argue that most of these techniques depend on a representation of images that induces a linear vector space structure and in principle requires dense correspondence. This image representation allows the use of learning techniques for the analysis and for the synthesis of images, that is for both computer vision and computer graphics.

The key assumption hidden in most view-based approaches to object recognition and object detection is that the relevant images are vectors. This is not true unless they are set in correspondence. The correspondence step associates a shape vector and a texture vector to each image. I will focus on the domain of face images and review how this representation can be used to learn

to estimate parameters such as expression and pose from images
to interpolate in multiple dimensions new images and images sequences from examples
to extrapolate from single images and generate virtual examples

In particular, I will describe a new example-based correspondence technique based on a flexible template represented as the linear combination of prototypes [1]–[4]. The approach is hierarchical and may allow to exploit context in vision tasks. It has interesting biological implications and may applied to other domains beyond vision and graphics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

Thomas Vetter and Tomaso Poggio. Linear object classes and image synthesis from a single example image. A.I. Memo No. 1531, Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 1995.
Google Scholar
M. Jones and T. Poggio. Model-based matching of line drawings by linear combination of prototypes. Proceedings of the International Conference on Computer Vision, pages 531–536. IEEE, June 1995.
Google Scholar
David Beymer and Tomaso Poggio. Image Representations for Visual Learning. Science272 1905–1909 1996
Google Scholar
Mike Jones and Tomaso Poggio. Model-Based Matching by Linear Combinations of Prototypes. A.I. Memo No. 1583, Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Biological and Computational Learning, Massachusetts Institute of Technology, 02139, Cambridge, MA
Tomaso Poggio

Authors

Tomaso Poggio
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Josef Bigün Gérard Chollet Gunilla Borgefors

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Poggio, T. (1997). Image representations for visual learning. In: Bigün, J., Chollet, G., Borgefors, G. (eds) Audio- and Video-based Biometric Person Authentication. AVBPA 1997. Lecture Notes in Computer Science, vol 1206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0015989

Download citation

DOI: https://doi.org/10.1007/BFb0015989
Published: 10 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62660-2
Online ISBN: 978-3-540-68425-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics