Abstract
The last few years have seen new successful approaches to object recognition and to computer graphics based directly on images without the use of intermediate 3D models. I will argue that most of these techniques depend on a representation of images that induces a linear vector space structure and in principle requires dense correspondence. This image representation allows the use of learning techniques for the analysis and for the synthesis of images, that is for both computer vision and computer graphics.
The key assumption hidden in most view-based approaches to object recognition and object detection is that the relevant images are vectors. This is not true unless they are set in correspondence. The correspondence step associates a shape vector and a texture vector to each image. I will focus on the domain of face images and review how this representation can be used to learn
-
to estimate parameters such as expression and pose from images
-
to interpolate in multiple dimensions new images and images sequences from examples
-
to extrapolate from single images and generate virtual examples
In particular, I will describe a new example-based correspondence technique based on a flexible template represented as the linear combination of prototypes [1]–[4]. The approach is hierarchical and may allow to exploit context in vision tasks. It has interesting biological implications and may applied to other domains beyond vision and graphics.
References
Thomas Vetter and Tomaso Poggio. Linear object classes and image synthesis from a single example image. A.I. Memo No. 1531, Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 1995.
M. Jones and T. Poggio. Model-based matching of line drawings by linear combination of prototypes. Proceedings of the International Conference on Computer Vision, pages 531–536. IEEE, June 1995.
David Beymer and Tomaso Poggio. Image Representations for Visual Learning. Science272 1905–1909 1996
Mike Jones and Tomaso Poggio. Model-Based Matching by Linear Combinations of Prototypes. A.I. Memo No. 1583, Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 1996.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Poggio, T. (1997). Image representations for visual learning. In: Bigün, J., Chollet, G., Borgefors, G. (eds) Audio- and Video-based Biometric Person Authentication. AVBPA 1997. Lecture Notes in Computer Science, vol 1206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0015989
Download citation
DOI: https://doi.org/10.1007/BFb0015989
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62660-2
Online ISBN: 978-3-540-68425-1
eBook Packages: Springer Book Archive