Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1282280.1282288acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
Article

Video parsing based on head tracking and face recognition

Published: 09 July 2007 Publication History

Abstract

In this paper, we describe a fully automatic video retrieval prototype system that uses an image or a video sequence of an interested identity as probe. The system is based on face vision techniques including face detection and tracking, face alignment and recognition. Given a film or TV sitcom, first face trajectories are extracted in video by head tracking that decompose the video into segments corresponding to certain identity, then frames containing faces of higher quality are selected and normalized according to face alignment results, and finally different segments are associated by face recognition. Experiments are carried out on news video, feature length film video and TV sitcom to show its effectiveness. Potential usage of our system includes intelligent DVD/VCD browsing, video database retrieval, meeting record browsing, etc.

References

[1]
O Arandjelovic, G Shakhnarovich, et al. Face recognition with image sets using manifold density divergence, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 pp. 581--588
[2]
Ognjen Arandjelovic, Roberto Cipolla. Automatic Cast Listing in Feature-Length Films with Anisotropic Manifold Space, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2 (CVPR'06) pp. 1513--1520
[3]
Zhao, Chellappa, Rosenfeld & Phillips, Face Recognition: A Literature Survey, UMD CS-TR-4167, 2000
[4]
Mark Everingham and Andrew Zisserman, Identifying Individuals in Video by Combining Generative' and Discriminative Head Models, ICCV 2005.
[5]
M. Jones and P. Viola, Fast Multi-View Face Detection. MERL-TR2003-96, July 2003.
[6]
Tae-Kyun Kim, Ognjen Arandjelovic and Roberto Cipolla, Boosted Manifold Principal Angles for Image Set-Based Recognition, Pattern Recognition, accepted for publication conditioned on minor revision, 2006.
[7]
Pengxu Li, Haizhou Ai, Face Recognition Using Wavelet Features via MRC Boosting. Submitted to 2nd International Conference on Biometrics (ICB2007), Seoul, Korea, August 27--29, 2007
[8]
Y. Li, S. Gong, and H. Liddell. Constructing facial identity surfaces for recognition. International Journal of Computer Vision, 53(1):71--92, 2003.
[9]
Yuan Li, Haizhou Ai, et. al, Robust Head Tracking Based on a Multi-State Particle Filter, 7th IEEE International Conference, Automatic Face and Gesture Recognition, AFG2006, pp.335--340, Southampton, UK, April 10--12 2006.
[10]
Chang Huang, Haizhou Ai, et. al, Vector Boosting for Rotation Invariant Multi-View Face Detection, The IEEE International Conference on Computer Vision (ICCV-05), pp.446--453, Beijing, China, Oct 17--20, 2005.
[11]
B. Moghaddam, T. Jebara, and A. Pentland. Bayesian Face Recognition. Pattern Recognition, vol.33, no.11, November 2000
[12]
B. Moghaddam, Face Recognition by Humans and Machines, A Tutorial Survey, CVPR' 01 Short Course, CVPR2001, Dec. 2001.
[13]
S. Satoh, Y. Nakamura, and T. Kanade, Name-It: Naming and Detecting Faces in News Videos, IEEE MultiMedia, 1999, 6(1):22--35.
[14]
R. E. Schapire, Y. Singer. Improved boosting algorithms using confidence-rated predictions. Machine Learning, 1999, 37(3). 297--336.
[15]
T. Sim, S. Baker, and M. Bsat, The CMU Pose, Illumination, and Expression (PIE) Database. Technical Report CMU-RI-TR-01-02, 2001.
[16]
Josef Sivic, Mark Everingham, and Andrew Zisserman, Person spotting: video shot retrieval for face sets. International Conference on Image and Video Retrieval (CIVR 2005), Singapore 2005
[17]
Laurenz Wiskott, Jean-Marc Fellous, Norbert Krüger, and Christoph von der Malsburg. Face Recognition by Elastic Bunch Graph Matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, no.7, 1997.
[18]
Bo Wu, Haizhou Ai, Chang Huang, LUT-Based AdaBoost for Gender Classification, In LNCS, Vol.2688, pp.104--110, Springer-Verlag, 2003.
[19]
Xun Xu and Thomas S. Huang, Face Recognition with MRC-Boosting, ICCV 2005, Beijing, China.
[20]
Xun Xu Yong Rui Huang, T. S Recognizing Faces in Recorded Meetings via MRC-Boosting, Multimedia and Expo, 2006 IEEE International Conference on, Toronto, ON, Canada
[21]
Lei Zhang, Stan Z. Li, et. al. Boosting Local Feature Based Classifiers for Face Recognition. First IEEE Workshop on Face Processing in Video. 2004, Washington, USA.
[22]
Li Zhang, Haizhou Ai, et. al, Robust Face Alignment Based on Local Texture Classifiers, The IEEE International Conference on Image Processing (ICIP-05), Genoa, Italy, September 11--14, 2005.
[23]
http://www-nlpir.nist.gov/projects/trecvid

Cited By

View all
  • (2014)A DTCNN Approach on Video Analysis: Dynamic and Static Object SegmentationRecent Advances on Hybrid Approaches for Designing Intelligent Systems10.1007/978-3-319-05170-3_22(315-336)Online publication date: 27-Mar-2014
  • (2012)Video retrieval by mimicking posesProceedings of the 2nd ACM International Conference on Multimedia Retrieval10.1145/2324796.2324838(1-8)Online publication date: 5-Jun-2012
  • (2012)A comparison between a DTCNN and SOM like approach for dynamic object detection in videos2012 Annual Meeting of the North American Fuzzy Information Processing Society (NAFIPS)10.1109/NAFIPS.2012.6291048(1-6)Online publication date: Aug-2012
  • Show More Cited By

Index Terms

  1. Video parsing based on head tracking and face recognition

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval
    July 2007
    655 pages
    ISBN:9781595937339
    DOI:10.1145/1282280
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 July 2007

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. face recognition
    2. face vision
    3. video content retrieval
    4. video parsing

    Qualifiers

    • Article

    Conference

    CIVR07
    Sponsor:

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 13 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2014)A DTCNN Approach on Video Analysis: Dynamic and Static Object SegmentationRecent Advances on Hybrid Approaches for Designing Intelligent Systems10.1007/978-3-319-05170-3_22(315-336)Online publication date: 27-Mar-2014
    • (2012)Video retrieval by mimicking posesProceedings of the 2nd ACM International Conference on Multimedia Retrieval10.1145/2324796.2324838(1-8)Online publication date: 5-Jun-2012
    • (2012)A comparison between a DTCNN and SOM like approach for dynamic object detection in videos2012 Annual Meeting of the North American Fuzzy Information Processing Society (NAFIPS)10.1109/NAFIPS.2012.6291048(1-6)Online publication date: Aug-2012
    • (2011)Person re-identification in TV series using robust face recognition and user feedbackMultimedia Tools and Applications10.1007/s11042-010-0603-255:1(83-104)Online publication date: 1-Oct-2011
    • (2010)Heat Kernel Based Local Binary Pattern for Face RepresentationIEEE Signal Processing Letters10.1109/LSP.2009.203665317:3(308-311)Online publication date: Mar-2010
    • (2010)Interactive person re-identification in TV series2010 International Workshop on Content Based Multimedia Indexing (CBMI)10.1109/CBMI.2010.5529898(1-6)Online publication date: Jun-2010
    • (2009)“Who are you?” - Learning person specific classifiers from video2009 IEEE Conference on Computer Vision and Pattern Recognition10.1109/CVPR.2009.5206513(1145-1152)Online publication date: Jun-2009
    • (2009)Pose search: Retrieving people using their pose2009 IEEE Conference on Computer Vision and Pattern Recognition10.1109/CVPR.2009.5206495(1-8)Online publication date: Jun-2009

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media