Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2354409.2354937guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Action bank: A high-level representation of activity in video

Published: 16 June 2012 Publication History

Abstract

Activity recognition in video is dominated by low- and mid-level features, and while demonstrably capable, by nature, these features carry little semantic meaning. Inspired by the recent object bank approach to image representation, we present Action Bank, a new high-level representation of video. Action bank is comprised of many individual action detectors sampled broadly in semantic space as well as viewpoint space. Our representation is constructed to be semantically rich and even when paired with simple linear SVM classifiers is capable of highly discriminative performance. We have tested action bank on four major activity recognition benchmarks. In all cases, our performance is better than the state of the art, namely 98.2% on KTH (better by 3.3%), 95.0% on UCF Sports (better by 3.7%), 57.9% on UCF50 (baseline is 47.9%), and 26.9% on HMDB51 (baseline is 23.2%). Furthermore, when we analyze the classifiers, we find strong transfer of semantics from the constituent action detectors to the bank classifier.

Cited By

View all
  • (2024)Multimodal Attentive Representation Learning for Micro-video Multi-label ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/364388820:6(1-23)Online publication date: 8-Mar-2024
  • (2022)Action Detection System Based on Pose InformationProceedings of the 4th ACM International Conference on Multimedia in Asia10.1145/3551626.3564974(1-3)Online publication date: 13-Dec-2022
  • (2021)Dynamic normalization and relay for video action recognitionProceedings of the 35th International Conference on Neural Information Processing Systems10.5555/3540261.3541104(11026-11040)Online publication date: 6-Dec-2021
  • Show More Cited By

Index Terms

  1. Action bank: A high-level representation of activity in video

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    CVPR '12: Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
    June 2012
    3800 pages
    ISBN:9781467312264

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 16 June 2012

    Author Tags

    1. Correlation
    2. Detectors
    3. Humans
    4. Semantics
    5. Spatiotemporal phenomena
    6. Support vector machines
    7. Vectors

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 23 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Multimodal Attentive Representation Learning for Micro-video Multi-label ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/364388820:6(1-23)Online publication date: 8-Mar-2024
    • (2022)Action Detection System Based on Pose InformationProceedings of the 4th ACM International Conference on Multimedia in Asia10.1145/3551626.3564974(1-3)Online publication date: 13-Dec-2022
    • (2021)Dynamic normalization and relay for video action recognitionProceedings of the 35th International Conference on Neural Information Processing Systems10.5555/3540261.3541104(11026-11040)Online publication date: 6-Dec-2021
    • (2021)On-demand Action Detection System using Pose InformationProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3478567(2810-2812)Online publication date: 17-Oct-2021
    • (2019)Learning Match Kernels on Grassmann Manifolds for Action RecognitionIEEE Transactions on Image Processing10.1109/TIP.2018.286668828:1(205-215)Online publication date: 1-Jan-2019
    • (2019)Real time security framework for detecting abnormal events at ATM installationsJournal of Real-Time Image Processing10.1007/s11554-016-0573-316:2(535-545)Online publication date: 1-Apr-2019
    • (2019)Deep Packet FlowJournal of Signal Processing Systems10.1007/s11265-018-1363-x91:6(609-625)Online publication date: 1-Jun-2019
    • (2019)Second-order Temporal Pooling for Action RecognitionInternational Journal of Computer Vision10.1007/s11263-018-1111-5127:4(340-362)Online publication date: 1-Apr-2019
    • (2019)Video benchmarks of human action datasetsArtificial Intelligence Review10.1007/s10462-018-9651-152:2(1107-1154)Online publication date: 1-Aug-2019
    • (2019)Efficient health-related abnormal behavior detection with visual and inertial sensor integrationPattern Analysis & Applications10.1007/s10044-017-0660-522:2(601-614)Online publication date: 1-May-2019
    • Show More Cited By

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media