Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/CVPR.2013.398guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Heterogeneous Visual Features Fusion via Sparse Multimodal Machine

Published: 23 June 2013 Publication History

Abstract

To better understand, search, and classify image and video information, many visual feature descriptors have been proposed to describe elementary visual characteristics, such as the shape, the color, the texture, etc. How to integrate these heterogeneous visual features and identify the important ones from them for specific vision tasks has become an increasingly critical problem. In this paper, We propose a novel Sparse Multimodal Learning (SMML) approach to integrate such heterogeneous features by using the joint structured sparsity regularizations to learn the feature importance of for the vision tasks from both group-wise and individual point of views. A new optimization algorithm is also introduced to solve the non-smooth objective with rigorously proved global convergence. We applied our SMML method to five broadly used object categorization and scene understanding image data sets for both single-label and multi-label image classification tasks. For each data set we integrate six different types of popularly used image features. Compared to existing scene and object categorization methods using either single modality or multi-modalities of features, our approach always achieves better performances measured.

Cited By

View all
  • (2023)Joint Augmented and Compressed Dictionaries for Robust Image ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/357291019:3s(1-24)Online publication date: 24-Feb-2023
  • (2019)Structured sparse multi-view feature selection based on weighted hinge lossMultimedia Tools and Applications10.1007/s11042-018-6937-x78:11(15455-15481)Online publication date: 1-Jun-2019
  • (2016)Human action recognition via skeletal and depth based feature fusionProceedings of the 9th International Conference on Motion in Games10.1145/2994258.2994268(123-132)Online publication date: 10-Oct-2016
  • Show More Cited By

Index Terms

  1. Heterogeneous Visual Features Fusion via Sparse Multimodal Machine

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    CVPR '13: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition
    June 2013
    3752 pages
    ISBN:9780769549897

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 23 June 2013

    Author Tags

    1. Data Integration
    2. Structured Sparsity
    3. Visual Features Fusion

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Joint Augmented and Compressed Dictionaries for Robust Image ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/357291019:3s(1-24)Online publication date: 24-Feb-2023
    • (2019)Structured sparse multi-view feature selection based on weighted hinge lossMultimedia Tools and Applications10.1007/s11042-018-6937-x78:11(15455-15481)Online publication date: 1-Jun-2019
    • (2016)Human action recognition via skeletal and depth based feature fusionProceedings of the 9th International Conference on Motion in Games10.1145/2994258.2994268(123-132)Online publication date: 10-Oct-2016
    • (2016)Multimodal Multipart Learning for Action Recognition in Depth VideosIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2015.250529538:10(2123-2129)Online publication date: 1-Oct-2016
    • (2016)UDSFSNeurocomputing10.1016/j.neucom.2015.10.130196:C(150-158)Online publication date: 5-Jul-2016
    • (2015)Multi-view matrix decompositionProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832581.2832728(3438-3444)Online publication date: 25-Jul-2015
    • (2015)Anatomical Annotations for Drosophila Gene Expression Patterns via Multi-Dimensional Visual Descriptors IntegrationProceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining10.1145/2783258.2783384(339-348)Online publication date: 10-Aug-2015
    • (2015)Cognitive Inspired WOR Framework to Reveal Image Semantics, for Efficient Content Based Image RetrievalProceedings of the 2nd International Conference on Perception and Machine Intelligence10.1145/2708463.2709034(201-210)Online publication date: 26-Feb-2015
    • (2014)Revealing What to Extract from Where, for Object-Centric Content Based Image Retrieval (CBIR)Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing10.1145/2683483.2683540(1-8)Online publication date: 14-Dec-2014
    • (2014)Exploiting Correlation ConsensusProceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2654999(981-984)Online publication date: 3-Nov-2014
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media