Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Query-Document-Dependent Fusion: A Case Study of Multimodal Music Retrieval

Published: 01 December 2013 Publication History

Abstract

In recent years, multimodal fusion has emerged as a promising technology for effective multimedia retrieval. Developing the optimal fusion strategy for different modalities (e.g., content, metadata) has been the subject of intensive research. Given a query, existing methods derive a unified fusion strategy for all documents with the underlying assumption that the relative significance of a modality remains the same across all documents. However, this assumption is often invalid. We thus propose a general multimodal fusion framework, query-document-dependent fusion (QDDF), which derives the optimal fusion strategy for each query-document pair via intelligent content analysis of both queries and documents. By investigating multimodal fusion strategies adaptive to both queries and documents, we demonstrate that existing multimodal fusion approaches are special cases of QDDF and propose two QDDF approaches to derive fusion strategies. The dual-phase QDDF explicitly derives and fuses query- and document-dependent weights, and the regression-based QDDF determines the fusion weight for a query-document pair via a regression model derived from training data. To evaluate the proposed approaches, comprehensive experiments have been conducted using a multimedia data set with around 17 K full songs and over 236 K social queries. Results indicate that the regression-based QDDF is superior in handling single-dimension queries. In comparison, the dual-phase QDDF outperforms existing approaches for most query types. We found that document-dependent weights are instrumental in enhancing multimedia fusion performance. In addition, efficiency analysis demonstrates the scalability of QDDF over large data sets.

Cited By

View all
  • (2021)CONQUER: Contextual Query-aware Ranking for Video Corpus Moment RetrievalProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475281(3900-3908)Online publication date: 17-Oct-2021
  • (2014)Bridging the User Intention GapProceedings of the First International Workshop on Internet-Scale Multimedia Management10.1145/2661714.2661720(59-64)Online publication date: 7-Nov-2014
  1. Query-Document-Dependent Fusion: A Case Study of Multimodal Music Retrieval

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image IEEE Transactions on Multimedia
    IEEE Transactions on Multimedia  Volume 15, Issue 8
    December 2013
    495 pages

    Publisher

    IEEE Press

    Publication History

    Published: 01 December 2013

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 14 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)CONQUER: Contextual Query-aware Ranking for Video Corpus Moment RetrievalProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475281(3900-3908)Online publication date: 17-Oct-2021
    • (2014)Bridging the User Intention GapProceedings of the First International Workshop on Internet-Scale Multimedia Management10.1145/2661714.2661720(59-64)Online publication date: 7-Nov-2014

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media