Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3688867acmconferencesBook PagePublication PagesmmConference Proceedingsconference-collections
McGE '24: Proceedings of the 2nd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice
ACM2024 Proceeding
  • Program Chairs:
  • Cheng Jin,
  • Liang He,
  • Mingli Song,
  • Rui Wang
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
MM '24: The 32nd ACM International Conference on Multimedia Melbourne VIC Australia 28 October 2024- 1 November 2024
ISBN:
979-8-4007-1194-7
Published:
28 October 2024
Sponsors:

Reflects downloads up to 23 Nov 2024Bibliometrics
Skip Abstract Section
Abstract

It is our great pleasure to welcome you to the 2nd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice- McGE 2024

We believe that this workshop will provide a valuable platform for researchers and practitioners to discuss and exchange ideas on the latest advancements, challenges, and opportunities in the rapidly evolving field of multimedia content generation.

Skip Table Of Content Section
SESSION: Opening Session
short-paper
Free
McGE '24: The 2nd International Workshop on Multimedia Content Generation and Evaluation: New Methods & Practice

This workshop aims to explore key topics in the multimedia field, focusing on multimedia content generation, quality assessment, dataset creation, and construction. These topics are essential for the growth and advancement of the multimedia domain. ...

SESSION: Multimedia Content Evaluation: New Methods and Practice
research-article
Free
Jointly Text Region and Stroke Modeling for Scene Text Removal

Scene text removal has been widely applied in various applications due to its remarkable progress. Most of the previous methods apply text regions or strokes individually to provide text location information which exhibit distinct advantages and ...

research-article
Free
Text-guided Multi-Task Image Aesthetic Quality Assessment

In the realm of image aesthetic quality assessment, additional tagging information, such as scene classification, photographic style, and aesthetic attributes, embodies a wealth of aesthetic connotations. The textual descriptions and visual features ...

research-article
Free
Spatial and Channel Squeeze & Excitation in Adapting Vision Transformers for Temporal Action Localization

Transformer-based methods have achieved impressive performance on temporal action localization (TAL). Although this achievement is attributed to the multiheaded self-attention (MSA) mechanism, there is still a lack of systematic understanding. ...

research-article
Free
High Quality Fire Smoke Dataset: A Benchmark for Fire and Smoke Detection

In this paper, we present the High Quality Fire Smoke Dataset(HQFSD), a new comprehensive fire and smoke dataset tailored for training and evaluating fire detection algorithms. It currently comprises 12,166 meticulously selected images sourced from over ...

research-article
Free
SAFormer: An Efficient Hierarchical Transformer Network Specialized for Temporal Action Detection

Temporal action detection (TAD) is a critical task of multimedia video understanding, focusing on accurately predicting the starting and ending times of action instances along with their classification. Most previous works have relied on two-stage ...

research-article
Free
Attention Mixture Network for Crowd Counting via Binarization Transfer

Crowd counting endeavors to estimate the numerical count of individuals present within an image depicting a gathering of people. In recent years, there has been notable and gradual advancement in the realm of crowd counting, driven by the integration of ...

research-article
Open Access
RecipeSD: Injecting Recipe into Food Image Synthesis with Stable Diffusion

In this paper, we introduce RecipeSD, a novel approach for food image synthesis using Stable Diffusion, enhanced by integrating recipe text information. RecipeSD leverages a pretrained recipe encoder from a cross-modal retrieval task to extract ...

research-article
Free
Predicting Scores of Various Aesthetic Attribute Sets by Learning from Overall Score Labels

For aesthetic attribute evaluation of images (AAEI), the annotation of image aesthetic attribute scores plays an important role. It requires experienced artists and professional photographers, which hinders the collection of large-scale fully-annotated ...

research-article
Open Access
BrandDiffusion: Multimodal Personalized Marketing Visual Content Generation

Creating visual content such as product advertisements for marketing purposes has attracted research attention recently. Traditionally, such visuals showcase the product against a specific backdrop while adhering to a consistent corporate style to ...

Contributors
  • Zhejiang University

Index Terms

  1. Proceedings of the 2nd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice
      Index terms have been assigned to the content through auto-classification.
      Please enable JavaScript to view thecomments powered by Disqus.

      Recommendations