TOMM: Vol 17, No 1

Volume 17, Issue 1February 2021

Volume 17, Issue 1

February 2021

Editor:

Alberto Del Bimbo
University of Firenze, Italy

Publisher:

Association for Computing Machinery
New York
NY
United States

ISSN:1551-6857

EISSN:1551-6865

Tags:

Subscribe to Journal Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Issue Downloads

PDFfront matter (TOC, masthead, submission information)

Select All

Export Citations Save to Binder

editorial

Free

Table of Contents: Online Supplement Volume 17, Number 1s

Yang Wang

Article No.: 21e, Pages 1–3https://doi.org/10.1145/3457372

research-article

Multi-task Learning-based All-in-one Collaboration Framework for Degraded Image Super-resolution

Article No.: 21, Pages 1–21https://doi.org/10.1145/3417333

In this article, we address the degraded image super-resolution problem in a multi-task learning (MTL) manner. To better share representations between multiple tasks, we propose an all-in-one collaboration framework (ACF) with a learnable “junction” unit ...

research-article

Cumulative Quality Modeling for HTTP Adaptive Streaming

Article No.: 22, Pages 1–24https://doi.org/10.1145/3423421

HTTP Adaptive Streaming has become the de facto choice for multimedia delivery. However, the quality of adaptive video streaming may fluctuate strongly during a session due to throughput fluctuations. So, it is important to evaluate the quality of a ...

research-article

Socializing the Videos: A Multimodal Approach for Social Relation Recognition

Article No.: 23, Pages 1–23https://doi.org/10.1145/3416493

As a crucial task for video analysis, social relation recognition for characters not only provides semantically rich description of video content but also supports intelligent applications, e.g., video retrieval and visual question answering. ...

research-article

Open Access

Robust Secret Image Sharing Resistant to Noise in Shares

Article No.: 24, Pages 1–22https://doi.org/10.1145/3419750

A secret image is split into \(\) shares in the generation phase of secret image sharing (SIS) for a \(\) threshold. In the recovery phase, the secret image is recovered when any \(\) or more shares are collected, and each collected share is generally assumed to ...

research-article

ART-UP: A Novel Method for Generating Scanning-Robust Aesthetic QR Codes

Article No.: 25, Pages 1–23https://doi.org/10.1145/3418214

Quick response (QR) codes are usually scanned in different environments, so they must be robust to variations in illumination, scale, coverage, and camera angles. Aesthetic QR codes improve the visual quality, but subtle changes in their appearance may ...

research-article

Compressed Imaging Reconstruction with Sparse Random Projection

Article No.: 26, Pages 1–25https://doi.org/10.1145/3447431

As the Internet of Things thrives, monitors and cameras produce tons of image data every day. To efficiently process these images, many compressed imaging frameworks are proposed. A compressed imaging framework comprises two parts, image signal ...

research-article

GreyReID: A Novel Two-stream Deep Framework with RGB-grey Information for Person Re-identification

Article No.: 27, Pages 1–22https://doi.org/10.1145/3419439

In this article, we observe that most false positive images (i.e., different identities with query images) in the top ranking list usually have the similar color information with the query image in person re-identification (Re-ID). Meanwhile, when we use ...

research-article

Bi-manual Haptic-based Periodontal Simulation with Finger Support and Vibrotactile Feedback

Article No.: 28, Pages 1–17https://doi.org/10.1145/3421765

The rise of virtual reality and haptic technologies has created exciting new applications in medical training and education. In a dental simulation, haptic technology can create the illusion of substances (teeth, gingiva, bone, etc.) by providing ...

research-article

Open Access

Multi-human Parsing with a Graph-based Generative Adversarial Model

Article No.: 29, Pages 1–21https://doi.org/10.1145/3418217

Human parsing is an important task in human-centric image understanding in computer vision and multimedia systems. However, most existing works on human parsing mainly tackle the single-person scenario, which deviates from real-world applications where ...

research-article

Open Access

Improved Jitter Buffer Management for WebRTC

Article No.: 30, Pages 1–20https://doi.org/10.1145/3410449

This work studies the jitter buffer management algorithm for Voice over IP in WebRTC. In particular, it details the core concepts of WebRTC’s jitter buffer management. Furthermore, it investigates how jitter buffer management algorithm behaves under ...

research-article

Automated Orchestration of Online Educational Collaboration in Cloud-based Environments

Article No.: 31, Pages 1–26https://doi.org/10.1145/3412381

Integrated collaboration environments (ICEs) are widely used by corporations to increase productivity by fostering groupwide and interpersonal collaboration. In this article, we discuss the enhancements of such environment needed to build an educational ...

research-article

Bottom-up and Layerwise Domain Adaptation for Pedestrian Detection in Thermal Images

Article No.: 32, Pages 1–19https://doi.org/10.1145/3418213

Pedestrian detection is a canonical problem for safety and security applications, and it remains a challenging problem due to the highly variable lighting conditions in which pedestrians must be detected. This article investigates several domain ...

research-article

Open Access

Market2Dish: Health-aware Food Recommendation

Article No.: 33, Pages 1–19https://doi.org/10.1145/3418211

With the rising incidence of some diseases, such as obesity and diabetes, the healthy diet is arousing increasing attention. However, most existing food-related research efforts focus on recipe retrieval, user-preference-based food recommendation, cooking ...

research-article

Affinity Derivation for Accurate Instance Segmentation

Article No.: 34, Pages 1–20https://doi.org/10.1145/3407090

Affinity, which represents whether two pixels belong to a same instance, is an equivalent representation to the instance segmentation labels. Conventional works do not make an explicit exploration on the affinity. In this article, we present two instance ...

research-article

Conditional LSTM-GAN for Melody Generation from Lyrics

Article No.: 35, Pages 1–20https://doi.org/10.1145/3424116

Melody generation from lyrics has been a challenging research issue in the field of artificial intelligence and music, which enables us to learn and discover latent relationships between interesting lyrics and accompanying melodies. Unfortunately, the ...

research-article

Attribute-wise Explainable Fashion Compatibility Modeling

Article No.: 36, Pages 1–21https://doi.org/10.1145/3425636

With the boom of the fashion market and people’s daily needs for beauty, clothing matching has gained increased research attention. In a sense, tackling this problem lies in modeling the human notions of the compatibility between fashion items, i.e., ...

research-article

A Semi-supervised Learning Approach Based on Adaptive Weighted Fusion for Automatic Image Annotation

Article No.: 37, Pages 1–23https://doi.org/10.1145/3426974

To learn a well-performed image annotation model, a large number of labeled samples are usually required. Although the unlabeled samples are readily available and abundant, it is a difficult task for humans to annotate large numbers of images manually. In ...

research-article

360-Degree VR Video Watermarking Based on Spherical Wavelet Transform

Article No.: 38, Pages 1–23https://doi.org/10.1145/3425605

Similar to conventional video, the increasingly popular 360 virtual reality (VR) video requires copyright protection mechanisms. The classic approach for copyright protection is the introduction of a digital watermark into the video sequence. Due to the ...

Subjects

Comments

Please enable JavaScript to view thecomments powered by Disqus.

ACM Transactions on Multimedia Computing, Communications, and Applications

Sections

Issue Downloads

Table of Contents: Online Supplement Volume 17, Number 1s

Multi-task Learning-based All-in-one Collaboration Framework for Degraded Image Super-resolution

Cumulative Quality Modeling for HTTP Adaptive Streaming

Socializing the Videos: A Multimodal Approach for Social Relation Recognition

Robust Secret Image Sharing Resistant to Noise in Shares

ART-UP: A Novel Method for Generating Scanning-Robust Aesthetic QR Codes

Compressed Imaging Reconstruction with Sparse Random Projection

GreyReID: A Novel Two-stream Deep Framework with RGB-grey Information for Person Re-identification

Bi-manual Haptic-based Periodontal Simulation with Finger Support and Vibrotactile Feedback

Multi-human Parsing with a Graph-based Generative Adversarial Model

Improved Jitter Buffer Management for WebRTC

Automated Orchestration of Online Educational Collaboration in Cloud-based Environments

Bottom-up and Layerwise Domain Adaptation for Pedestrian Detection in Thermal Images

Market2Dish: Health-aware Food Recommendation

Affinity Derivation for Accurate Instance Segmentation

Conditional LSTM-GAN for Melody Generation from Lyrics

Attribute-wise Explainable Fashion Compatibility Modeling

A Semi-supervised Learning Approach Based on Adaptive Weighted Fusion for Automatic Image Annotation

360-Degree VR Video Watermarking Based on Spherical Wavelet Transform