Issue Downloads
Multi-task Learning-based All-in-one Collaboration Framework for Degraded Image Super-resolution
In this article, we address the degraded image super-resolution problem in a multi-task learning (MTL) manner. To better share representations between multiple tasks, we propose an all-in-one collaboration framework (ACF) with a learnable “junction” unit ...
Cumulative Quality Modeling for HTTP Adaptive Streaming
HTTP Adaptive Streaming has become the de facto choice for multimedia delivery. However, the quality of adaptive video streaming may fluctuate strongly during a session due to throughput fluctuations. So, it is important to evaluate the quality of a ...
Socializing the Videos: A Multimodal Approach for Social Relation Recognition
As a crucial task for video analysis, social relation recognition for characters not only provides semantically rich description of video content but also supports intelligent applications, e.g., video retrieval and visual question answering. ...
ART-UP: A Novel Method for Generating Scanning-Robust Aesthetic QR Codes
Quick response (QR) codes are usually scanned in different environments, so they must be robust to variations in illumination, scale, coverage, and camera angles. Aesthetic QR codes improve the visual quality, but subtle changes in their appearance may ...
Compressed Imaging Reconstruction with Sparse Random Projection
As the Internet of Things thrives, monitors and cameras produce tons of image data every day. To efficiently process these images, many compressed imaging frameworks are proposed. A compressed imaging framework comprises two parts, image signal ...
GreyReID: A Novel Two-stream Deep Framework with RGB-grey Information for Person Re-identification
In this article, we observe that most false positive images (i.e., different identities with query images) in the top ranking list usually have the similar color information with the query image in person re-identification (Re-ID). Meanwhile, when we use ...
Bi-manual Haptic-based Periodontal Simulation with Finger Support and Vibrotactile Feedback
The rise of virtual reality and haptic technologies has created exciting new applications in medical training and education. In a dental simulation, haptic technology can create the illusion of substances (teeth, gingiva, bone, etc.) by providing ...
Multi-human Parsing with a Graph-based Generative Adversarial Model
- Jianshu Li,
- Jian Zhao,
- Congyan Lang,
- Yidong Li,
- Yunchao Wei,
- Guodong Guo,
- Terence Sim,
- Shuicheng Yan,
- Jiashi Feng
Human parsing is an important task in human-centric image understanding in computer vision and multimedia systems. However, most existing works on human parsing mainly tackle the single-person scenario, which deviates from real-world applications where ...
Improved Jitter Buffer Management for WebRTC
This work studies the jitter buffer management algorithm for Voice over IP in WebRTC. In particular, it details the core concepts of WebRTC’s jitter buffer management. Furthermore, it investigates how jitter buffer management algorithm behaves under ...
Automated Orchestration of Online Educational Collaboration in Cloud-based Environments
Integrated collaboration environments (ICEs) are widely used by corporations to increase productivity by fostering groupwide and interpersonal collaboration. In this article, we discuss the enhancements of such environment needed to build an educational ...
Bottom-up and Layerwise Domain Adaptation for Pedestrian Detection in Thermal Images
Pedestrian detection is a canonical problem for safety and security applications, and it remains a challenging problem due to the highly variable lighting conditions in which pedestrians must be detected. This article investigates several domain ...
Market2Dish: Health-aware Food Recommendation
With the rising incidence of some diseases, such as obesity and diabetes, the healthy diet is arousing increasing attention. However, most existing food-related research efforts focus on recipe retrieval, user-preference-based food recommendation, cooking ...
Affinity Derivation for Accurate Instance Segmentation
Affinity, which represents whether two pixels belong to a same instance, is an equivalent representation to the instance segmentation labels. Conventional works do not make an explicit exploration on the affinity. In this article, we present two instance ...
Conditional LSTM-GAN for Melody Generation from Lyrics
Melody generation from lyrics has been a challenging research issue in the field of artificial intelligence and music, which enables us to learn and discover latent relationships between interesting lyrics and accompanying melodies. Unfortunately, the ...
Attribute-wise Explainable Fashion Compatibility Modeling
With the boom of the fashion market and people’s daily needs for beauty, clothing matching has gained increased research attention. In a sense, tackling this problem lies in modeling the human notions of the compatibility between fashion items, i.e., ...
A Semi-supervised Learning Approach Based on Adaptive Weighted Fusion for Automatic Image Annotation
To learn a well-performed image annotation model, a large number of labeled samples are usually required. Although the unlabeled samples are readily available and abundant, it is a difficult task for humans to annotate large numbers of images manually. In ...