Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleOctober 2024
ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation
Medical Image Computing and Computer Assisted Intervention – MICCAI 2024Pages 731–741https://doi.org/10.1007/978-3-031-72390-2_68AbstractElectron microscopy (EM) imaging offers unparalleled resolution for analyzing neural tissues, crucial for uncovering the intricacies of synaptic connections and neural processes fundamental to understanding behavioral mechanisms. Recently, the ...
- ArticleNovember 2024
A Unified Image Compression Method for Human Perception and Multiple Vision Tasks
AbstractRecent advancements in end-to-end image compression demonstrate the potential to surpass traditional codecs regarding rate-distortion performance. However, current methods either prioritize human perceptual quality or solely optimize for one or a ...
- research-articleFebruary 2024
Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 7Pages 5174–5191https://doi.org/10.1109/TPAMI.2024.3367293As an emerging research practice leveraging recent advanced AI techniques, e.g. deep models based prediction and generation, <bold>V</bold>ideo <bold>C</bold>oding for <bold>M</bold>achines (<bold>VCM</bold>) is committed to bridging to an extent separate ...
- research-articleOctober 2023
Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 1431–1442https://doi.org/10.1145/3581783.3611851Traditional image codecs prioritize signal fidelity and human perception, often neglecting machine vision tasks. Deep learning approaches have shown promising coding performance by leveraging rich semantic embeddings that can be optimized for both human ...
- research-articleAugust 2023
Coarse-to-fine Disentangling Demoiréing Framework for Recaptured Screen Images
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 8Pages 9439–9453https://doi.org/10.1109/TPAMI.2023.3243310Removing the undesired moiré patterns from images capturing the contents displayed on screens is of increasing research interest, as the need for recording and sharing the instant information conveyed by the screens is growing. Previous demoir&#...
-
- research-articleJanuary 2023
Background Scene Recovery From an Image Looking Through Colored Glass
IEEE Transactions on Multimedia (TOM), Volume 25Pages 2876–2887https://doi.org/10.1109/TMM.2022.3152390Colored glass, which is commonly seen in modern city life, often degrades images taken through it with co-occurring reflection and color bias due to its optical property of simultaneous transmission, reflection, and wavelength-selective absorption. ...
- research-articleDecember 2022
Purifying Low-Light Images via Near-Infrared Enlightened Image
IEEE Transactions on Multimedia (TOM), Volume 25Pages 8006–8019https://doi.org/10.1109/TMM.2022.3232206Cameras usually produce low-quality images under low-light conditions. Though many methods have been proposed to enhance the visibility of low-light images, they are mainly designed for illumination correction and less capable of suppressing the ...
- research-articleNovember 2022
Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning
IEEE Transactions on Multimedia (TOM), Volume 25Pages 7287–7298https://doi.org/10.1109/TMM.2022.3219680Visual retrieval system faces frequent model update and deployment. It is a heavy workload to re-extract features of the whole database every time. Feature compatibility enables the learned new visual features to be directly compared with the old features ...
- ArticleOctober 2022
- research-articleOctober 2022
Disentangled Feature Learning Network and a Comprehensive Benchmark for Vehicle Re-Identification
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 44, Issue 10_Part_2Pages 6854–6871https://doi.org/10.1109/TPAMI.2021.3099253Vehicle Re-Identification (ReID) is of great significance for public security and intelligent transportation. Large and comprehensive datasets are crucial for the development of vehicle ReID in model training and evaluation. However, existing datasets in ...
- research-articleSeptember 2022
Intrinsic Performance Influence-based Participant Contribution Estimation for Horizontal Federated Learning
ACM Transactions on Intelligent Systems and Technology (TIST), Volume 13, Issue 6Article No.: 88, Pages 1–24https://doi.org/10.1145/3523059The rapid development of modern artificial intelligence technique is mainly attributed to sufficient and high-quality data. However, in the data collection, personal privacy is at risk of being leaked. This issue can be addressed by federated learning, ...
- research-articleFebruary 2022
Astute Video Transmission for Geographically Dispersed Devices in Visual IoT Systems
IEEE Transactions on Mobile Computing (ITMV), Volume 21, Issue 2Pages 448–464https://doi.org/10.1109/TMC.2020.3009745Visual IoT (VIoT) is a promising IoT paradigm that visualizes sensing data from massive numbers of dispersed devices. A key objective in VIoT is to efficiently manage the devices to perform complex task-related visual data processing. Prior multimedia IoT ...
- research-articleNovember 2021
Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding
- Wen Gao,
- Siwei Ma,
- Lingyu Duan,
- Yonghong Tian,
- Peiyin Xing,
- Yaowei Wang,
- Shanshe Wang,
- Huizhu Jia,
- Tiejun Huang
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 31, Issue 11Pages 4147–4161https://doi.org/10.1109/TCSVT.2021.3104305The ubiquitous camera networks in the city brain system grow at a rapid pace, creating massive amounts of images and videos at a range of spatial-temporal scales and thereby forming the “biggest” big data. However, the sensing system often ...
- research-articleApril 2021
Attribute-wise Explainable Fashion Compatibility Modeling
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 17, Issue 1Article No.: 36, Pages 1–21https://doi.org/10.1145/3425636With the boom of the fashion market and people’s daily needs for beauty, clothing matching has gained increased research attention. In a sense, tackling this problem lies in modeling the human notions of the compatibility between fashion items, i.e., ...
- research-articleApril 2021
Market2Dish: Health-aware Food Recommendation
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 17, Issue 1Article No.: 33, Pages 1–19https://doi.org/10.1145/3418211With the rising incidence of some diseases, such as obesity and diabetes, the healthy diet is arousing increasing attention. However, most existing food-related research efforts focus on recipe retrieval, user-preference-based food recommendation, cooking ...
- research-articleFebruary 2021
Face Image Reflection Removal
International Journal of Computer Vision (IJCV), Volume 129, Issue 2Pages 385–399https://doi.org/10.1007/s11263-020-01372-5AbstractFace images captured through glass are usually contaminated by reflections. The low-transmitted reflections make the reflection removal more challenging than for general scenes because important facial features would be completely occluded. In ...
- research-articleJanuary 2021
Disentangled feature learning network for vehicle re-identification
IJCAI'20: Proceedings of the Twenty-Ninth International Joint Conference on Artificial IntelligenceArticle No.: 66, Pages 474–480Vehicle Re-Identification (ReID) has attracted lots of research efforts due to its great significance to the public security. In vehicle ReID, we aim to learn features that are powerful in discriminating subtle differences between vehicles which are ...
- research-articleJanuary 2021
Hierarchical Connectivity-Centered Clustering for Unsupervised Domain Adaptation on Person Re-Identification
IEEE Transactions on Image Processing (TIP), Volume 30Pages 6715–6729https://doi.org/10.1109/TIP.2021.3094140Unsupervised domain adaptation (UDA) on person Re-Identification (ReID) aims to transfer the knowledge from a labeled source domain to an unlabeled target domain. Recent works mainly optimize the ReID models with pseudo labels generated by unsupervised ...
- research-articleJanuary 2021
Towards Large-Scale Object Instance Search: A Multi-Block N-Ary Trie
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 31, Issue 1Pages 372–386https://doi.org/10.1109/TCSVT.2020.2966541Object instance search is a challenging task with a wide range of applications, but the fast search with high accuracy has not been well solved yet. In this paper, we investigate the object instance search from a new perspective in terms of joint ...
- research-articleOctober 2020
Pose-native Network Architecture Search for Multi-person Human Pose Estimation
MM '20: Proceedings of the 28th ACM International Conference on MultimediaPages 592–600https://doi.org/10.1145/3394171.3413842Multi-person pose estimation has achieved great progress in recent years, even though, the precise prediction for occluded and invisible hard keypoints remains challenging. Most of the human pose estimation networks are equipped with an image ...