Author: Duan, Lingyu

Article

ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation

Medical Image Computing and Computer Assisted Intervention – MICCAI 2024, Pages 731–741, https://doi.org/10.1007/978-3-031-72390-2_68

Abstract

Electron microscopy (EM) imaging offers unparalleled resolution for analyzing neural tissues, crucial for uncovering the intricacies of synaptic connections and neural processes fundamental to understanding behavioral mechanisms. Recently, the ...

Article

A Unified Image Compression Method for Human Perception and Multiple Vision Tasks

Computer Vision – ECCV 2024, Pages 342–359, https://doi.org/10.1007/978-3-031-73209-6_20

Abstract

Recent advancements in end-to-end image compression demonstrate the potential to surpass traditional codecs regarding rate-distortion performance. However, current methods either prioritize human perceptual quality or solely optimize for one or a ...

research-article

Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 46, Issue 7, Pages 5174–5191, https://doi.org/10.1109/TPAMI.2024.3367293

As an emerging research practice leveraging recent advanced AI techniques, e.g. deep models based prediction and generation, Video Coding for Machines (VCM) is committed to bridging to an extent separate ...

research-article

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach

MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 1431–1442, https://doi.org/10.1145/3581783.3611851

Traditional image codecs prioritize signal fidelity and human perception, often neglecting machine vision tasks. Deep learning approaches have shown promising coding performance by leveraging rich semantic embeddings that can be optimized for both human ...

research-article

Coarse-to-fine Disentangling Demoiréing Framework for Recaptured Screen Images

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 8, Pages 9439–9453, https://doi.org/10.1109/TPAMI.2023.3243310

Removing the undesired moiré patterns from images capturing the contents displayed on screens is of increasing research interest, as the need for recording and sharing the instant information conveyed by the screens is growing. Previous demoiré...

research-article

Background Scene Recovery From an Image Looking Through Colored Glass

IEEE Transactions on Multimedia (TOM), Volume 25, Pages 2876–2887, https://doi.org/10.1109/TMM.2022.3152390

Colored glass, which is commonly seen in modern city life, often degrades images taken through it with co-occurring reflection and color bias due to its optical property of simultaneous transmission, reflection, and wavelength-selective absorption. ...

research-article

Open Access

Purifying Low-Light Images via Near-Infrared Enlightened Image

IEEE Transactions on Multimedia (TOM), Volume 25, Pages 8006–8019, https://doi.org/10.1109/TMM.2022.3232206

Cameras usually produce low-quality images under low-light conditions. Though many methods have been proposed to enhance the visibility of low-light images, they are mainly designed for illumination correction and less capable of suppressing the ...

research-article

Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning

IEEE Transactions on Multimedia (TOM), Volume 25, Pages 7287–7298, https://doi.org/10.1109/TMM.2022.3219680

Visual retrieval system faces frequent model update and deployment. It is a heavy workload to re-extract features of the whole database every time. Feature compatibility enables the learned new visual features to be directly compared with the old features ...

Article

mc-BEiT: Multi-choice Discretization for Image BERT Pre-training

Computer Vision – ECCV 2022, Pages 231–246, https://doi.org/10.1007/978-3-031-20056-4_14

Abstract

Image BERT pre-training with masked image modeling (MIM) becomes a popular practice to cope with self-supervised representation learning. A seminal work, BEiT, casts MIM as a classification task with a visual vocabulary, tokenizing the continuous ...

research-article

Disentangled Feature Learning Network and a Comprehensive Benchmark for Vehicle Re-Identification

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 44, Issue 10_Part_2, Pages 6854–6871, https://doi.org/10.1109/TPAMI.2021.3099253

Vehicle Re-Identification (ReID) is of great significance for public security and intelligent transportation. Large and comprehensive datasets are crucial for the development of vehicle ReID in model training and evaluation. However, existing datasets in ...

research-article

Intrinsic Performance Influence-based Participant Contribution Estimation for Horizontal Federated Learning

ACM Transactions on Intelligent Systems and Technology (TIST), Volume 13, Issue 6, Article No.: 88, Pages 1–24, https://doi.org/10.1145/3523059

The rapid development of modern artificial intelligence technique is mainly attributed to sufficient and high-quality data. However, in the data collection, personal privacy is at risk of being leaked. This issue can be addressed by federated learning, ...

research-article

Astute Video Transmission for Geographically Dispersed Devices in Visual IoT Systems

IEEE Transactions on Mobile Computing (ITMV), Volume 21, Issue 2, Pages 448–464, https://doi.org/10.1109/TMC.2020.3009745

Visual IoT (VIoT) is a promising IoT paradigm that visualizes sensing data from massive numbers of dispersed devices. A key objective in VIoT is to efficiently manage the devices to perform complex task-related visual data processing. Prior multimedia IoT ...

research-article

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding

IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 31, Issue 11, Pages 4147–4161, https://doi.org/10.1109/TCSVT.2021.3104305

The ubiquitous camera networks in the city brain system grow at a rapid pace, creating massive amounts of images and videos at a range of spatial-temporal scales and thereby forming the “biggest” big data. However, the sensing system often ...

research-article

Attribute-wise Explainable Fashion Compatibility Modeling

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 17, Issue 1, Article No.: 36, Pages 1–21, https://doi.org/10.1145/3425636

With the boom of the fashion market and people’s daily needs for beauty, clothing matching has gained increased research attention. In a sense, tackling this problem lies in modeling the human notions of the compatibility between fashion items, i.e., ...

research-article

Open Access

Market2Dish: Health-aware Food Recommendation

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 17, Issue 1, Article No.: 33, Pages 1–19, https://doi.org/10.1145/3418211

With the rising incidence of some diseases, such as obesity and diabetes, the healthy diet is arousing increasing attention. However, most existing food-related research efforts focus on recipe retrieval, user-preference-based food recommendation, cooking ...

research-article

Face Image Reflection Removal

International Journal of Computer Vision (IJCV), Volume 129, Issue 2, Pages 385–399, https://doi.org/10.1007/s11263-020-01372-5

Abstract

Face images captured through glass are usually contaminated by reflections. The low-transmitted reflections make the reflection removal more challenging than for general scenes because important facial features would be completely occluded. In ...

research-article

Free

Disentangled feature learning network for vehicle re-identification

IJCAI'20: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, Article No.: 66, Pages 474–480

Vehicle Re-Identification (ReID) has attracted lots of research efforts due to its great significance to the public security. In vehicle ReID, we aim to learn features that are powerful in discriminating subtle differences between vehicles which are ...

research-article

Hierarchical Connectivity-Centered Clustering for Unsupervised Domain Adaptation on Person Re-Identification

IEEE Transactions on Image Processing (TIP), Volume 30, Pages 6715–6729, https://doi.org/10.1109/TIP.2021.3094140

Unsupervised domain adaptation (UDA) on person Re-Identification (ReID) aims to transfer the knowledge from a labeled source domain to an unlabeled target domain. Recent works mainly optimize the ReID models with pseudo labels generated by unsupervised ...

research-article

Towards Large-Scale Object Instance Search: A Multi-Block N-Ary Trie

IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 31, Issue 1, Pages 372–386, https://doi.org/10.1109/TCSVT.2020.2966541

Object instance search is a challenging task with a wide range of applications, but the fast search with high accuracy has not been well solved yet. In this paper, we investigate the object instance search from a new perspective in terms of joint ...

research-article

Pose-native Network Architecture Search for Multi-person Human Pose Estimation

MM '20: Proceedings of the 28th ACM International Conference on Multimedia, Pages 592–600, https://doi.org/10.1145/3394171.3413842

Multi-person pose estimation has achieved great progress in recent years, even though, the precise prediction for occluded and invisible hard keypoints remains challenging. Most of the human pose estimation networks are equipped with an image ...
