default search action
MMAsia 2021: Gold Coast, Australia
- Changwen Chen, Helen Huang, Jun Zhou, Tatsuya Harada, Jianfei Cai, Wu Liu, Dong Xu:
MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1 - 3, 2021. ACM 2021, ISBN 978-1-4503-8607-4
Full Papers
- Haotian Sun, Jiwei Wei, Yang Yang, Xing Xu:
Semantic Enhanced Cross-modal GAN for Zero-shot Learning. 1:1-1:7 - Hehe Fan, Mohan S. Kankanhalli:
Motion = Video - Content: Towards Unsupervised Learning of Motion Representation from Videos. 2:1-2:7 - Zheng Zhang, Jianning Wang, Guangming Lu:
Towards Discriminative Visual Search via Semantically Cycle-consistent Hashing Networks. 3:1-3:7 - Dan Zhang, Mao Ye, Lin Xiong, Shuaifeng Li, Xue Li:
Source-Style Transferred Mean Teacher for Source-data Free Object Detection. 4:1-4:8 - Yihui Shi, Yun Liu, Fangxiang Feng, Ruifan Li, Zhanyu Ma, Xiaojie Wang:
S2TD: A Tree-Structured Decoder for Image Paragraph Captioning. 5:1-5:7 - Shiming Ge, Fanzhao Lin, Chenyu Li, Daichi Zhang, Jiyong Tan, Weiping Wang, Dan Zeng:
Latent Pattern Sensing: Deepfake Video Detection via Predictive Representation Learning. 6:1-6:7 - Nobukatsu Kajiura, Hong Liu, Shin'ichi Satoh:
Improving Camouflaged Object Detection with the Uncertainty of Pseudo-edge Labels. 7:1-7:7 - Jiapeng Tang, Yi Fang, Yu Dong, Rong Xie, Xiao Gu, Guangtao Zhai, Li Song:
Blindly Predict Image and Video Quality in the Wild. 8:1-8:7 - Peng-Fei Zhang, Pengfei Zhao, Xin Luo, Xin-Shun Xu:
BRUSH: Label Reconstructing and Similarity Preserving Hashing for Cross-modal Retrieval. 9:1-9:7 - Chen Wang, Yazhou Yao, Qiong Wang, Zhenmin Tang:
Local Self-Attention on Fine-grained Cross-media Retrieval. 10:1-10:7 - Yajie Zhang, Yuxuan Dai, Wei Tang, Lu Jin, Xinguang Xiang:
Self-Adaptive Hashing for Fine-Grained Image Retrieval. 11:1-11:7 - Hang Yu, Weixin Li, Jiankai Li, Ye Du:
Entity Relation Fusion for Real-Time One-Stage Referring Expression Comprehension. 12:1-12:8 - Qianxing Li, Shaofan Wang, Dehui Kong, Baocai Yin:
A Local-Global Commutative Preserving Functional Map for Shape Correspondence. 13:1-13:7 - Haolin Liu, Chenyu Li, Bochao Liu, Pengju Wang, Shiming Ge, Weiping Wang:
Differentially Private Learning with Grouped Gradient Clipping. 14:1-14:7 - Ziyang Ma, Xianjing Han, Xuemeng Song, Yiran Cui, Liqiang Nie:
Hierarchical Deep Residual Reasoning for Temporal Moment Localization. 15:1-15:7 - Yixin Zhang, Yoko Yamakata, Keishi Tajima:
MIRecipe: A Recipe Dataset for Stage-Aware Recognition of Changes in Appearance of Ingredients. 16:1-16:7 - Jiazhong Chen, Jie Chen, Yuan Dong, Dakai Ren, Shiqi Zhang, Zongyi Li:
Video Saliency Prediction via Deep Eye Movement Learning. 17:1-17:6 - Yu Liu, Xiaopeng Hong, Xiaoyu Tao, Songlin Dong, Jingang Shi, Yihong Gong:
Structural Knowledge Organization and Transfer for Class-Incremental Learning. 18:1-18:7 - Pengju Zhang, Chaofan Zhang, Zheng Rong, Yihong Wu:
Learning to Decompose and Restore Low-light Images with Wavelet Transform. 19:1-19:7 - Zhuoxiao Chen, Yadan Luo, Mahsa Baktashmotlagh:
Conditional Extreme Value Theory for Open Set Video Domain Adaptation. 20:1-20:8 - Yahui Xu, Yi Bin, Guoqing Wang, Yang Yang:
Hierarchical Composition Learning for Composed Query Image Retrieval. 21:1-21:7 - Yalu Cheng, Pengchong Qiao, Hongliang He, Guoli Song, Jie Chen:
Hard-Boundary Attention Network for Nuclei Instance Segmentation. 22:1-22:7 - Jinxing Pan, Xiaoshan Yang, Yi Huang, Changsheng Xu:
Few-shot Egocentric Multimodal Activity Recognition. 23:1-23:7 - Ruichao Fan, Hanli Wang, Jinjing Gu, Xianhui Liu:
Visual Storytelling with Hierarchical BERT Semantic Guidance. 24:1-24:7 - Lorenzo Seidenari, Leonardo Galteri, Pietro Bongini, Marco Bertini, Alberto Del Bimbo:
Language Based Image Quality Assessment. 25:1-25:7 - Ludan Ruan, Qin Jin:
Efficient Proposal Generation with U-shaped Network for Temporal Sentence Grounding. 26:1-26:7 - Dongliang Shao, Yunhui Shi, Jin Wang, Nam Ling, Baocai Yin:
A Model-Guided Unfolding Network for Single Image Reflection Removal. 27:1-27:7 - Haopeng Xie, Liang Xiao, Huicong Wu:
Intra- and Inter-frame Iterative Temporal Convolutional Networks for Video Stabilization. 28:1-28:7 - Ziqian Liu, Qing Ma, Junjun Jiang, Xianming Liu:
Improving Hyperspectral Super-Resolution via Heterogeneous Knowledge Distillation. 29:1-29:7 - Kang You, Pan Gao:
Patch-Based Deep Autoencoder for Point Cloud Geometry Compression. 30:1-30:7 - Masahiro Suzuki:
Score Transformer: Generating Musical Score from Note-level Representation. 31:1-31:7 - Shuang Li, Lichun Wang, Shaofan Wang, Dehui Kong, Baocai Yin:
Zero-shot Recognition with Image Attributes Generation using Hierarchical Coupled Dictionary Learning. 32:1-32:7 - Shivangi Singhal, Mudit Dhawan, Rajiv Ratn Shah, Ponnurangam Kumaraguru:
Inter-modality Discordance for Multimodal Fake News Detection. 33:1-33:7
Short Papers
- Zhichao Fu, Tianlong Ma, Liang Xue, Yingbin Zheng, Hao Ye, Liang He:
A Coarse-to-fine Approach for Fast Super-Resolution with Flexible Magnification. 34:1-34:5 - Zhanpeng Huang, Rui Han, Jianwen Huang, Hao Yin, Zipeng Qin, Zibin Wang:
Automatically Generate Rigged Character from Single Image. 35:1-35:5 - Jing Xu, Wei Zhang, Yalong Bai, Qibin Sun, Tao Mei:
Flat and Shallow: Understanding Fake Image Detection Models by Architecture Profiling. 36:1-36:5 - Jiading Ling, Xingcai Wu, Zhenguo Yang, Xudong Mao, Qing Li, Wenyin Liu:
Multi-branch Semantic Learning Network for Text-to-Image Synthesis. 37:1-37:5 - Wenjun Hui, Chuangchuang Tan, Guanghua Gu:
Attention-based Dual-Branches Localization Network for Weakly Supervised Object Localization. 38:1-38:5 - Donnaphat Trakulwaranont, Marc A. Kastner, Shin'ichi Satoh:
Pose-aware Outfit Transfer between Unpaired in-the-wild Fashion Images. 39:1-39:5 - Yang Wu, Shirui Feng, Guanbin Li, Liang Lin:
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation. 40:1-40:5 - Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages. 41:1-41:5 - Federico Becattini, Xuemeng Song, Claudio Baecchi, Shi-Ting Fang, Claudio Ferrari, Liqiang Nie, Alberto Del Bimbo:
PLM-IPE: A Pixel-Landmark Mutual Enhanced Framework for Implicit Preference Estimation. 42:1-42:5 - Mehmet N. Akcay, Burak Kara, Saba Ahsan, Ali C. Begen, Igor D. D. Curcio, Emre Aksu:
Head-Motion-Aware Viewport Margins for Improving User Experience in Immersive Video. 43:1-43:5 - Hao Zhang, Qi Zhang, Phuong Anh Nguyen, Victor C. S. Lee, Antoni Bert Chan:
Chinese White Dolphin Detection in the Wild. 44:1-44:5 - Md. Rafi Ur Rashid, Mahim Mahbub, Muhammad Abdullah Adnan:
BAND: A Benchmark Dataset forBangla News Audio Classification. 45:1-45:6 - Chang Kong, Qiuming Luo, Guoliang Chen:
A comparison study: the impact of age and gender distribution on age estimation. 46:1-46:5 - Huan Wang, Yunhui Shi, Jin Wang, Gang Wu, Nam Ling, Baocai Yin:
Spherical Image Compression Using Spherical Wavelet Transform. 47:1-47:5 - Ke-Xin Zhang, Gangyi Jiang, Mei Yu:
FQM-GC: Full-reference Quality Metric for Colored Point Cloud Based on Graph Signal Features and Color Features. 48:1-48:5 - Chenyu Guo, Jiyang Xie, Kongming Liang, Xian Sun, Zhanyu Ma:
Cross-layer Navigation Convolutional Neural Network for Fine-grained Visual Classification. 49:1-49:5 - Mohit Sharma, Raj Aaryaman Patra, Harshal Desai, Shruti Vyas, Yogesh S. Rawat, Rajiv Ratn Shah:
NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels. 50:1-50:5 - Fengjie Xu, Chang-Hua Zhang, Zhongshu Chen, Zhekai Du, Lei Han, Lin Zuo:
CMRD-Net: An Improved Method for Underwater Image Enhancement. 51:1-51:5 - Letian Wang, Xiushan Nie, Quan Zhou, Yang Shi, Xingbo Liu:
Deep Multiple Length Hashing via Multi-task Learning. 52:1-52:5 - Xiaoyu Geng, Qiang Guo, Caiming Zhang:
Color Image Denoising via Tensor Robust PCA with Nonconvex and Nonlocal Regularization. 53:1-53:5 - Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo:
Conditioned Image Retrieval for Fashion using Contrastive Learning and CLIP-based Features. 54:1-54:5 - Tian Tian, Li Liu, Huaxiang Zhang, Dongmei Liu:
PBNet: Position-specific Text-to-image Generation by Boundary. 55:1-55:4 - Shuguang Zhao, Bingzhi Chen, Zheng Zhang, Guangming Lu:
An Embarrassingly Simple Approach to Discrete Supervised Hashing. 56:1-56:5 - Qiming Lu, Shikui Wei, Haoyu Chu, Yao Zhao:
Towards Transferable 3D Adversarial Attack. 57:1-57:5 - Ximing Wu, Lei Zhang, Yingfeng Wu, Haobin Zhou, Laizhong Cui:
Delay-sensitive and Priority-aware Transmission Control for Real-time Multimedia Communications. 58:1-58:5 - Nao Takeuchi, Tomoko Koda:
Impression of a Job Interview training agent that gives rationalized feedback: Should Virtual Agent Give Advice with Rationale? 59:1-59:5
Demo Papers
- Lingcan Meng, Xiushan Nie, Zhifang Tan:
An Efficient Bus Crowdedness Classification System. 60:1-60:2 - Arun Zachariah, Maha Alrasheed:
Private-Share: A Secure and Privacy-Preserving De-Centralized Framework for Large Scale Data Sharing. 61:1-61:3 - Zhuoxiao Chen, Yiyun Zhang, Yadan Luo, Zijian Wang, Jinjiang Zhong, Anthony Southon:
RoadAtlas: Intelligent Platform for Automated Road Defect Detection and Asset Management. 62:1-62:3
Applied Research Papers
- Jun Yao Francis Lee, Narayanan Rajeev, Anand Bhojan:
Goldeye: Enhanced Spatial Awareness for the Visually Impaired using Mixed Reality and Vibrotactile Feedback. 63:1-63:7 - Ailin Chen, Rui Jesus, Márcia Vilarigues:
Convolutional Neural Network-Based Pure Paint Pigment Identification Using Hyperspectral Images. 64:1-64:7 - Shengze Yu, Xin Wang, Wenwu Zhu:
CFCR: A Convolution and Fusion Model for Cross-platform Recommendation. 65:1-65:6
Brave New Ideas
- Chandan Misra:
SangeetXML: An XML Format for Score Retrieval for Indic Music. 66:1-66:5 - Shahram Ghandeharizadeh:
Holodeck: Immersive 3D Displays Using Swarms of Flying Light Specks [Extended Abstract]. 67:1-67:7 - Ming Cheung, Weiwei Sun, Jiantao Zhou:
Discovering Social Connections using Event Images. 68:1-68:5
Grand Challenge
- Beibei Zhang, Fan Yu, Yaqun Fang, Tongwei Ren, Gangshan Wu:
Hybrid Improvements in Multimodal Analysis for Deep Video Understanding. 69:1-69:5
W1: Visual Tasks and Challenges under Low-quality Multimedia Data
- Jun Zhang, Xian Zhong, Jingling Yuan, Shilei Zhao, Rongbo Zhang, Duxiu Feng, Luo Zhong:
Local-enhanced Multi-resolution Representation Learning for Vehicle Re-identification. 70:1-70:6 - Xiaolei Luo, Sen Xiang, Yingfeng Wang, Qiong Liu, You Yang, Kejun Wu:
Dedark+Detection: A Hybrid Scheme for Object Detection under Low-light Surveillance. 71:1-71:5 - Tomu Hirata, Yusuke Mukuta, Tatsuya Harada:
Making Video Recognition Models Robust to Common Corruptions With Supervised Contrastive Learning. 72:1-72:6 - Lingyi Lu, Xin Xu:
Visible-Infrared Cross-Modal Person Re-identification based on Positive Feedback. 73:1-73:6
W2: Multi-modal Embedding and Understanding
- Yangyang Li, Jun Li, Hao Jin, Liang Peng:
Focusing Attention across Multiple Images for Multimodal Event Detection. 74:1-74:6 - Zehui Hu, Zidong Su, Yangding Li, Junbo Ma:
Adaptive Cross-stitch Graph Convolutional Networks. 75:1-75:7 - Ayaka Ideno, Yusuke Mukuta, Tatsuya Harada:
Generation of Variable-Length Time Series from Text using Dynamic Time Warping-Based Method. 76:1-76:7 - Zidong Su, Zehui Hu, Yangding Li:
Hierarchical Graph Representation Learning with Local Capsule Pooling. 77:1-77:7 - Yang Shi, Xiushan Nie, Quan Zhou, Li Zou, Yilong Yin:
Deep Adaptive Attention Triple Hashing. 78:1-78:5
W3: Multi-model Computing of Marine Big Data
- Hao Liu, Qian Wang, Xiaotong Hu:
Deep Reinforcement Learning and Docking Simulations for autonomous molecule generation in de novo Drug Design. 79:1-79:6 - Xiaorui Han, Zhiqi Chen, Ruixue Wang, Pengfei Zhao:
Joint label refinement and contrastive learning with hybrid memory for Unsupervised Marine Object Re-Identification. 80:1-80:6 - Yangyang Li, Jie Liu, Hao Liu:
Prediction of Transcription Factor Binding Sites Using Deep Learning Combined with DNA Sequences and Shape Feature Data. 81:1-81:6 - Hao Liu, Jinmeng Yan, Yuandong Zhou:
A Reinforcement Learning-Based Reward Mechanism for Molecule Generation that Introduces Activity Information. 82:1-82:5 - Rui Wang, Chengyu Zheng, Yanru Jiang, Zhaoxin Wang, Min Ye, Chenglong Wang, Ning Song, Jie Nie:
A Fine-Grained River Ice Semantic Segmentation based on Attentive Features and Enhancing Feature Fusion. 83:1-83:8 - Yanru Jiang, Chengyu Zheng, Zhaoxin Wang, Rui Wang, Min Ye, Chenglong Wang, Ning Song, Jie Nie:
Multi-Scale Graph Convolutional Network and Dynamic Iterative Class Loss for Ship Segmentation in Remote Sensing Images. 84:1-84:9
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.