default search action
28th MMM 2022: Phu Quoc, Vietnam - Part I
- Björn Þór Jónsson, Cathal Gurrin, Minh-Triet Tran, Duc-Tien Dang-Nguyen, Anita Min-Chun Hu, Huynh Thi Thanh Binh, Benoit Huet:
MultiMedia Modeling - 28th International Conference, MMM 2022, Phu Quoc, Vietnam, June 6-10, 2022, Proceedings, Part I. Lecture Notes in Computer Science 13141, Springer 2022, ISBN 978-3-030-98357-4
Best Paper Session
- Yaxuan Hu, Yuehong Dai, Zhongxiang Wang:
Real-time Detection of Tiny Objects Based on a Weighted Bi-directional FPN. 3-14 - Boqun Li, Zhong Qian, Peifeng Li, Qiaoming Zhu:
Multi-modal Fusion Network for Rumor Detection with Texts and Images. 15-27 - Yuan Chang, Tao Peng, Ruhan He, Xinrong Hu, Junping Liu, Zili Zhang, Minghua Jiang:
PF-VTON: Toward High-Quality Parser-Free Virtual Try-On Network. 28-40 - Yuyan Yang, Xin Ni, Yanbin Hao, Chenyu Liu, Wenshan Wang, Yifeng Liu, Haiyong Xie:
MF-GAN: Multi-conditional Fusion Generative Adversarial Network for Text-to-Image Synthesis. 41-53
Applications 1
- Kezhen Xie, Lei Huang, Wenfeng Zhang, Qibing Qin, Zhiqiang Wei:
Learning to Classify Weather Conditions from Single Images Without Labels. 57-68 - Yongquan Wan, Cairong Yan, Bofeng Zhang, Guobing Zou:
Learning Image Representation via Attribute-Aware Attention Networks for Fashion Classification. 69-81 - Yuan Chang, Tao Peng, Ruhan He, Xinrong Hu, Junping Liu, Zili Zhang, Minghua Jiang:
Toward Detail-Oriented Image-Based Virtual Try-On with Arbitrary Poses. 82-94 - Ilias Gialampoukidis, Stelios Andreadis, Nick Pantelidis, Sameed Hayat, Li Zhong, Marios Bakratsas, Dennis Hoppe, Stefanos Vrochidis, Ioannis Kompatsiaris:
Parallel DBSCAN-Martingale Estimation of the Number of Concepts for Automatic Satellite Image Clustering. 95-106
Multimedia Applications - Perspectives, Tools and Applications (Special Session) and Brave New Ideas
- Werner Bailer, Georg Thallinger, Verena Krawarik, Katharina Schell, Victoria Ertelthalner:
AI for the Media Industry: Application Potential and Automation Levels. 109-118 - Ladislav Peska, Jakub Lokoc:
Rating-Aware Self-Organizing Maps. 119-130 - Yana van de Sande, Martha A. Larson:
Color the Word: Leveraging Web Images for Machine Translation of Untranslatable Words. 131-138
Activities and Events
- Jiankai Li, Yunhong Wang, Weixin Li:
MGMP: Multimodal Graph Message Propagation Network for Event Detection. 141-153 - Jiewen Wang, Shuang Liang:
Pose-Enhanced Relation Feature for Action Recognition in Still Images. 154-165 - Tao Peng, Caiyin Tang, Jing Wang:
Prostate Segmentation of Ultrasound Images Based on Interpretable-Guided Mathematical Model. 166-177 - Lin Wang, Yan Song, Rui Yan, Xiangbo Shu:
Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection. 178-190
Multimedia Datasets for Repeatable Experimentation (Special Session)
- Jakub Lokoc, Werner Bailer, Kai Uwe Barthel, Cathal Gurrin, Silvan Heller, Björn Þór Jónsson, Ladislav Peska, Luca Rossetto, Klaus Schoeffmann, Lucia Vadicamo, Stefanos Vrochidis, Jiaxin Wu:
A Task Category Space for User-Centric Comparative Multimedia Search Evaluations. 193-204 - Konstantin Schall, Kai Uwe Barthel, Nico Hezel, Klaus Jung:
GPR1200: A Benchmark for General-Purpose Content-Based Image Retrieval. 205-216 - Ly-Duyen Tran, Thanh Cong Ho, Lan Anh Pham, Binh T. Nguyen, Cathal Gurrin, Liting Zhou:
LLQA - Lifelog Question Answering Dataset. 217-228
Learning
- Yijie Zhong, Zhengxing Sun, Shoutong Luo, Yunhan Sun, Wei Zhang:
Category-Sensitive Incremental Learning for Image-Based 3D Shape Reconstruction. 231-244 - Zhaoliang He, Yuan Wang, Chen Tang, Zhi Wang, Wenwu Zhu, Chenyang Guo, Zhibo Chen:
AdaConfigure: Reinforcement Learning-Based Adaptive Configuration for Video Analytics Services. 245-257 - Gursimran Singh, Lingyang Chu, Lanjun Wang, Jian Pei, Qi Tian, Yong Zhang:
Mining Minority-Class Examples with Uncertainty Estimates. 258-271 - Siyuan Chen:
Conditional Context-Aware Feature Alignment for Domain Adaptive Detection Transformer. 272-283
Multimedia for Medical Applications (Special Session)
- Vasileios-Rafail Xefteris, Athina Tsanousa, Thanassis Mavropoulos, Georgios Meditskos, Stefanos Vrochidis, Ioannis Kompatsiaris:
Human Activity Recognition with IMU and Vital Signs Feature Fusion. 287-298 - Zhaohui Zhu, Marc A. Kastner, Shin'ichi Satoh:
On Assisting Diagnoses of Pareidolia by Emulating Patient Behavior. 299-310 - Pooja Prajod, Tobias Huber, Elisabeth André:
Using Explainable AI to Identify Differences Between Clinical and Experimental Pain Detection Models Based on Facial Expressions. 311-322
Applications 2
- Xuena Ren, Dongming Zhang, Xiuguo Bao, Lei Shi:
Double Granularity Relation Network with Self-criticism for Occluded Person Re-identification. 325-338 - Haoyuan Zheng, Weihang Wang, Fei Wen, Peilin Liu:
A Complementary Fusion Strategy for RGB-D Face Recognition. 339-351 - Zhibin Xiao, Pengwei Xie, Guijin Wang:
Multi-scale Cross-Modal Transformer Network for RGB-D Object Detection. 352-363 - Jian He, Xian Zhong, Jingling Yuan, Ming Tan, Shilei Zhao, Luo Zhong:
Joint Re-Detection and Re-Identification for Multi-Object Tracking. 364-376
Multimedia Analytics for Contextual Human Understanding (Special Session)
- Srijith Unni, Sushma Suryanarayana Gowda, Alan F. Smeaton:
An Investigation into Keystroke Dynamics and Heart Rate Variability as Indicators of Stress. 379-391 - Thao V. Ha, Hoang Nguyen, Son T. Huynh, Trung T. Nguyen, Binh T. Nguyen:
Fall Detection Using Multimodal Data. 392-403 - Tenzin Palbar, Manoj Kesavulu, Cathal Gurrin, Renaat Verbruggen:
Prediction of Blood Glucose Using Contextual LifeLog Data. 404-415 - Liting Zhou, Cathal Gurrin:
Multimodal Embedding for Lifelog Retrieval. 416-427
Applications 3
- Yi Li, Dehao Wu, Yuesheng Zhu:
A Multiple Positives Enhanced NCE Loss for Image-Text Retrieval. 431-442 - Xiang Shuai, Xiao Wang, Wei Wang, Xin Yuan, Xin Xu:
SAM: Self Attention Mechanism for Scene Text Recognition Based on Swin Transformer. 443-454 - Jian Yang, Chi Do-Kim Pham, Jinjia Zhou:
JVCSR: Video Compressive Sensing Reconstruction with Joint In-Loop Reference Enhancement and Out-Loop Super-Resolution. 455-466 - Yingrui Wang, Suyu Wang, Longhua Sun:
Point Cloud Upsampling via a Coarse-to-Fine Network. 467-478
Image Analytics
- Yuzhuo Wang, Yanlin Geng:
Arbitrary Style Transfer with Adaptive Channel Network. 481-492 - Shuang Zheng, Liang Wang:
Fast Single Image Dehazing Using Morphological Reconstruction and Saturation Compensation. 493-504 - Lulu Zhao, Ling Shen, Richang Hong:
One-Stage Image Inpainting with Hybrid Attention. 505-517 - Jiayao Xu, Chen Fu, Zhiqiang Zhang, Jinjia Zhou:
Real-Time FPGA Design for OMP Targeting 8K Image Reconstruction. 518-529
Speech and Music
- Ke Liu, Chen Wang, Jiayue Chen, Jun Feng:
Time-Frequency Attention for Speech Emotion Recognition with Squeeze-and-Excitation Blocks. 533-543 - Jing Xiao, Jiaqi Liu, Dengshi Li, Lanxin Zhao, Qianrui Wang:
Speech Intelligibility Enhancement By Non-Parallel Speech Style Conversion Using CWT and iMetricGAN Based CycleGAN. 544-556 - Or Goren, Eliya Nachmani, Lior Wolf:
A-Muze-Net: Music Generation by Composing the Harmony Based on the Generated Melody. 557-568 - Abhishek Srivastava, Wei Duan, Rajiv Ratn Shah, Jianming Wu, Suhua Tang, Wei Li, Yi Yu:
Melody Generation from Lyrics Using Three Branch Conditional LSTM-GAN. 569-581
Multimodal Analytics
- Pengfei Du, Yali Gao, Xiaoyong Li:
Bi-attention Modal Separation Network for Multimodal Video Fusion. 585-598 - Qi Zhong, Qian Wang, Ji Liu:
Combining Knowledge and Multi-modal Fusion for Meme Classification. 599-611 - Binqiang Wang, Gang Dong, Yaqian Zhao, Rengang Li, Qichun Cao, Yinyin Chao:
Non-Uniform Attention Network for Multi-modal Sentiment Analysis. 612-623 - Yanbei Sun, Yao Lu, Haowei Lu, Qingjie Zhao, Shunzhou Wang:
Multimodal Unsupervised Image-to-Image Translation Without Independent Style Encoder. 624-636
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.