default search action
ICMR 2020: Dublin, Ireland
- Cathal Gurrin, Björn Þór Jónsson, Noriko Kando, Klaus Schöffmann, Yi-Ping Phoebe Chen, Noel E. O'Connor:
Proceedings of the 2020 on International Conference on Multimedia Retrieval, ICMR 2020, Dublin, Ireland, June 8-11, 2020. ACM 2020, ISBN 978-1-4503-7087-5
Keynote Talks
- Ramesh C. Jain:
What Should I Do? 1 - Henning Müller:
Medical Image Retrieval: Applications and Resources. 2-3 - Marcel Worring:
Beyond Relevance Feedback for Searching and Exploring large Multimedia Collections. 4
Tutorials
- Martin Wistuba, Ambrish Rawat, Tejaswini Pedapati:
Automation of Deep Learning - Theory and Practice. 5-6 - Xavier Giró-i-Nieto:
One Perceptron to Rule Them All: Language, Vision, Audio and Speech. 7-8
Best Paper Session
- Yutian Guo, Jingjing Chen, Hao Zhang, Yu-Gang Jiang:
Visual Relations Augmented Cross-modal Retrieval. 9-15 - Eric Müller-Budack, Jonas Theiner, Sebastian Diering, Maximilian Idahl, Ralph Ewerth:
Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency. 16-25 - Xu Sun, Xinwen Hu, Tongwei Ren, Gangshan Wu:
Human Object Interaction Detection via Multi-level Conditioned Network. 26-34 - Sadaf Gulshad, Arnold W. M. Smeulders:
Explaining with Counter Visual Attributes and Examples. 35-43
Oral Session 1: Cross-Modal Analysis
- Dejie Yang, Dayan Wu, Wanqian Zhang, Haisu Zhang, Bo Li, Weiping Wang:
Deep Semantic-Alignment Hashing for Unsupervised Cross-Modal Retrieval. 44-52 - Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann, Eduard H. Hovy:
Forward and Backward Multimodal NMT for Improved Monolingual and Multilingual Cross-Modal Retrieval. 53-62 - Petr Byvshev, Pascal Mettes, Yu Xiao:
Heterogeneous Non-Local Fusion for Multimodal Activity Recognition. 63-72 - Pim Dijt, Pascal Mettes:
Trajectory Prediction Network for Future Anticipation of Ships. 73-81
Oral Session 2: Applications
- Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua:
Knowledge Enhanced Neural Fashion Trend Forecasting. 82-90 - Guolong Wang, Zheng Qin, Junchi Yan, Liu Jiang:
Learning to Select Elements for Graphic Design. 91-99 - Zhengcong Fei:
Actor-Critic Sequence Generation for Relative Difference Captioning. 100-107 - Shuo Chen, Pascal Mettes, Tao Hu, Cees G. M. Snoek:
Interactivity Proposals for Surveillance Videos. 108-116
Oral Session 3: Retrieval
- Zichen Zan, Lin Li, Jianquan Liu, Dong Zhou:
Sentence-based and Noise-robust Cross-modal Retrieval on Cooking Recipes and Food Images. 117-125 - Arun Zachariah, Mohamed Gharibi, Praveen Rao:
QIK: A System for Large-Scale Image Retrieval on Everyday Scenes With Common Objects. 126-135 - Zhi Xiong, Dayan Wu, Wen Gu, Haisu Zhang, Bo Li, Weiping Wang:
Deep Discrete Attention Guided Hashing for Face Image Retrieval. 136-144 - Tianrui Niu, Fangxiang Feng, Lingxuan Li, Xiaojie Wang:
Image Synthesis from Locally Related Texts. 145-153
Oral Session 4: Semantic Enrichment
- Suzi Kim, Sunghee Choi:
Automatic Color Scheme Extraction from Movies. 154-163 - Hussam Lawen, Avi Ben-Cohen, Matan Protter, Itamar Friedman, Lihi Zelnik-Manor:
Compact Network Training for Person ReID. 164-171 - Xinzhe Zhou, Yadong Mu:
Google Helps YouTube: Learning Few-Shot Video Classification from Historic Tasks and Cross-Domain Sample Transfer. 172-179 - Yash Garg, K. Selçuk Candan:
iSparse: Output Informed Sparsification of Neural Network. 180-188
Session: Posters (Full Length)
- Yanjie Chen, Likun Cai, Wei Cheng, Hao Wang:
Super-Resolution Coding Defense Against Adversarial Examples. 189-197 - Fabio Carrara, Giuseppe Amato, Fabrizio Falchi, Claudio Gennaro:
Continuous ODE-defined Image Features for Adaptive Retrieval. 198-206 - Xavier Favory, Frederic Font, Xavier Serra:
Search Result Clustering in Collaborative Sound Collections. 207-214 - Pengcheng Gao, Ke Lu, Jian Xue:
EfficientFAN: Deep Knowledge Transfer for Face Alignment. 215-223 - Qi Sun, Hongyan Liu, Jun He, Zhaoxin Fan, Xiaoyong Du:
DAGC: Employing Dual Attention and Graph Convolution for Point Cloud based Place Recognition. 224-232 - Roshan Prakash Rane, Edit Szügyi, Vageesh Saxena, André Ofner, Sebastian Stober:
PredNet and Predictive Coding: A Critical Review. 233-241 - Jia-Hong Huang, Marcel Worring:
Query-controllable Video Summarization. 242-250
Session: Posters (Short)
- Xuxiao Bu, Bingfeng Li, Yaxiong Wang, Jihua Zhu, Xueming Qian, Marco Zhao:
Semantic Gated Network for Efficient News Representation. 251-255 - Liviu-Daniel Stefan, Mihai Gabriel Constantin, Bogdan Ionescu:
System Fusion with Deep Ensembles. 256-260 - Asra Aslam, Edward Curry:
Reducing Response Time for Multimedia Event Processing using Domain Adaptation. 261-265 - Mahnaz Amiri Parian, Luca Rossetto, Heiko Schuldt, Stéphane Dupont:
Are You Watching Closely? Content-based Retrieval of Hand Gestures. 266-270 - Takumi Ohkuma, Hideki Nakayama:
Efficient Base Class Selection Algorithms for Few-Shot Classification. 271-275 - Konstantinos Gkountakos, Konstantinos Ioannidis, Theodora Tsikrika, Stefanos Vrochidis, Ioannis Kompatsiaris:
A Crowd Analysis Framework for Detecting Violence Scenes. 276-280 - Ladislav Peska, Frantisek Mejzlík, Tomás Soucek, Jakub Lokoc:
Towards Evaluating and Simulating Keyword Queries for Development of Interactive Known-item Search Systems. 281-285 - Shengxin Chen, Bo-Hao Chen, Zhaojiong Chen, YunBing Wu:
Itinerary Planning via Deep Reinforcement Learning. 286-290 - Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard:
Confidence-based Weighted Loss for Multi-label Classification with Missing Labels. 291-295 - Dawei Zhang, Zhonglong Zheng, Xiaowei He, Liu Su, Liyuan Chen:
Learning Fine-Grained Similarity Matching Networks for Visual Tracking. 296-300 - Bo Dong, Cristian Lumezanu, Yuncong Chen, Dongjin Song, Takehiko Mizoguchi, Haifeng Chen, Latifur Khan:
At the Speed of Sound: Efficient Audio Scene Classification. 301-305 - Chihaya Matsuhira, Marc A. Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Keisuke Doman, Daisuke Deguchi, Hiroshi Murase:
Imageability Estimation using Visual and Language Features. 306-310 - Federico Vaccaro, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo:
Image Retrieval using Multi-scale CNN Features Pooling. 311-315 - Sabina Hult, Line Bay Kreiberg, Sami Sebastian Brandt, Björn Þór Jónsson:
Analysis of the Effect of Dataset Construction Methodology on Transferability of Music Emotion Recognition Models. 316-320 - Camilo Vargas, Qianni Zhang, Ebroul Izquierdo:
One Shot Logo Recognition Based on Siamese Neural Networks. 321-325 - Wei-Rou Lin, Hen-Hsen Huang, Hsin-Hsi Chen:
Visual Story Ordering with a Bidirectional Writer. 326-330 - Lili Wang, Ruibo Liu, Soroush Vosoughi:
Salienteye: Maximizing Engagement While Maintaining Artistic Style on Instagram Using Deep Neural Networks. 331-335 - Damianos Galanopoulos, Vasileios Mezaris:
Attention Mechanisms, Signal Encodings and Fusion Strategies for Improved Ad-hoc Video Search with Dual Encoding Networks. 336-340 - Imam Yogie Susanto, Tse-Yu Pan, Chien-Wen Chen, Min-Chun Hu, Wen-Huang Cheng:
Emotion Recognition from Galvanic Skin Response Signal Based on Deep Hybrid Neural Networks. 341-345
Session: Brave New Ideas
- Riku Togashi, Sumio Fujita, Tetsuya Sakai:
Automatic Evaluation of Iconic Image Retrieval based on Colour, Shape, and Texture. 346-354 - Keith Curtis, George Awad, Shahzad Rajput, Ian Soboroff:
HLVU: A New Challenge to Test Deep Understanding of Movies the Way Humans do. 355-361 - Tomás Skopal:
On Visualizations in the Role of Universal Data Representation. 362-367
Session: Doctoral Symposium
- Omar Shahbaz Khan:
An Interactive Learning System for Large-Scale Multimedia Analytics. 368-372 - Asra Aslam:
Object Detection for Unseen Domains while Reducing Response Time using Knowledge Transfer in Multimedia Event Processing. 373-377 - Negin Ghamsarian:
Enabling Relevance-Based Exploration of Cataract Videos. 378-382
Session: Demonstrations
- Mariona Caros, Maite Garolera, Petia Radeva, Xavier Giró-i-Nieto:
Automatic Reminiscence Therapy for Dementia. 383-387 - Markus Schedl, Michael Mayr, Peter Knees:
Music Tower Blocks: Multi-Faceted Exploration Interface for Web-Scale Music Access. 388-392 - Dinh V. Cuong, Dac H. Nguyen, Son Huynh, Phong Huynh, Cathal Gurrin, Minh-Son Dao, Duc-Tien Dang-Nguyen, Binh T. Nguyen:
A Framework for Paper Submission Recommendation System. 393-396 - Andreas Leibetseder, Klaus Schöffmann:
surgXplore: Interactive Video Exploration for Endoscopy. 397-401 - Thinhinane Yebda, Jenny Benois-Pineau, Marion Pech, Hélène Amièva, Cathal Gurrin:
Detection of Semantic Risk Situations in Lifelog Data for Improving Life of Frail People. 402-406 - Chenhao Lin, Pengwei Hu, Hui Su, Shaochun Li, Jing Mei, Jie Zhou, Henry Leung:
SenseMood: Depression Detection on Social Media. 407-411 - Quy H. Nguyen, Dac H. Nguyen, Minh-Son Dao, Duc-Tien Dang-Nguyen, Cathal Gurrin, Binh T. Nguyen:
An Active Learning Framework for Duplicate Detection in SaaS Platforms. 412-415 - Van-Luon Tran, Anh-Vu Mai-Nguyen, Trong-Dat Phan, Anh-Khoa Vo, Minh-Son Dao, Koji Zettsu:
An Interactive Multimodal Retrieval System for Memory Assistant and Life Organized Support. 416-420
Special Session 1: Human-Centric Cross-Modal Retrieval
- Xian Zhong, Tianyou Lu, Wenxin Huang, Jingling Yuan, Wenxuan Liu, Chia-Wen Lin:
Visible-infrared Person Re-identification via Colorization-based Siamese Generative Adversarial Network. 421-427 - Zhengxiong Jia, Xirong Li:
iCap: Interactive Image Captioning with Predictive Text. 428-435 - Taeyong Kim, Bowon Lee:
Multi-Attention Multimodal Sentiment Analysis. 436-441 - Yongbiao Chen, Sheng Zhang, Zhengwei Qi:
MAENet: Boosting Feature Representation for Cross-Modal Person Re-Identification with Pairwise Supervision. 442-449
Special Session 2: Activities of Daily Living
- Min-Huan Fu, An-Zi Yen, Hen-Hsen Huang, Hsin-Hsi Chen:
Incorporating Semantic Knowledge for Visual Lifelog Activity Recognition. 450-456 - Khac-Tuan Nguyen, Dat-Thanh Dinh, Minh N. Do, Minh-Triet Tran:
Anomaly Detection in Traffic Surveillance Videos with GAN-based Future Frame Prediction. 457-463 - Jiawei Li, Shu-Tao Xia, Qianggang Ding:
Multi-level Recognition on Falls from Activities of Daily Living. 464-471 - Jonathan Liono, Mohammad Saiedur Rahaman, Flora D. Salim, Yongli Ren, Damiano Spina, Falk Scholer, Johanne R. Trippas, Mark Sanderson, Paul N. Bennett, Ryen W. White:
Intelligent Task Recognition: Towards Enabling Productivity Assistance in Daily Life. 472-478 - Khanh-An C. Quan, Vinh-Tiep Nguyen, Tan-Cong Nguyen, Tam V. Nguyen, Minh-Triet Tran:
Flood Level Prediction via Human Pose Estimation from Social Media Images. 479-485 - Vaibhav Pandey, Nitish Nag, Ramesh C. Jain:
Continuous Health Interface Event Retrieval. 486-494
Special Session 3: Multimedia Information Retrieval for Urban Data
- Shahin Sharifi Noorian, Sihang Qiu, Achilleas Psyllidis, Alessandro Bozzon, Geert-Jan Houben:
Detecting, Classifying, and Mapping Retail Storefronts Using Street-level Imagery. 495-501 - Naoki Sugimoto, Toru Okubo, Kiyoharu Aizawa:
Urban Movie Map for Walkers: Route View Synthesis using 360° Videos. 502-508 - Maarten Sukel, Stevan Rudinac, Marcel Worring:
Urban Object Detection Kit: A System for Collection and Analysis of Street-Level Imagery. 509-516
Special Session 4: Knowledge-Driven Analysis and Retrieval on Multimedia
- Runchen Wei, Ning He, Ke Lu:
YOLO-mini-tiger: Amur Tiger Detection. 517-524 - Cong Bai, Chao Zeng, Qing Ma, Jinglin Zhang, Shengyong Chen:
Deep Adversarial Discrete Hashing for Cross-Modal Retrieval. 525-531 - Li Hao, Liping Hou, Yuantao Song, Ke Lu, Jian Xue:
A Lightweight Gated Global Module for Global Context Modeling in Neural Networks. 532-539 - Youze Wang, Shengsheng Qian, Jun Hu, Quan Fang, Changsheng Xu:
Fake News Detection via Knowledge-driven Multimodal Graph Convolutional Networks. 540-547 - Jiansheng Dong, Jingling Yuan, Lin Li, Xian Zhong, Weiru Liu:
Optimizing Queries over Video via Lightweight Keypoint-based Object Detection. 548-554 - Bo Jiang:
Multi-Graph Group Collaborative Filtering. 555-562 - Haiyan Fu, Ying Li, Hengheng Zhang, Jinfeng Liu, Tao Yao:
Rank-embedded Hashing for Large-scale Image Retrieval. 563-570 - Yifeng Han, Lin Li, Jianwei Zhang:
A Coordinated Representation Learning Enhanced Multimodal Machine Translation Approach with Multi-Attention. 571-577
Workshop Summaries
- Ichiro Ide, Yoko Yamakata, Atsushi Hashimoto:
CEA'20: The 12th Workshop on Multimedia for Cooking and Eating Activities. 578-579 - Minh-Son Dao, Morten Fjeld, Filip Biljecki, Uraz Yavanoglu, Mianxiong Dong:
ICDAR'20: Intelligent Cross-Data Analysis and Retrieval. 580-581 - Wei-Ta Chu, Ichiro Ide, Naoko Nitta, Norimichi Tsumura, Toshihiko Yamasaki:
MMArt-ACM'20: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2020. 582-583 - Cathal Gurrin, Tu-Khiem Le, Van-Tu Ninh, Duc-Tien Dang-Nguyen, Björn Þór Jónsson, Jakub Lokoc, Wolfgang Hürst, Minh-Triet Tran, Klaus Schöffmann:
Introduction to the Third Annual Lifelog Search Challenge (LSC'20). 584-585
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.