default search action
23rd MMM 2018: Bangkok, Thailand
- Klaus Schoeffmann, Thanarat H. Chalidabhongse, Chong-Wah Ngo, Supavadee Aramvith, Noel E. O'Connor, Yo-Sung Ho, Moncef Gabbouj, Ahmed Elgammal:
MultiMedia Modeling - 24th International Conference, MMM 2018, Bangkok, Thailand, February 5-7, 2018, Proceedings, Part I. Lecture Notes in Computer Science 10704, Springer 2018, ISBN 978-3-319-73602-0
Full Papers Accepted for Oral Presentation
- Shurong Sheng, Aparna Nurani Venkitasubramanian, Marie-Francine Moens:
A Markov Network Based Passage Retrieval Method for Multimodal Question Answering in the Cultural Heritage Domain. 3-15 - En Shi, Qian Li, Daquan Gu, Zhangming Zhao:
A Method of Weather Radar Echo Extrapolation Based on Convolutional Neural Networks. 16-28 - Konstantinos Apostolidis, Evlampios Apostolidis, Vasileios Mezaris:
A Motion-Driven Approach for Fine-Grained Temporal Segmentation of User-Generated Videos. 29-41 - Lianglei Wei, Yirui Wu, Wenhai Wang, Tong Lu:
A Novel 3D Human Action Recognition Framework for Video Content Analysis. 42-53 - Dorian Michaud, Thierry Urruty, François Lecellier, Philippe Carré:
Adaptive Image Representation Using Information Gain and Saliency: Application to Cultural Heritage Datasets. 54-66 - Peng Yao, Hua Zhang, Yanbing Xue, Shengyong Chen:
AGO: Accelerating Global Optimization for Accurate Stereo Matching. 67-80 - Wanzhao Yang, Weiping Tu, Jiaxi Zheng, Xiong Zhang, Yuhong Yang, Yucheng Song:
An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder. 81-92 - Jiang Zhu, Wei Zhai, Yang Cao, Zheng-Jun Zha:
Co-occurrent Structural Edge Detection for Color-Guided Depth Map Super-Resolution. 93-105 - Kaiping Xu, Zheng Qin, Guolong Wang, Kai Huang, Shuxiong Ye, Huidi Zhang:
Collision-Free LSTM for Human Trajectory Prediction. 106-116 - Tae Kwan Lee, Wissam J. Baddar, Seong Tae Kim, Yong Man Ro:
Convolution with Logarithmic Filter Groups for Efficient Shallow CNN. 117-129 - Junjie Zhao, Yuxin Peng:
Cost-Sensitive Deep Metric Learning for Fine-Grained Image Classification. 130-141 - Meng Wei, Yu Kang, Weiguo Song, Yang Cao:
Crowd Distribution Estimation with Multi-scale Recursive Convolutional Neural Network. 142-153 - Yuhua Jia, Liang Bai, Peng Wang, Jinlin Guo, Yuxiang Xie:
Deep Convolutional Neural Network for Correlating Images and Sentences. 154-165 - Weijie Kong, Nannan Li, Thomas H. Li, Ge Li:
Deep Pedestrian Detection Using Contextual Information and Multi-level Features. 166-177 - Hua Yuan, Yuanyuan Zhou, Yun Sheng, Guixu Zhang:
Dual-Way Guided Depth Image Inpainting with RGBD Image Pairs. 178-189 - Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki:
Efficient and Interactive Spatial-Semantic Image Retrieval. 190-202 - Sabrina Kletz, Andreas Leibetseder, Klaus Schoeffmann:
Evaluation of Visual Content Descriptors for Supporting Ad-Hoc Video Search Tasks at the Video Browser Showdown. 203-215 - Saumya Rawat, Siddhartha Gairola, Rajvi Shah, P. J. Narayanan:
Find Me a Sky: A Data-Driven Method for Color-Consistent Sky Search and Replacement. 216-228 - Yizhi Wang, Zhouhui Lian, Yingmin Tang, Jianguo Xiao:
Font Recognition in Natural Images via Transfer Learning. 229-240 - Manfred Jürgen Primus, Doris Putzgruber-Adamitsch, Mario Taschwer, Bernd Münzer, Yosuf El-Shabrawi, László Böszörményi, Klaus Schoeffmann:
Frame-Based Classification of Operation Phases in Cataract Surgery Videos. 241-253 - Jong-Hee Back, Sunho Kim, Yo-Sung Ho:
High-Precision 3D Coarse Registration Using RANSAC and Randomly-Picked Rejections. 254-266 - Huidi Fang, Chaoran Cui, Xiang Deng, Xiushan Nie, Muwei Jian, Yilong Yin:
Image Aesthetic Distribution Prediction with Fully Convolutional Network. 267-278 - Laura Pérez-Mayos, Federico M. Sukno, Leo Wanner:
Improving the Quality of Video-to-Language Models by Optimizing Annotation of the Training Material. 279-290 - Mofei Song, Zhengxing Sun, Bo Li, Jiagao Hu:
Iterative Active Classification of Large Image Collection. 291-304 - Amorntip Prayoonwong, Cheng-Hsien Wang, Chih-Yi Chiu:
Learning to Index in Large-Scale Datasets. 305-316 - Jianshe Zhou, Tuya Naren, Xianyu Chen, Yike Ma, Jie Liu, Feng Dai:
Light Field Foreground Matting Based on Defocus and Correspondence. 317-328 - Peng Cheng, Wu Liu, Yifan Zhang, Huadong Ma:
LOCO: Local Context Based Faster R-CNN for Small Traffic Sign Detection. 329-341 - Yongfei Zhang, Zhe Li:
Multi-hypothesis-Based Error Concealment for Whole Frame Loss in HEVC. 342-354 - Jinna Lv, Wu Liu, Lili Zhou, Bin Wu, Huadong Ma:
Multi-stream Fusion Model for Social Relation Recognition from Videos. 355-368 - Geert Lugtenberg, Wolfgang Hürst, Nina Rosa, Christian Sandor, Alexander Plopski, Takafumi Taketomi, Hirokazu Kato:
Multimodal Augmented Reality - Augmenting Auditory-Tactile Feedback to Change the Perception of Thickness. 369-380 - Jianjun Li, Lanlan Xu, Haojie Li, Chin-Chen Chang, Fuming Sun:
Parameter Selection for Denoising Algorithms Using NR-IQA with CNN. 381-392 - Itsara Wichakam, Teerapong Panboonyuen, Can Udomcharoenchaikit, Peerapon Vateekul:
Real-Time Polyps Segmentation for Colonoscopy Video Frames Using Compressed Fully Convolutional Network. 393-404 - Yuxin Yuan, Yuxin Peng:
Recursive Pyramid Network with Joint Attention for Cross-Media Retrieval. 405-416 - Qi Zheng, Jun Chen, Junjun Jiang, Ruimin Hu:
Reinforcing Pedestrian Parsing on Small Scale Dataset. 417-427 - Xiangyu Liu, Yunhong Wang, Qingjie Liu:
Remote Sensing Image Fusion Based on Two-Stream Fusion Network. 428-439 - Peng Wu, Di Huang, Yunhong Wang:
REVT: Robust and Efficient Visual Tracking by Region-Convolutional Regression Network. 440-452 - Dongmei Huang, Yan Wang, Wei Song, Jean Sequeira, Sébastien Mavromatis:
Shallow-Water Image Enhancement Using Relative Global Histogram Stretching Based on Adaptive Parameter Acquisition. 453-465 - Lintao Guo, Hunter Quant, Nikolas Lamb, Benjamin Lowit, Sean Banerjee, Natasha Kholgade Banerjee:
Spatiotemporal 3D Models of Aging Fruit from Multi-view Time-Lapse Videos. 466-478 - Kewei Yang, Zhengxing Sun, Shuang Wang, Bo Li:
Stitch-Based Image Stylization for Thread Art Using Sparse Modeling. 479-492 - Hong Joo Lee, Wissam J. Baddar, Hak Gu Kim, Seong Tae Kim, Yong Man Ro:
Teacher and Student Joint Learning for Compact Facial Landmark Detection Network. 493-504 - Zhengcai Qin, Bin Wu, Meng Li:
Text Image Deblurring via Intensity Extremums Prior. 505-517 - Dries Hulens, Bram Aerts, Punarjay Chakravarty, Ali Diba, Toon Goedemé, Tom Roussel, Jeroen Zegers, Tinne Tuytelaars, Luc Van Eycken, Luc Van Gool, Hugo Van hamme, Joost Vennekens:
The CAMETRON Lecture Recording System: High Quality Video Recording and Editing with Minimal Human Supervision. 518-530 - Magzhan Kairanbay, John See, Lai-Kuan Wong:
Towards Demographic-Based Photographic Aesthetics Prediction for Portraitures. 531-543 - Xiaoyu Qi, Deshun Yang, Xiaoou Chen:
Triplet Convolutional Network for Music Version Identification. 544-555 - Yujing Chen, Jing Xiao, Gen Zhan, Xu Wang, Zhongyuan Wang:
Two-Level Segment-Based Bitrate Control for Live ABR Streaming. 556-564 - Jianjun Chen, Hongtao Xie, Yue Hu, Chenggang Yan:
Uyghur Text Localization with Fast Component Detection. 565-577
SS: Multimedia Analytics: Perspectives, Techniques and Applications
- Rashmi Gupta, Cathal Gurrin:
Approaches for Event Segmentation of Visual Lifelog Data. 581-593 - Masoud Mazloom, Iliana Pappi, Marcel Worring:
Category Specific Post Popularity Prediction. 594-607 - Feiyan Hu, Alan F. Smeaton:
Image Aesthetics and Content in Selecting Memorable Keyframes from Lifelogs. 608-619 - Werner Bailer:
On the Traceability of Results from Deep Learning-Based Cloud Services. 620-631 - Stevan Rudinac, Tat-Seng Chua, Nicolás E. Díaz Ferreyra, Gerald Friedland, Tatjana Gornostaja, Benoit Huet, Rianne Kaptein, Krister Lindén, Marie-Francine Moens, Jaakko Peltonen, Miriam Redi, Markus Schedl, David A. Shamma, Alan F. Smeaton, Lexing Xie:
Rethinking Summarization and Storytelling for Modern Social Multimedia. 632-644
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.