default search action
22nd ISM 2020: Naples, Italy
- IEEE International Symposium on Multimedia, ISM 2020, Naples, Italy, December 2-4, 2020. IEEE 2020, ISBN 978-1-7281-8697-9
Streaming and Real Time Applications
- Antonio Nisticò, Dena Markudova, Martino Trevisan, Michela Meo, Giovanna Carofiglio:
A comparative study of RTC applications. 1-8 - Markus Hofbauer, Christopher B. Kuhn, Goran Petrovic, Eckehard G. Steinbach:
Adaptive Multi-View Live Video Streaming for Teledriving Using a Single Hardware Encoder. 9-16 - Jesús Aguilar Armijo, Babak Taraghi, Christian Timmerer, Hermann Hellwagner:
Dynamic Segment Repackaging at the Edge for HTTP Adaptive Streaming. 17-24
Video Quality
- Oda Olsen Nedrejord, Vajira Thambawita, Steven Alexander Hicks, Pål Halvorsen, Michael A. Riegler:
Vid2Pix - A Framework for Generating High-Quality Synthetic Videos. 25-26 - Joni Räsänen, Aaro Altonen, Alexandre Mercat, Jarno Vanne:
Live Demonstration: Interactive Quality of Experience Evaluation in Kvazzup Video Call. 27-28 - Antonio José G. Busson, Paulo Renato C. Mendes, Daniel de S. Moraes, Álvaro M. da Veiga, Álan L. V. Guedes, Sérgio Colcher:
Video Quality Enhancement Using Deep Learning-Based Prediction Models for Quantized DCT Coefficients in MPEG I-frames. 29-32
Best Paper Session
- Christopher B. Kuhn, Markus Hofbauer, Goran Petrovic, Eckehard G. Steinbach:
Better Look Twice - Improving Visual Scene Perception Using a Two-Stage Approach. 33-40 - Aysegül Özkaya Eren, Mustafa Sert:
Audio Captioning Based on Combined Audio and Semantic Embeddings. 41-48 - Petra Budíková, Jan Sedmidubský, Jan Horvath, Pavel Zezula:
Towards Scalable Retrieval of Human Motion Episodes. 49-56 - Jounsup Park, Mingyuan Wu, Kuan-Ying Lee, Bo Chen, Klara Nahrstedt, Michael Zink, Ramesh K. Sitaraman:
SEAWARE: Semantic Aware View Prediction System for 360-degree Video Streaming. 57-64
360-degree Video
- Stephan Fremerey, Frank Hofmeyer, Steve Göring, Dominik Keller, Alexander Raake:
Between the Frames - Evaluation of Various Motion Interpolation Algorithms to Improve 360° Video Quality. 65-73 - Bo Chen, Ahmed Ali-Eldin, Prashant J. Shenoy, Klara Nahrstedt:
Real-time Spatio-Temporal Action Localization in 360 Videos. 73-76 - Anahita Mahzari, Aliehsan Samiei, Ravi Prakash:
CooPEC: Cooperative Prefetching and Edge Caching for Adaptive 360° Video Streaming. 77-81 - Kuan-Ying Lee, Andrew Yoo, Jounsup Park, Klara Nahrstedt:
Redefine the A in ABR for 360-degree Videos: A Flexible ABR Framework. 82-84 - Maryam Homayouni, Alireza Aminlou, Miska M. Hannuksela:
On Subpicture-based Viewport-dependent 360-degree Video Streaming using VVC. 85-90
New Applications
- Markus Hofbauer, Christopher B. Kuhn, Lukas Püttner, Goran Petrovic, Eckehard G. Steinbach:
Measuring Driver Situation Awareness Using Region-of-Interest Prediction and Eye Tracking. 91-95 - Christian Roggia, Fabio Persia:
Extraction of Frame Sequences in the Manga Context. 96-99 - Omkar N. Kulkarni, Vikram Patil, Shivam B. Parikh, Shashank Arora, Pradeep K. Atrey:
Can You All Look Here? Towards Determining Gaze Uniformity In Group Images. 100-103 - Guilherme H. S. Nakahata, Ademir Aparecido Constantino, Yandre M. G. Costa:
Bonsai Style Classification: a new database and baseline results. 104-110
New Algorithms
- Chen Li, Xue Zhang, Tao Luo, Lihua Tian:
Audio Steganography Algorithm Based on Genetic Algorithm for MDCT Coefficient Adjustment for AAC. 111-112 - Tomasz Lyko, Matthew Broadbent, Nicholas J. P. Race, Mike Nilsson, Paul Farrow, Steve Appleby:
Llama - Low Latency Adaptive Media Algorithm. 113-121 - Nikolaos Gkalelis, Vasileios Mezaris:
Structured Pruning of LSTMs via Eigenanalysis and Geometric Median for Mobile Multimedia and Deep Learning Applications. 122-126 - Yurui Xie, Ling Guan:
Automatic Sparsity-Aware Recognition for Keypoint Detection. 127-134
Multimedia in Sport
- Olav A. Norgård Rongved, Steven Alexander Hicks, Vajira Thambawita, Håkon Kvale Stensland, Evi Zouganeli, Dag Johansen, Michael A. Riegler, Pål Halvorsen:
Real-Time Detection of Events in Soccer Videos using 3D Convolutional Neural Networks. 135-144 - Kotaro Yashiro, Yohei Nakada:
Computational Method for Optimal Attack Play Consisting of Run Plays and Hand-pass Plays for Seven-a-side Rugby. 145-148 - Vahid Khorasani Ghassab, Kamal Maanicshah, Nizar Bouguila, Paul Green:
REP-Model: A deep learning framework for replacing ad billboards in soccer videos. 149-153
Multimedia in Education
- Mohammad Rajiur Rahman, Shishir Shah, Jaspal Subhlok:
Visual Summarization of Lecture Video Segments for Enhanced Navigation. 154-157 - Paulo Renato C. Mendes, Eduardo S. Vieira, Álan L. V. Guedes, Antonio José G. Busson, Sérgio Colcher:
A Clustering-Based Method for Automatic Educational Video Recommendation Using Deep Face-Features of Lecturers. 158-161 - Raga Shalini Koka, Farah Naz Chowdhury, Mohammad Rajiur Rahman, Thamar Solorio, Jaspal Subhlok:
Automatic Identification of Keywords in Lecture Video Segments. 162-165 - Dorsaf Sebai, Emna Mani:
MPEG-DASH users quality of experience enhancement for MOOC videos. 166-167
Music
- Na He, Sam Ferguson:
Multi-view Neural Networks for Raw Audio-based Music Emotion Recognition. 168-172 - Leonardo Gabiato Catharin, Rafael P. Ribeiro, Carlos N. Silla, Yandre M. G. Costa, Valéria Delisandra Feltrim:
Multimodal Classification of Emotions in Latin Music. 173-180 - Li-Chia Yang, Alexander Lerch:
Remixing Music with Visual Conditioning. 181-188 - Yihao Chen, Alexander Lerch:
Melody-Conditioned Lyrics Generation with SeqGANs. 189-196
Video Summarization
- Saba Nazir, Taner Cagali, Mehrnoosh Sadrzadeh, Chris Newell:
Audiovisual, Genre, Neural and Topical Textual Embeddings for TV Programme Content Representation. 197-200 - Ran Xu, Haoliang Wang, Stefano Petrangeli, Viswanathan Swaminathan, Saurabh Bagchi:
Closing-the-Loop: A Data-Driven Framework for Effective Video Summarization. 201-205 - Thanh Hong-Phuoc, Ling Guan:
An Effective Rotational Invariant Key-point Detector for Image Matching. 206-209 - Hongxiang Gu, Stefano Petrangeli, Viswanathan Swaminathan:
SumBot: Summarize Videos Like a Human. 210-217 - Yeganeh Jalalpour, Li-Yun Wang, Wu-chi Feng, Feng Liu:
FID: Frame Interpolation and DCT-based Video Compression. 218-221
MTEL Workshop
- Christian Grévisse, Carina Martins Gomes, Steffen Rothkugel:
AR40ER: A Semantic Platform for Open Educational Augmented Reality Resources. 227-232 - Keisuke Ode, Sumiko Miyata:
Two types of flows admission control method for maximizing all user satisfaction considering seek-bar operation. 233-238 - Florian Schimanke, Robert Mertens:
Deriving Strategies for the Evaluation of Spaced Repetition Learning in Mobile Learning Applications from Learning Analytics. 239-244
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.