default search action
ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 15
Volume 15, Number 1, February 2019
- Wei Zhang:
Table of Contents: Online Supplement Volume 15, Number 1s. 16:1-16:2 - Iheanyi Irondi, Qi Wang, Christos Grecos, José M. Alcaraz Calero, Pablo Casaseca-de-la-Higuera:
Efficient QoE-Aware Scheme for Video Quality Switching Operations in Dynamic Adaptive Streaming. 17:1-17:23 - Mariem Ben Yahia, Yannick Le Louédec, Gwendal Simon, Loutfi Nuaymi, Xavier Corbillon:
HTTP/2-based Frame Discarding for Low-Latency Adaptive Video Streaming. 18:1-18:23 - Xianguo Li, Yemei Sun, Yanli Yang, Changyun Miao:
Symmetrical Residual Connections for Single Image Super-Resolution. 19:1-19:10 - Yi Yu, Suhua Tang, Francisco Raposo, Lei Chen:
Deep Cross-Modal Correlation Learning for Audio and Lyrics in Music Retrieval. 20:1-20:16 - Jia Sun, Di Huang, Yunhong Wang, Liming Chen:
Expression Robust 3D Facial Landmarking via Progressive Coarse-to-Fine Tuning. 21:1-21:23 - Yuxin Peng, Jinwei Qi:
CM-GANs: Cross-modal Generative Adversarial Networks for Common Representation Learning. 22:1-22:24 - Pietro Pala, Stefano Berretti:
Reconstructing 3D Face Models by Incremental Aggregation and Refinement of Depth Frames. 23:1-23:24 - Han Hu, Yichao Jin, Yonggang Wen, Cédric Westphal:
Orchestrating Caching, Transcoding and Request Routing for Adaptive Video Streaming Over ICN. 24:1-24:23 - Bo Yuan, Xinbo Gao, Zhenxing Niu, Qi Tian:
Discovering Latent Topics by Gaussian Latent Dirichlet Allocation and Spectral Clustering. 25:1-25:18 - Chen He, Haifeng Hu:
Image Captioning With Visual-Semantic Double Attention. 26:1-26:16 - Ruoyu Liu, Yao Zhao, Shikui Wei, Liang Zheng, Yi Yang:
Modality-Invariant Image-Text Embedding for Image-Sentence Matching. 27:1-27:19 - Ruijun Ma, Haifeng Hu, Weixuan Wang, Jia Xu, Zhengming Li:
Photorealistic Face Completion with Semantic Parsing and Face Identity-Preserving Features. 28:1-28:18 - Jakub Lokoc, Gregor Kovalcík, Bernd Münzer, Klaus Schöffmann, Werner Bailer, Ralph Gasser, Stefanos Vrochidis, Phuong Anh Nguyen, Sitapa Rujikietgumjorn, Kai Uwe Barthel:
Interactive Search or Sequential Browsing? A Detailed Analysis of the Video Browser Showdown 2018. 29:1-29:18
Volume 15, Number 1s, February 2019
- Wei Zhang, Ting Yao, Shiai Zhu, Abdulmotaleb El-Saddik:
Editorial to Special Issue on Deep Learning for Intelligent Multimedia Analytics. 1:1-1:2 - Wei Zhang, Ting Yao, Shiai Zhu, Abdulmotaleb El-Saddik:
Deep Learning-Based Multimedia Analytics: A Review. 2:1-2:26 - Hongtao Xie, Shancheng Fang, Zheng-Jun Zha, Yating Yang, Yan Li, Yongdong Zhang:
Convolutional Attention Networks for Scene Text Recognition. 3:1-3:17 - Zhineng Chen, Shanshan Ai, Caiyan Jia:
Structure-Aware Deep Learning for Product Image Classification. 4:1-4:20 - Shuqiang Jiang, Gongwei Chen, Xinhang Song, Linhu Liu:
Deep Patch Representations with Shared Codebook for Scene Classification. 5:1-5:17 - Rui-Wei Zhao, Qi Zhang, Zuxuan Wu, Jianguo Li, Yu-Gang Jiang:
Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning. 6:1-6:22 - Xueliang Liu, Meng Wang, Zheng-Jun Zha, Richang Hong:
Cross-Modality Feature Learning via Convolutional Autoencoder. 7:1-7:20 - Jiawei Liu, Zheng-Jun Zha, Xuejin Chen, Zilei Wang, Yongdong Zhang:
Dense 3D-Convolutional Neural Network for Person Re-Identification in Videos. 8:1-8:19 - Liang Zhao, Zhikui Chen, Laurence T. Yang, M. Jamal Deen, Z. Jane Wang:
Deep Semantic Mapping for Heterogeneous Multimedia Transfer Learning Using Co-Occurrence Data. 9:1-9:21 - M. Shamim Hossain, Syed Umar Amin, Mansour Alsulaiman, Ghulam Muhammad:
Applying Deep Learning for Epilepsy Seizure Detection and Brain Mapping Visualization. 10:1-10:17
- Xavier Alameda-Pineda, Miriam Redi, Mohammad Soleymani, Nicu Sebe, Shih-Fu Chang, Samuel D. Gosling:
Special Section on Multimodal Understanding of Social, Affective, and Subjective Attributes. 11:1-11:3 - Chuan-Shen Hu, Yi-Tsung Hsieh, Hsiao-Wei Lin, Mei-Chen Yeh:
Virtual Portraitist: An Intelligent Tool for Taking Well-Posed Selfies. 12:1-12:17 - Shogo Okada, Laurent Son Nguyen, Oya Aran, Daniel Gatica-Perez:
Modeling Dyadic and Group Impressions with Intermodal and Interperson Features. 13:1-13:30 - Sicheng Zhao, Amir Gholaminejad, Guiguang Ding, Yue Gao, Jungong Han, Kurt Keutzer:
Personalized Emotion Recognition by Personality-Aware High-Order Learning of Physiological Signals. 14:1-14:18 - Rim Trabelsi, Jagannadan Varadarajan, Le Zhang, Issam Jabri, Yong Pei, Fethi Smach, Ammar Bouallegue, Pierre Moulin:
Understanding the Dynamics of Social Interactions: A Multi-Modal Multi-View Approach. 15:1-15:16
Volume 15, Number 2, June 2019
- Tian Gan, Junnan Li, Yongkang Wong, Mohan S. Kankanhalli:
A Multi-sensor Framework for Personal Presentation Analytics. 30:1-30:21 - Pengjie Tang, Hanli Wang, Qinyu Li:
Rich Visual and Language Representation with Complementary Semantics for Video Captioning. 31:1-31:23 - Chen Shen, Zhongming Jin, Wenqing Chu, Rongxin Jiang, Yaowu Chen, Guo-Jun Qi, Xian-Sheng Hua:
Multi-level Similarity Perception Network for Person Re-identification. 32:1-32:19 - Yu Miao, Haiwei Dong, Jihad Mohamad Al'Jaam, Abdulmotaleb El-Saddik:
A Deep Learning System for Recognizing Facial Expression in Real-Time. 33:1-33:20 - Gebremariam Mesfin, Nadia Hussain, Alexandra Covaci, Gheorghita Ghinea:
Using Eye Tracking and Heart-Rate Activity to Examine Crossmodal Correspondences QoE in Mulsemedia. 34:1-34:22 - Ming Cheung, James She, Weiwei Sun, Jiantao Zhou:
Detecting Online Counterfeit-goods Seller using Connection Discovery. 35:1-35:16 - Hema Kumar Yarnagula, Parikshit Juluri, Sheyda Kiani Mehr, Venkatesh Tamarapalli, Deep Medhi:
QoE for Mobile Clients with Segment-aware Rate Adaptation Algorithm (SARA) for DASH Video Streaming. 36:1-36:23 - Pradeep K. Atrey, Bakul Trehan, Mukesh Kumar Saini:
Watch Me from Distance (WMD): A Privacy-Preserving Long-Distance Video Surveillance System. 37:1-37:18 - Chih-Fan Hsu, Yu-Shuen Wang, Chin-Laung Lei, Kuan-Ta Chen:
Look at Me! Correcting Eye Gaze in Live Video Communication. 38:1-38:21 - Kashif Ahmad, Nicola Conci:
How Deep Features Have Improved Event Recognition in Multimedia: A Survey. 39:1-39:27 - Yadang Chen, Chuanyan Hao, Alex X. Liu, Enhua Wu:
Appearance-consistent Video Object Segmentation Based on a Multinomial Event Model. 40:1-40:15 - Roberto Pierdicca, Emanuele Frontoni, Primo Zingaretti, Adriano Mancini, Jelena Loncarski, Marina Paolanti:
Design, Large-Scale Usage Testing, and Important Metrics for Augmented Reality Gaming Applications. 41:1-41:18 - Aliaksandr Siarohin, Gloria Zen, Cveta Majtanovic, Xavier Alameda-Pineda, Elisa Ricci, Nicu Sebe:
Increasing Image Memorability with Neural Style Transfer. 42:1-42:22 - Thanh-Toan Do, Tuan Hoang, Dang-Khoa Le Tan, Huu Le, Tam V. Nguyen, Ngai-Man Cheung:
From Selective Deep Convolutional Features to Compact Binary Representations for Image Retrieval. 43:1-43:22 - Liquan Shen, Ping An, Guorui Feng:
Low-Complexity Scalable Extension of the High-Efficiency Video Coding (SHVC) Encoding System. 44:1-44:23 - Jun Hu, Shengsheng Qian, Quan Fang, Xueliang Liu, Changsheng Xu:
A2CMHNE: Attention-Aware Collaborative Multimodal Heterogeneous Network Embedding. 45:1-45:17 - Khalid M. Hosny, Mohamed M. Darwish:
Resilient Color Image Watermarking Using Accurate Quaternion Radial Substituted Chebyshev Moments. 46:1-46:25 - Wenxuan Mou, Hatice Gunes, Ioannis Patras:
Alone versus In-a-group: A Multi-modal Framework for Automatic Affect Recognition. 47:1-47:23
Volume 15, Number 2s, August 2019
- Richang Hong, Yahong Han, Tat-Seng Chua:
Introduction to the Special Issue on the Cross-Media Analysis for Visual Question Answering. 48:1-48:3 - Qun Li, Fu Xiao, Le An, Xianzhong Long, Xiaochuan Sun:
Semantic Concept Network and Deep Walk-based Visual Question Answering. 49:1-49:19 - Zhiwei Fang, Jing Liu, Xueliang Liu, Qu Tang, Yong Li, Hanqing Lu:
BTDP: Toward Sparse Fusion with Block Term Decomposition Pooling for Visual Question Answering. 50:1-50:21 - Dongfei Yu, Jianlong Fu, Xinmei Tian, Tao Mei:
Multi-source Multi-level Attention Networks for Visual Question Answering. 51:1-51:20 - Weike Jin, Zhou Zhao, Yimeng Li, Jie Li, Jun Xiao, Yueting Zhuang:
Video Question Answering via Knowledge-based Progressive Spatial-Temporal Attention Network. 52:1-52:22 - Zheng-Jun Zha, Jiawei Liu, Tianhao Yang, Yongdong Zhang:
Spatiotemporal-Textual Co-Attention Network for Video Question Answering. 53:1-53:18 - Jinhui Tang, Jing Wang, Zechao Li, Jianlong Fu, Tao Mei:
Show, Reward, and Tell: Adversarial Visual Story Generation. 54:1-54:20 - Xiaoshan Yang, Changsheng Xu:
Image Captioning by Asking Questions. 55:1-55:19 - Shuo Wang, Dan Guo, Xin Xu, Li Zhuo, Meng Wang:
Cross-Modality Retrieval by Joint Correlation Learning. 56:1-56:16
- James She:
Introduction to the Special Issue on Big Data, Machine Learning, and AI Technologies for Art and Design. 57:1-57:3 - Joo-Wha Hong, Nathaniel Ming Curran:
Artificial Intelligence, Artists, and Art: Attitudes Toward Artwork Produced by Humans vs. Artificial Intelligence. 58:1-58:16 - Eugene Ch'ng:
Art by Computing Machinery: Is Machine Art Acceptable in the Artworld? 59:1-59:17 - Hui Mao, James She, Ming Cheung:
Visual Arts Search on Mobile Devices. 60:1-60:23 - Zunlei Feng, Zhenyun Yu, Yongcheng Jing, Sai Wu, Mingli Song, Yezhou Yang, Junxiao Jiang:
Interpretable Partitioned Embedding for Intelligent Multi-item Fashion Outfit Composition. 61:1-61:20 - Karen Panetta, Long Bao, Sos S. Agaian, Victor Oludare:
Color Theme-based Aesthetic Enhancement Algorithm to Emulate the Human Perception of Beauty in Photos. 62:1-62:17 - Magzhan Kairanbay, John See, Lai-Kuan Wong:
Beauty Is in the Eye of the Beholder: Demographically Oriented Analysis of Aesthetics in Photographs. 63:1-63:21
- Pablo César, Michael Zink, Niall Murray:
Introduction to the Best Papers of the ACM Multimedia Systems (MMSys) Conference 2018 and the ACM Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV) 2018 and the International Workshop on Mixed and Virtual Environment Systems (MMVE) 2018. 64:1-64:2 - Abdelhak Bentaleb, Ali C. Begen, Saad Harous, Roger Zimmermann:
Game of Streaming Players: Is Consensus Viable or an Illusion? 65:1-65:30 - De-Yu Chen, Magda El Zarki:
A Framework for Adaptive Residual Streaming for Single-Player Cloud Gaming. 66:1-66:23 - Kevin Spiteri, Ramesh K. Sitaraman, Daniel Sparacio:
From Theory to Practice: Improving Bitrate Adaptation in the DASH Reference Player. 67:1-67:29 - Alireza Zare, Maryam Homayouni, Alireza Aminlou, Miska M. Hannuksela, Moncef Gabbouj:
6K and 8K Effective Resolution with 4K HEVC Decoding Capability for 360 Video Streaming. 68:1-68:22
Volume 15, Number 3, September 2019
- Richang Hong:
Table of Contents: Online Supplement Volume 15, Number 2s. - Yongyi Gong, Shangru Li, Kanoksak Wattanachote, Xiaonan Luo:
Advanced Stereo Seam Carving by Considering Occlusions on Both Sides. 69:1-69:21 - Yun Zhang, Na Li, Sam Kwong, Gangyi Jiang, Huanqiang Zeng:
Statistical Early Termination and Early Skip Models for Fast Mode Decision in HEVC INTRA Coding. 70:1-70:23 - Abhinav Gupta, Divya Singhal:
A Simplistic Global Median Filtering Forensics Based on Frequency Domain Analysis of Image Residuals. 71:1-71:23 - Kan Wu, Guanbin Li, Haofeng Li, Jian-Jun Zhang, Yizhou Yu:
Harvesting Visual Objects from Internet Images via Deep-Learning-Based Objectness Assessment. 72:1-72:23 - Yuan Yuan, Jie Fang, Xiaoqiang Lu, Yachuang Feng:
Spatial Structure Preserving Feature Pyramid Network for Semantic Image Segmentation. 73:1-73:19 - Junxuan Zhang, Haifeng Hu, Xinlong Lu:
Moving Foreground-Aware Visual Attention and Key Volume Mining for Human Action Recognition. 74:1-74:16 - Amit More, Subhasis Chaudhuri:
A Pseudo-likelihood Approach for Geo-localization of Events from Crowd-sourced Sensor-Metadata. 75:1-75:26 - Mohsin Shah, Weiming Zhang, Honggang Hu, Nenghai Yu:
Paillier Cryptosystem based Mean Value Computation for Encrypted Domain Image Processing Operations. 76:1-76:21 - Guanghui Yue, Chunping Hou, Tianwei Zhou:
Subtitle Region Selection of S3D Images in Consideration of Visual Discomfort and Viewing Habit. 77:1-77:16 - Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Yong Rui, Tao Mei:
Learning Click-Based Deep Structure-Preserving Embeddings with Visual Attention. 78:1-78:19 - Tengfei Cao, Changqiao Xu, Mu Wang, Zhongbai Jiang, Xingyan Chen, Lujie Zhong, Luigi Alfredo Grieco:
Stochastic Optimization for Green Multimedia Services in Dense 5G Networks. 79:1-79:22 - Jie Wu, Haifeng Hu, Liang Yang:
Pseudo-3D Attention Transfer Network with Content-aware Strategy for Image Captioning. 80:1-80:19 - Min Wang, Wengang Zhou, Qi Tian, Houqiang Li:
Deep Scalable Supervised Quantization by Self-Organizing Map. 81:1-81:18 - Ihsan Mert Ozcelik, Cem Ersoy:
Chunk Duration-Aware SDN-Assisted DASH. 82:1-82:22 - Naifan Zhuang, Guo-Jun Qi, The Duc Kieu, Kien A. Hua:
Rethinking the Combined and Individual Orders of Derivative of States for Differential Recurrent Neural Networks: Deep Differential Recurrent Neural Networks. 83:1-83:21 - Zhangcheng Wang, Ya Li, Richang Hong, Xinmei Tian:
Eigenvector-Based Distance Metric Learning for Image Classification and Retrieval. 84:1-84:19
Volume 15, Number 4, January 2020
- Weizhi Nie, Weijie Wang, Anan Liu, Jie Nie, Yuting Su:
HGAN: Holistic Generative Adversarial Networks for Two-dimensional Image-based Three-dimensional Object Retrieval. 101:1-101:24 - Mading Li, Jiaying Liu, Xiaoyan Sun, Zhiwei Xiong:
Image/Video Restoration via Multiplanar Autoregressive Model and Low-Rank Optimization. 102:1-102:23 - Sheng-Hua Zhong, Yuantian Wang, Tongwei Ren, Mingjie Zheng, Yan Liu, Gangshan Wu:
Steganographer Detection via Multi-Scale Embedding Probability Estimation. 103:1-103:23 - Marcos A. de Almeida, Carolina Coimbra Vieira, Pedro Olmo Stancioli Vaz de Melo, Renato Martins Assunção:
Random Playlists Smoothly Commuting Between Styles. 104:1-104:20 - Zhaoda Ye, Yuxin Peng:
Sequential Cross-Modal Hashing Learning via Multi-scale Correlation Mining. 105:1-105:20 - Shiguang Liu, Ziqing Huang:
Efficient Image Hashing with Geometric Invariant Vector Distance for Copy Detection. 106:1-106:22 - Zhandong Liu, Wengang Zhou, Houqiang Li:
AB-LSTM: Attention-based Bidirectional LSTM Model for Scene Text Detection. 107:1-107:23 - Deepayan Bhowmik, Charith Abhayaratne:
Embedding Distortion Analysis in Wavelet-domain Watermarking. 108:1-108:24 - Ling Shen, Richang Hong, Haoran Zhang, Xinmei Tian, Meng Wang:
Video Retrieval with Similarity-Preserving Deep Temporal Hashing. 109:1-109:16 - Jeroen van der Hooft, Maria Torres Vega, Stefano Petrangeli, Tim Wauters, Filip De Turck:
Tile-based Adaptive Streaming for Virtual Reality Video. 110:1-110:24 - Roberto Irajá Tavares da Costa Filho, Marcelo Caggiani Luizelli, Stefano Petrangeli, Maria Torres Vega, Jeroen van der Hooft, Tim Wauters, Filip De Turck, Luciano Paschoal Gaspary:
Dissecting the Performance of VR Video Streaming through the VR-EXP Experimentation Platform. 111:1-111:23 - Yunpeng Zheng, Xuelong Li, Xiaoqiang Lu:
Unsupervised Learning of Human Action Categories in Still Images with Deep Representations. 112:1-112:20 - Meng Xing, Zhiyong Feng, Yong Su, Jianhai Zhang:
An Image Cues Coding Approach for 3D Human Pose Estimation. 113:1-113:20 - Jinhuan Liu, Xuemeng Song, Liqiang Nie, Tian Gan, Jun Ma:
An End-to-End Attention-Based Neural Model for Complementary Clothing Matching. 114:1-114:16 - Jonathan Kua, Grenville Armitage, Philip Branch, Jason But:
Adaptive Chunklets and AQM for Higher-Performance Content Streaming. 115:1-115:24
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.