International Journal of Multimedia Information Retrieval, Volume 13
Volume 13, Number 1, March 2024
- Shan Liu, Shihao Shan, Guoqiang Xiao, Xinbo Gao, Song Wu:
Image enhancement with bi-directional normalization and color attention-guided generative adversarial networks. 1
- Rangwan Kasantikul, Worapan Kusakunniran:
Augmented inputs for surveillance re-identification. 2
- Xiang Yuan, Shihao Shan, Yuwen Huo, Junkai Jiang, Song Wu:
Text-assisted attention-based cross-modal hashing. 3
- Stefanos-Iordanis Papadopoulos, Christos Koutlis, Symeon Papadopoulos, Panagiotis C. Petrantonakis:
VERITE: a Robust benchmark for multimodal misinformation detection accounting for unimodal bias. 4
- Yuxin Wei, Ligang Zheng, Guoping Qiu, Guocan Cai:
Cross-modal retrieval based on shared proxies. 5
- Younghoon Lee:
Opinion convergence-based sentiment prediction of image advertisement. 6
- Ashima Yadav, Anika Gupta:
An emotion-driven, transformer-based network for multimodal fake news detection. 7
- Yiqiao Tan, Haizhong Liu:
How does a kernel based on gradients of infinite-width neural networks come to be widely used: a review of the neural tangent kernel. 8
- Ahmed Mazari, Hichem Sahbi:
Deep multiple aggregation networks for action recognition. 9
- Kaiyang Liao, Jie Lin, Yuanlin Zheng, Keer Wang, Wen Feng:
Incremental image retrieval method based on feature perception and deep hashing. 10
- Gürkan Dogan, Burhan Ergen:
A new CNN-based semantic object segmentation for autonomous vehicles in urban traffic scenes. 11
- Qingsong Tang, Yingli Chen, Minghui Zhao, Shitong Min, Wuming Jiang:
DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation. 12
- Sandeep Chand Kumain, Maheep Singh, Lalit Kumar Awasthi:
A voting-based novel spatio-temporal fusion framework for video saliency using transfer learning mechanism. 13
- Huaying Zhang, Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Parameter-efficient tuning of cross-modal retrieval for a specific database via trainable textual and visual prompts. 14
Volume 13, Number 2, June 2024
- Konstantin Schall, Werner Bailer, Kai Uwe Barthel, Fabio Carrara, Jakub Lokoc, Ladislav Peska, Klaus Schoeffmann, Lucia Vadicamo, Claudio Vairo:
Interactive multimodal video search: an extended post-evaluation for the VBS 2022 competition. 15
- Lina Sun, Yumin Dong:
Unsupervised graph reasoning distillation hashing for multimodal hamming space search with vision-language model. 16
- Shuren Zhou, Zhixiong Li, Jie Liu, Jiarui Zhou, Jianming Zhang:
Progressive spatial-temporal transfer model for unsupervised person re-identification. 17
- Divine Njengwie Achinek, Ibrahim Shehi Shehu, Athuman Mohamed Athuman, Xianping Fu:
DAF-Net: dense attention feature pyramid network for multiscale object detection. 18
- Shihao Shan, Peixin Sun, Guoqiang Xiao, Song Wu:
Multi-knowledge-driven enhanced module for visible-infrared cross-modal person Re-identification. 19
- Himanshu Sharma, Devanand Padha:
Domain-specific image captioning: a comprehensive review. 20
- Zhong Ji, Xiangyu Kong, Xuan Wang, Xiyao Liu:
Relevance equilibrium network for cross-domain few-shot learning. 21
- Shilpa Singhal, Kunwar Pal:
State of art and emerging trends on group recommender system: a comprehensive review. 22
- Zhongyi Zhai, Jie Liang, Bo Cheng, Lingzhong Zhao, Junyan Qian:
Strengthening attention: knowledge distillation via cross-layer feature fusion for image classification. 23
- Yishan Li, Yanming Guo, Yulun Wu, Yuxiang Xie, Mingrui Lao, Tianyuan Yu, Yirun Ruan:
RDAT: an efficient regularized decoupled adversarial training mechanism. 24
- Gaurav Sharma, Maheep Singh:
A spatiotemporal bidirectional network for video salient object detection using multiscale transfer learning. 25
Volume 13, Number 3, September 2024
- Pranjal Kumar:
Adversarial attacks and defenses for large language models (LLMs): methods, frameworks & challenges. 26
- Peng Zhao, Qiangchang Wang, Yilong Yin:
DSPformer: discovering semantic parts with token growth and clustering for zero-shot learning. 27
- Abdessamad Elboushaki, Rachida Hannane, Karim Afdel:
Similarity-based face image retrieval using sparsely embedded deep features and binary code learning. 28
- Davar Giveki:
Human action recognition using an optical flow-gated recurrent neural network. 29
- Zhiwen Wang, Donglin Zhang, Zhikai Hu:
LSECA: local semantic enhancement and cross aggregation for video-text retrieval. 30
- Yang Deng, Yonghong Li, Sidong Xian, Laquan Li, Haiyang Qiu:
Mual: enhancing multimodal sentiment analysis with cross-modal attention and difference loss. 31
- Neelu Verma, Anik De, Anand Mishra:
Bridging language to visuals: towards natural language query-to-chart image retrieval. 32
- Jianying Huang, Hoon Kang:
3D skeleton-based human motion prediction using spatial-temporal graph convolutional network. 33
- Thiago Kobashigawa Amorim, Helton Hideraldo Bíscaro:
A order-based content-based information retrieval system proposal applied in 3D meshes. 34
- M. M. Mahabubur Rahman, Jelena Tesic:
Stratified Graph Indexing for efficient search in deep descriptor databases. 35
- Nouara Boudouh, Bilal Mokhtari, Sebti Foufou:
Enhancing deep learning image classification using data augmentation and genetic algorithm-based optimization. 36
- Anna-Maria Christodoulou, Olivier Lartillot, Alexander Refsum Jensenius:
Multimodal music datasets? Challenges and future goals in music processing. 37
Volume 13, Number 4, December 2024
- Sandeep Chand Kumain, Maheep Singh, Lalit Kumar Awasthi:
DBTSF-VSOD: a decision-based two-stage framework for video salient object detection. 38
- Rui Wang, Jiawei Zhu, Shoujin Wang, Tao Wang, Jingze Huang, Xianxun Zhu:
Multi-modal emotion recognition using tensor decomposition fusion and self-supervised multi-tasking. 39
- Archana Singh, Dhiraj:
Advancements in machine learning techniques for threat item detection in X-ray images: a comprehensive survey. 40
- Chintoo Kumar, C. Ravindranath Chowdary, Ashok Kumar Meena:
Recent trends in recommender systems: a survey. 41