Li H, Wang W, Wang X, Yuan X and Xu X. (2024). Blind 3D Video Stabilization with Spatio-Temporally Varying Motion Blur. ACM Transactions on Multimedia Computing, Communications, and Applications. 20:11. (1-23). Online publication date: 30-Nov-2024.

Zhang P, Liu M, Song X, Cao D, Gao Z and Nie L. (2024). Universal Relocalizer for Weakly Supervised Referring Expression Grounding. ACM Transactions on Multimedia Computing, Communications, and Applications. 20:7. (1-23). Online publication date: 31-Jul-2024.

Antil A and Dhiman C. (2024). MF2ShrT: Multimodal Feature Fusion Using Shared Layered Transformer for Face Anti-spoofing. ACM Transactions on Multimedia Computing, Communications, and Applications. 20:6. (1-21). Online publication date: 30-Jun-2024.

https://doi.org/10.1145/3640817

Ben H, Wang S, Wang M and Hong R. Pseudo Content Hallucination for Unpaired Image Captioning. Proceedings of the 2024 International Conference on Multimedia Retrieval. (320-329).

https://doi.org/10.1145/3652583.3658080

Zhang D, Zhu W, Liao X, Qi F, Yang G and Ding X. (2024). Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video Detection. ACM Transactions on Multimedia Computing, Communications, and Applications. 0:0.

https://doi.org/10.1145/3664654

Li M, Zhou T, Huang Z, Yang J, Yang J and Gong C. (2024). Dynamic Weighted Adversarial Learning for Semi-Supervised Classification under Intersectional Class Mismatch. ACM Transactions on Multimedia Computing, Communications, and Applications. 20:4. (1-24). Online publication date: 30-Apr-2024.

https://doi.org/10.1145/3635310

Shi P, Hu M, Shi X and Ren F. (2024). Deep Modular Co-Attention Shifting Network for Multimodal Sentiment Analysis. ACM Transactions on Multimedia Computing, Communications, and Applications. 20:4. (1-23). Online publication date: 30-Apr-2024.

https://doi.org/10.1145/3634706

Feng Z, Xu J, Ma L and Zhang S. (2024). Efficient Video Transformers via Spatial-temporal Token Merging for Action Recognition. ACM Transactions on Multimedia Computing, Communications, and Applications. 20:4. (1-21). Online publication date: 30-Apr-2024.

https://doi.org/10.1145/3633781

Zhang Y, Lin X, Yang H, He J, Qing L, He X, Li Y, Chen H and Khosravi M. (2024). A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-Resolution. International Journal of Intelligent Systems. 2024. Online publication date: 1-Jan-2024.

https://doi.org/10.1155/2024/3255233

Nai K and Chen S. (2024). Learning a Novel Ensemble Tracker for Robust Visual Tracking. IEEE Transactions on Multimedia. 26. (3194-3206). Online publication date: 1-Jan-2024.

https://doi.org/10.1109/TMM.2023.3307939

Anand S, Devulapally N, Bhattacharjee S and Yuan J. Multi-label Emotion Analysis in Conversation via Multimodal Knowledge Distillation. Proceedings of the 31st ACM International Conference on Multimedia. (6090-6100).

https://doi.org/10.1145/3581783.3612517

Lu J, Wang S, Zhang X, Hao Y and He X. Semantic-based Selection, Synthesis, and Supervision for Few-shot Learning. Proceedings of the 31st ACM International Conference on Multimedia. (3569-3578).

https://doi.org/10.1145/3581783.3611784

Peng Y, He L, Hu D, Liu Y, Yang L and Shang S. Decoupling Deep Learning for Enhanced Image Recognition Interpretability. ACM Transactions on Multimedia Computing, Communications, and Applications. 0:0.

https://doi.org/10.1145/3674837