Cited By
View all- Ruan JWu YWan XZhu Y(2024)Describe Images in a Boring Way: Towards Cross-Modal Sarcasm Generation2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00560(5689-5698)Online publication date: 3-Jan-2024
- Li GYe HQi YWang SQing LHuang QYang M(2024)Learning Hierarchical Modular Networks for Video CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.332767746:2(1049-1064)Online publication date: Mar-2024
- Wu SFu XWu FZha Z(2024)Vision-and-Language Navigation via Latent Semantic Alignment LearningIEEE Transactions on Multimedia10.1109/TMM.2024.335811226(8406-8418)Online publication date: 2024
- Show More Cited By