Cited By
View all- Tang PTan YXia J(2023)Deep sequential collaborative cognition of vision and language based model for video descriptionMultimedia Tools and Applications10.1007/s11042-023-14887-z82:23(36207-36230)Online publication date: 17-Mar-2023
- Wang SGao LLyu XGuo YZeng PSong JMagalhães Jdel Bimbo ASatoh SSebe NAlameda-Pineda XJin QOria VToni L(2022)Dynamic Scene Graph Generation via Temporal Prior InferenceProceedings of the 30th ACM International Conference on Multimedia10.1145/3503161.3548324(5793-5801)Online publication date: 10-Oct-2022
- Wang YLi KChen GZhang YGuo DWang M(2022)Spatiotemporal contrastive modeling for video moment retrievalWorld Wide Web10.1007/s11280-022-01105-326:4(1525-1544)Online publication date: 26-Sep-2022
- Show More Cited By