Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleNovember 2024
JPA: A Joint-Part Attention for Mitigating Overfocusing on 3D Human Pose Estimation
AbstractRecently, transformer-based solutions have exhibited remarkable success in 3D human pose estimation (3D-HPE) by computing pairwise relations between joints. However, we observed that the conventional self-attention mechanism in 3D-HPE tends to ...
- research-articleJune 2024
Iterative Semantic Transformer by Greedy Distillation for Community Question Answering
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Volume 32Pages 3576–3588https://doi.org/10.1109/TASLP.2024.3414329The semantic matching problem consists of recognizing if the candidate text is relevant to a particular input text. Semantic similarities can be determined from human-curated knowledge, but such knowledge may not be available in every language. Instead, ...
- research-articleJune 2023
Question-aware dynamic scene graph of local semantic representation learning for visual question answering
Pattern Recognition Letters (PTRL), Volume 170, Issue CPages 93–99https://doi.org/10.1016/j.patrec.2023.04.014Highlights- Dynamic scene graph is adaptive to different questions.
- Word-level co-attention mechanism to refine node features and edge features.
- Dynamic scene graph is beneficial to the logistic reasoning.
- The proposed method ...
In visual question answering task, it is vital to learn the semantic interactions between the question and target objects in the input image. Existing scene graph-based methods generally extract global features from the image and then perform ...
- research-articleMay 2023
Deep Progressive Asymmetric Quantization Based on Causal Intervention for Fine-Grained Image Retrieval
IEEE Transactions on Multimedia (TOM), Volume 26Pages 1306–1318https://doi.org/10.1109/TMM.2023.3279990In the field of computer vision, fine-grained image retrieval is an extremely challenging task due to the inherently subtle intra-class object variations. In addition, the high-dimensional real-valued features extracted from large-scale fine-grained image ...
- ArticleMarch 2023
Unsupervised Encoder-Decoder Model for Anomaly Prediction Task
AbstractFor the anomaly detection task of video sequences, CNN-based methods have been able to learn to describe the normal situation without abnormal samples at training time by reconstructing the input frame or predicting the future frame, and then use ...
- research-articleJanuary 2023
Car Emotion Labeling Based on Color-SSL Semi-Supervised Learning Algorithm by Color Augmentation
International Journal of Intelligent Systems (IJIS), Volume 2023https://doi.org/10.1155/2023/4331838In the era of emotional consumption, it has become a hot topic that commodities meet consumers’ emotional needs. As a necessity of life, the car also needs to meet the needs of consumers. To achieve that consumers can purchase cars according to their ...
- research-articleOctober 2022
Scribble-attention hierarchical network for weakly supervised salient object detection in optical remote sensing images
Applied Intelligence (KLU-APIN), Volume 53, Issue 10Pages 12999–13017https://doi.org/10.1007/s10489-022-04014-0AbstractArising from cluttered background interference and various scaled objects, salient object detection (SOD) in optical remote sensing images (RSIs) is a challenging task. In the latest research, supervision-based methods have made significant ...