Computer Science and Information Systems 2024 Volume 21, Issue 4, Pages: 1483-1498
https://doi.org/10.2298/CSIS230829044L
Learning discriminative representations through an attention mechanism for image-based person re-identification
Liu Jing (School of Computer Science, Weinan Normal University, Weinan, Shaanxi, China), liujing8318@mail.nwpu.edu.cn
Zhou Guoqing (School of Computer Science, Northwestern Polytechnical University, Xi'an, Shaanxi, China), Zhouguoqing@nwpu.edu.cn
In recent years, person re-identification (re-ID) has attracted considerable attention in computer vision. However, existing methods mainly focus on stacking a large number of deep architecture layers, which is not well suited to extracting robust features for person re-ID. In this paper, we present a novel hybrid framework, PGAN, through which discriminative representations can be learned for person re-ID. Specifically, a novel self-attention method, the channel-wise attention mechanism, is adopted to learn informative representations from the patch network and the global network, respectively. In addition, the CSwin Transformer is exploited to re-extract discriminative features from the residual blocks. On the CUHK03-NP dataset, we obtain a mAP of 81.8% on the labeled set and 80.3% on the detected set. On the DukeMTMC-reID and Market-1501 datasets, we obtain a mAP of 83.4% and 91.3%, respectively. Comprehensive experiments on the three datasets (Market-1501, DukeMTMC-reID and CUHK03-NP) demonstrate the effectiveness of the introduced approach.
Keywords: person re-identification, channel-wise attention, deep learning
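The results above are reported as mean average precision (mAP). As a rough illustration of how this retrieval metric is commonly computed, the following minimal sketch assumes each query has already produced a gallery ranking sorted by descending similarity, flagged by whether each gallery item shares the query identity. The function names are hypothetical, and the sketch omits benchmark-specific steps such as discarding gallery images from the same camera as the query; it is not taken from the paper's evaluation code.

```python
from typing import Sequence


def average_precision(ranked_matches: Sequence[bool]) -> float:
    """Average precision for a single query.

    ranked_matches[i] is True when the gallery item at rank i + 1
    (sorted by descending similarity) has the same identity as the query.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_match in enumerate(ranked_matches, start=1):
        if is_match:
            hits += 1
            # Precision measured at the rank of each correct match.
            precision_sum += hits / rank
    # A query with no correct match contributes 0 in this sketch (an assumption).
    return precision_sum / hits if hits else 0.0


def mean_average_precision(all_ranked_matches: Sequence[Sequence[bool]]) -> float:
    """Mean of the per-query average precision values."""
    aps = [average_precision(matches) for matches in all_ranked_matches]
    return sum(aps) / len(aps) if aps else 0.0


if __name__ == "__main__":
    # Two toy queries: correct matches at ranks 1 and 3, and at rank 2.
    rankings = [[True, False, True], [False, True]]
    print(f"mAP = {mean_average_precision(rankings):.4f}")
```

Under this definition a higher mAP means that correct identities are, on average, ranked nearer the top of the gallery list across all queries.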