Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3372278.3390696acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Visible-infrared Person Re-identification via Colorization-based Siamese Generative Adversarial Network

Published: 08 June 2020 Publication History

Abstract

With explosive surveillance data during day and night, visible-infrared person re-identification (VI-ReID) is an emerging challenge due to the apparent cross-modality discrepancy between visible and infrared images. Existing VI-ReID work mainly focuses on learning a robust feature to represent a person in both modalities despite the modality gap cannot be effectively eliminated. Recent research works have proposed various generative adversarial network (GAN) models to transfer the visible modality to another unified modality, aiming to bridge the cross-modality gap. However, they neglect the information loss caused by transferring the domain of visible images which is significant for identification. To effectively address the problems, we observe that key information such as textures and semantics in an infrared image can help to color the image itself and the colored infrared image maintains rich information from infrared image while reducing the discrepancy with the visible image. We therefore propose a colorization-based Siamese generative adversarial network (CoSiGAN) for VI-ReID to bridge the cross-modality gap, by retaining the identity of the colored infrared image. Furthermore, we also propose a feature-level fusion model to supplement the transfer loss of colorization. The experiments conducted on two cross-modality person re-identification datasets demonstrate the superiority of the proposed method compared with the state-of-the-arts.

References

[1]
Pingyang Dai, Rongrong Ji, Haibin Wang, Qiong Wu, and Yuyu Huang. 2018. Cross-Modality Person Re-Identification with Generative Adversarial Training. In IJCAI. 677--683.
[2]
Navneet Dalal and Bill Triggs. 2005. Histograms of oriented gradients for human detection.
[3]
Fartash Faghri, David J Fleet, Jamie Ryan Kiros, and Sanja Fidler. 2017. Vse+: Improving visual-semantic embeddings with hard negatives. arXiv preprint arXiv:1707.05612 (2017).
[4]
Jiuxiang Gu, Jianfei Cai, Shafiq R Joty, Li Niu, and Gang Wang. 2018. Look, imagine and match: Improving textual-visual cross-modal retrieval with generative models. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7181--7189.
[5]
Yi Hao, Nannan Wang, Xinbo Gao, Jie Li, and Xiaoyu Wang. 2019. Dual-alignment Feature Embedding for Cross-modality Person Re-identification. In Proceedings of the 27th ACM International Conference on Multimedia. 57--65.
[6]
Chih-Chung Hsu, Chia-Wen Lin, Weng-Tai Su, and Gene Cheung. 2019. Sigan: Siamese generative adversarial network for identity-preserving face hallucination. IEEE Transactions on Image Processing, Vol. 28, 12 (2019), 6225--6236.
[7]
Wenxin Huang, Ruimin Hu, Chao Liang, Yi Yu, Zheng Wang, Xian Zhong, and Chunjie Zhang. 2016. Camera Network Based Person Re-identification by Leveraging Spatial-Temporal Constraint and Multiple Cameras Relations. In International Conference on Multimedia Modeling. 174--186.
[8]
Ziling Huang, Zheng Wang, Wei Hu, Chia-Wen Lin, and Shin'ichi Satoh. 2019. DoT-GNN: Domain-Transferred Graph Neural Network for Group Re-identification. In ACM MM.
[9]
Kajal Kansal, AV Subramanyam, Zheng Wang, and Shin'ichi Satoh. 2020. SDL: Spectrum-Disentangled Representation Learning for Visible-Infrared Person Re-identification. IEEE Transactions on Circuits and Systems for Video Technology (2020).
[10]
Ryan Kiros, Ruslan Salakhutdinov, and Richard S Zemel. 2014. Unifying visual-semantic embeddings with multimodal neural language models. arXiv preprint arXiv:1411.2539 (2014).
[11]
Kuang-Huei Lee, Xi Chen, Gang Hua, Houdong Hu, and Xiaodong He. 2018. Stacked cross attention for image-text matching. Proceedings of the European Conference on Computer Vision (ECCV). 201--216.
[12]
Anat Levin, Dani Lischinski, and Yair Weiss. 2004. Colorization using optimization. In ACM transactions on graphics (tog), Vol. 23. ACM, 689--694.
[13]
Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, and Xiaogang Wang. 2017a. Identity-aware textual-visual matching with latent co-attention. In Proceedings of the IEEE International Conference on Computer Vision. 1890--1899.
[14]
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang. 2017b. Person search with natural language description. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1970--1979.
[15]
Shengcai Liao, Yang Hu, Xiangyu Zhu, and Stan Z Li. 2015. Person re-identification by local maximal occurrence representation and metric learning. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2197--2206.
[16]
Shengcai Liao and Stan Z Li. 2015. Efficient psd constrained asymmetric metric learning for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision. 3685--3693.
[17]
Liang Lin, Guangrun Wang, Wangmeng Zuo, Xiangchu Feng, and Lei Zhang. 2016. Cross-domain visual matching via generalized similarity measure and feature learning. IEEE transactions on pattern analysis and machine intelligence, Vol. 39, 6 (2016), 1089--1102.
[18]
Chuanchen Luo, Yuntao Chen, Naiyan Wang, and Zhaoxiang Zhang. 2018. Spectral feature transformation for person re-identification. arXiv preprint arXiv:1811.11405 (2018).
[19]
Kamyar Nazeri, Eric Ng, and Mehran Ebrahimi. 2018. Image Colorization with Generative Adversarial Networks. arXiv preprint arXiv:1803.05400 (2018).
[20]
Dat Nguyen, Hyung Hong, Ki Kim, and Kang Park. 2017. Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors, Vol. 17, 3 (2017), 605.
[21]
Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, and Gang Hua. 2017. Hierarchical multimodal lstm for dense visual-semantic embedding. In Proceedings of the IEEE International Conference on Computer Vision. 1881--1889.
[22]
Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015).
[23]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention. Springer, 234--241.
[24]
Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, and Shengjin Wang. 2018. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the European Conference on Computer Vision (ECCV). 480--496.
[25]
Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Yung-Yu Chuang, and Shin'ichi Satoh. 2019. Learning to reduce dual-level discrepancy for infrared-visible person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 618--626.
[26]
Tomihisa Welsh, Michael Ashikhmin, and Klaus Mueller. 2002. Transferring color to greyscale images. In ACM transactions on graphics (TOG), Vol. 21. ACM, 277--280.
[27]
Ancong Wu, Wei-Shi Zheng, Hong-Xing Yu, Shaogang Gong, and Jianhuang Lai. 2017. Rgb-infrared cross-modality person re-identification. In Proceedings of the IEEE International Conference on Computer Vision. 5380--5389.
[28]
Xiang Wu, Huaibo Huang, Vishal M Patel, Ran He, and Zhenan Sun. 2019. Disentangled variational representation for heterogeneous face recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 9005--9012.
[29]
Fan Yang, Zheng Wang, Jing Xiao, and Shin'ichi Satoh. 2020. Mining on Heterogeneous Manifolds for Zero-shot Cross-modal Image Retrieval. In AAAI.
[30]
Mang Ye, Yi Cheng, Xiangyuan Lan, and Hongyuan Zhu. 2019 a. Improving Night-Time Pedestrian Retrieval with Distribution Alignment and Contextual Distance. IEEE Transactions on Industrial Informatics (2019).
[31]
Mang Ye, Xiangyuan Lan, and Qingming Leng. 2019 b. Modality-aware Collaborative Learning for Visible Thermal Person Re-Identification. In Proceedings of the 27th ACM International Conference on Multimedia. ACM, 347--355.
[32]
Mang Ye, Xiangyuan Lan, Jiawei Li, and Pong C Yuen. 2018a. Hierarchical discriminative learning for visible thermal person re-identification. In Thirty-Second AAAI Conference on Artificial Intelligence.
[33]
Mang Ye, Xiangyuan Lan, Zheng Wang, and Pong C Yuen. 2019 c. Bi-directional Center-Constrained Top-Ranking for Visible Thermal Person Re-Identification. IEEE Transactions on Information Forensics and Security (2019).
[34]
Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, and Steven C. H. Hoi. 2020. Deep Learning for Person Re-identification: A Survey and Outlook. arXiv preprint arXiv:2001.04193 (2020).
[35]
Mang Ye, Zheng Wang, Xiangyuan Lan, and Pong C Yuen. 2018b. Visible Thermal Person Re-Identification via Dual-Constrained Top-Ranking. In IJCAI. 1092--1099.
[36]
Kaiyang Zhou, Xiatian Zhu, Yongxin Yang, Andrea Cavallaro, and Tao Xiang. 2019. Learning Generalisable Omni-Scale Representations for Person Re-Identification. arXiv preprint arXiv:1910.06827 (2019).

Cited By

View all
  • (2024)Relation-Aware Weight Sharing in Decoupling Feature Learning Network for UAV RGB-Infrared Vehicle Re-IdentificationIEEE Transactions on Multimedia10.1109/TMM.2024.340067526(9839-9853)Online publication date: 2024
  • (2024)Modality Bias Calibration Network via Information Disentanglement for Visible–Infrared Person ReidentificationIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.339869611:5(6925-6938)Online publication date: Oct-2024
  • (2024)Margin-enhanced average precision optimization for visible-infrared person re-identificationComputers and Electrical Engineering10.1016/j.compeleceng.2024.109751120(109751)Online publication date: Dec-2024
  • Show More Cited By

Index Terms

  1. Visible-infrared Person Re-identification via Colorization-based Siamese Generative Adversarial Network

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '20: Proceedings of the 2020 International Conference on Multimedia Retrieval
    June 2020
    605 pages
    ISBN:9781450370875
    DOI:10.1145/3372278
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 June 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. colorization
    2. cross-modality
    3. person re-identification
    4. siamese generative adversarial network
    5. visible-infrared

    Qualifiers

    • Research-article

    Funding Sources

    • Hubei Key Laboratory of Transportation Internet of Things
    • Fundamental Research Funds for the Central Universities
    • National Natural Science Foundation of China

    Conference

    ICMR '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)30
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Relation-Aware Weight Sharing in Decoupling Feature Learning Network for UAV RGB-Infrared Vehicle Re-IdentificationIEEE Transactions on Multimedia10.1109/TMM.2024.340067526(9839-9853)Online publication date: 2024
    • (2024)Modality Bias Calibration Network via Information Disentanglement for Visible–Infrared Person ReidentificationIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.339869611:5(6925-6938)Online publication date: Oct-2024
    • (2024)Margin-enhanced average precision optimization for visible-infrared person re-identificationComputers and Electrical Engineering10.1016/j.compeleceng.2024.109751120(109751)Online publication date: Dec-2024
    • (2024)Cross-modality person re-identification via modality-synergy alignment learningMachine Vision and Applications10.1007/s00138-024-01612-535:6Online publication date: 26-Sep-2024
    • (2024)Learning a Robust Synthetic Modality with Dual-Level Alignment for Visible-Infrared Person Re-identificationPattern Recognition and Computer Vision10.1007/978-981-97-8620-6_20(289-303)Online publication date: 20-Oct-2024
    • (2023)Person Re-Identification with RGB–D and RGB–IR Sensors: A Comprehensive SurveySensors10.3390/s2303150423:3(1504)Online publication date: 29-Jan-2023
    • (2023)Margin-Based Modal Adaptive Learning for Visible-Infrared Person Re-IdentificationSensors10.3390/s2303142623:3(1426)Online publication date: 27-Jan-2023
    • (2023)SFANet: A Spectrum-Aware Feature Augmentation Network for Visible-Infrared Person ReidentificationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.310570234:4(1958-1971)Online publication date: Apr-2023
    • (2023)Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-IdentificationIEEE Journal of Selected Topics in Signal Processing10.1109/JSTSP.2022.323371617:3(545-559)Online publication date: May-2023
    • (2023)Visible–Thermal Person Reidentification in Visual Internet of Things With Random Gray Data Augmentation and a New Pooling MechanismIEEE Internet of Things Journal10.1109/JIOT.2022.323318310:10(9022-9037)Online publication date: 15-May-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media