Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.13934 (cs)

[Submitted on 23 Nov 2023]

Title:Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning

Authors:Seonghak Kim, Gyeongdo Ham, Yucheol Cho, Daeshik Kim

View PDF

Abstract:The improvement in the performance of efficient and lightweight models (i.e., the student model) is achieved through knowledge distillation (KD), which involves transferring knowledge from more complex models (i.e., the teacher model). However, most existing KD techniques rely on Kullback-Leibler (KL) divergence, which has certain limitations. First, if the teacher distribution has high entropy, the KL divergence's mode-averaging nature hinders the transfer of sufficient target information. Second, when the teacher distribution has low entropy, the KL divergence tends to excessively focus on specific modes, which fails to convey an abundant amount of valuable knowledge to the student. Consequently, when dealing with datasets that contain numerous confounding or challenging samples, student models may struggle to acquire sufficient knowledge, resulting in subpar performance. Furthermore, in previous KD approaches, we observed that data augmentation, a technique aimed at enhancing a model's generalization, can have an adverse impact. Therefore, we propose a Robustness-Reinforced Knowledge Distillation (R2KD) that leverages correlation distance and network pruning. This approach enables KD to effectively incorporate data augmentation for performance improvement. Extensive experiments on various datasets, including CIFAR-100, FGVR, TinyImagenet, and ImageNet, demonstrate our method's superiority over current state-of-the-art methods.

Comments:	11 pages, 7 figures. This work has been submitted to the IEEE for possible publication
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.13934 [cs.CV]
	(or arXiv:2311.13934v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.13934

Submission history

From: Seonghak Kim [view email]
[v1] Thu, 23 Nov 2023 11:34:48 UTC (263 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators