Block dictionary learning-driven convolutional neural networks for fewshot face recognition

358 Accesses
7 Citations
Explore all metrics

Abstract

Fewshot face recognition (FFR) in less constrained environment is an important but challenging task due to the lack of sufficient sample information and the impact of occlusion. In this paper, a novel approach called block dictionary learning (BDL) is proposed, which combines sparse representation with convolutional neural networks to address the FFR problem. Based on the key-point locations of face images, the images are divided into four block regions for local feature extraction. Then, highly compact and discriminative features of both holistic and segmented parts are generated by CNN, which further compensates for the shortage of samples. Moreover, the sparse loss is introduced to optimize the performance of CNN by increasing the inter-class variations of features; thus, it develops a global-to-local dictionary learning algorithm to improve the robustness of BDL against complex variations. Finally, extensive experiments on AR and Extended Yale B datasets significantly demonstrate the effectiveness of BDL in comparison with other FFR methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep Learning Based Face Recognition with Sparse Representation Classification

Single Sample Face Recognition by Sparse Recovery of Deep-Learned LDA Features

Sparse Representation Based Face Recognition Using VGGFace

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Jafri, R., Arabnia, H.R.: A survey of face recognition techniques. J. Inf. Process. Syst. 5(2), 41–68 (2009)
Article Google Scholar
Jolliffe, I.: Principal Component Analysis, pp. 1094–1096. Springer, Berlin (2011)
Google Scholar
Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2008)
Article Google Scholar
Zhang, L., Yang, M., Feng, X.: Sparse representation or collaborative representation: Which helps face recognition?. In: 2011 International Conference on Computer Vision, pp. 471–478. IEEE (2011)
Kan, M., Shan, S., Su, Y., Chen, X., Gao, W.: Adaptive discriminant analysis for face recognition from single sample per person. In: Face and Gesture 2011, pp. 193–199. IEEE (2011)
Deng, W., Hu, J., Guo, J.: Extended SRC: undersampled face recognition via intraclass variant dictionary. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1864–1870 (2012)
Article Google Scholar
Ding, R.X., Du, D.K., Huang, Z.H., Li, Z.M., Shang, K.: Variational feature representation-based classification for face recognition with single sample per person. J. Vis. Commun. Image Represent. 30, 35–45 (2015)
Article Google Scholar
Yang, M., Van Gool, L., Zhang, L.: Sparse variation dictionary learning for face recognition with a single training sample per person. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 689–696 (2013)
Gao, Q.X., Zhang, L., Zhang, D.: Face recognition using FLDA with single training image per person. Appl. Math. Comput. 205(2), 726–734 (2008)
MATH Google Scholar
Liu, F., Tang, J., Song, Y., Bi, Y., Yang, S.: Local structure based multi-phase collaborative representation for face recognition with single sample per person. Inf. Sci. 346, 198–215 (2016)
Google Scholar
Hu, C., Ye, M., Ji, S., Zeng, W., Lu, X.: A new face recognition method based on image decomposition for single sample per person problem. Neurocomputing 160, 287–299 (2015)
Article Google Scholar
He, L., Li, H., Zhang, Q., Sun, Z.: Dynamic feature matching for partial face recognition. IEEE Trans. Image Process. 28(2), 791–802 (2018)
Article MathSciNet Google Scholar
Wright, J., Ma, Y.: Dense error correction via l1 minimization. IEEE Trans. Inf. Theory 56(7), 3540–3560 (2010)
Article Google Scholar
Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE PAMI 31(2), 210–227 (2009)
Article Google Scholar
Yang, M., Hang, L.: Gabor feature based sparse representation for face recognition with Gabor occlusion dictionary. In: ECCV (2010)
Gao, S.H., Tsang, I.W-H., Chia, L.-T.: Kernel sparse representation for image classification and face recognition. In: ECCV (2010)
Yang, M., Zhang, L.: Gabor Feature based sparse representation for face recognition with Gabor occlusion dictionary. In: ECCV (2010)
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR (2009)
Sun, Y., Chen, Y., Wang, X., Tang, X.: Deep learning face representation by joint identification-verification. In: NIPS (2014)
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: DeepFace: closing the gap to human-level performance in face verification. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch. CoRR, vol. abs/1411.7923 (2014)
Schroff, F., Kalenichenko, D., Philbin, J.F.: A unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Chu, Y., Zhao, L., Ahma, T.: Multiple feature subspaces analysis for single sample per person face recognition. Vis. Comput. 35(2), 239–256 (2019)
Article Google Scholar
Deng, W., Hu, J., Zhou, X., Guo, J.: Equidistant prototypes embedding for single sample based face recognition with generic learning and incremental learning. Pattern Recognit. 47(12), 3738–3749 (2014)
Article Google Scholar
Lu, J., Tan, Y.P., Wang, G.: Discriminative multimanifold analysis for face recognition from a single training sample per person. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 39–51 (2012)
Article Google Scholar
Zhu, P., Yang, M., Zhang, L., Lee, I.Y.: Local generic representation for face recognition with single sample per person. In: Asian Conference on Computer Vision, pp. 34–50. Springer, Cham (2014)
Abdellatef, E., Ismail, N.A., Abd Elrahman, S.E.S.E., et al.: Cancelable multi-biometric recognition system based on deep learning. Vis. Comput. (2019). https://doi.org/10.1007/s00371-019-01715-5
Agrawal, A., Mittal, N.: Using CNN for facial expression recognition: a study of the effects of kernel size and number of filters on accuracy. Vis. Comput. (2019). https://doi.org/10.1007/s00371-019-01630-9
Gao, G., Yang, J., Jing, X.-Y., et al.: Learning robust and discriminative low-rank representations for face recognition with occlusion. Pattern Recognit. 66, 129–143 (2017)
Article Google Scholar
Gao, G., Yu, Y., Meng, Y., et al.: Cross-resolution face recognition with pose variations via multilayer locality-constrained structural orthogonal procrustes regression. Inf. Sci. 506, 19–36 (2020)
Article MathSciNet Google Scholar
Zhang, K., Zhang, Z., Li, Z., Qiao, Y.: Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process. Lett. 23(10), 1499–1503 (2016)
Article Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2014). arXiv preprint arXiv:1412.6980
Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch (2014). arXiv preprint arXiv:1411.7923
Martinez, A.M.: The AR face database. CVC Technical Report 24 (1998)
Lee, K.C., Ho, J., Kriegman, D.J.: Acquiring linear subspaces for face recognition under variable lighting. IEEE Trans. Pattern Anal. Mach. Intell. 5, 684–698 (2005)
Google Scholar
Gao, S., Jia, K., Zhuang, L., Ma, Y.: Neither global nor local: regularized patch-based representation for single sample per person face recognition. Int. J. Comput. Vis. 111(3), 365–383 (2015)
Article MathSciNet Google Scholar
Ji, H.K., Sun, Q.S., Ji, Z.X., Yuan, Y.H., Zhang, G.Q.: Collaborative probabilistic labels for face recognition from single sample per person. Pattern Recognit. 62(C), 125–134 (2017)
Article Google Scholar
Shang, K., Huang, Z.-H., Liu, W., Li, Z.M.: A single gallery-based face recognition using extended joint sparse representation. Appl. Math. Comput. 320, 99–115 (2018)
MATH Google Scholar
Pei, T., Zhang, L., Wang, B., Li, F., Zhang, Z.: Decision pyramid classifier for face recognition under complex variations using single sample per person. Pattern Recognit. 64(C), 305–313 (2016)
Google Scholar

Download references

Acknowledgements

Part of this research was carried out at Key Laboratory of Measurement and Control of Complex Systems of Engineering, Nanjing, China. Acknowledgements. This work was supported by the Natural National Science Foundation of China (Grant Nos. 51475092, 61462072) and Natural Science Foundation of Jiangsu Province of China (Grant Nos. BK20181269, BK20160693). This project was funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions, the Fundamental Research Funds for the Central Universities.

Author information

Authors and Affiliations

School of Automation, Southeast University, Nanjing, China
Qiao Du & Feipeng Da

Authors

Qiao Du
View author publications
You can also search for this author in PubMed Google Scholar
Feipeng Da
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qiao Du.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Du, Q., Da, F. Block dictionary learning-driven convolutional neural networks for fewshot face recognition. Vis Comput 37, 663–672 (2021). https://doi.org/10.1007/s00371-020-01802-y

Download citation

Published: 22 January 2020
Issue Date: April 2021
DOI: https://doi.org/10.1007/s00371-020-01802-y

Block dictionary learning-driven convolutional neural networks for fewshot face recognition

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Deep Learning Based Face Recognition with Sparse Representation Classification

Single Sample Face Recognition by Sparse Recovery of Deep-Learned LDA Features

Sparse Representation Based Face Recognition Using VGGFace

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Block dictionary learning-driven convolutional neural networks for fewshot face recognition

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Deep Learning Based Face Recognition with Sparse Representation Classification

Single Sample Face Recognition by Sparse Recovery of Deep-Learned LDA Features

Sparse Representation Based Face Recognition Using VGGFace

Explore related subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation