Abstract
Fewshot face recognition (FFR) in less constrained environment is an important but challenging task due to the lack of sufficient sample information and the impact of occlusion. In this paper, a novel approach called block dictionary learning (BDL) is proposed, which combines sparse representation with convolutional neural networks to address the FFR problem. Based on the key-point locations of face images, the images are divided into four block regions for local feature extraction. Then, highly compact and discriminative features of both holistic and segmented parts are generated by CNN, which further compensates for the shortage of samples. Moreover, the sparse loss is introduced to optimize the performance of CNN by increasing the inter-class variations of features; thus, it develops a global-to-local dictionary learning algorithm to improve the robustness of BDL against complex variations. Finally, extensive experiments on AR and Extended Yale B datasets significantly demonstrate the effectiveness of BDL in comparison with other FFR methods.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Jafri, R., Arabnia, H.R.: A survey of face recognition techniques. J. Inf. Process. Syst. 5(2), 41–68 (2009)
Jolliffe, I.: Principal Component Analysis, pp. 1094–1096. Springer, Berlin (2011)
Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2008)
Zhang, L., Yang, M., Feng, X.: Sparse representation or collaborative representation: Which helps face recognition?. In: 2011 International Conference on Computer Vision, pp. 471–478. IEEE (2011)
Kan, M., Shan, S., Su, Y., Chen, X., Gao, W.: Adaptive discriminant analysis for face recognition from single sample per person. In: Face and Gesture 2011, pp. 193–199. IEEE (2011)
Deng, W., Hu, J., Guo, J.: Extended SRC: undersampled face recognition via intraclass variant dictionary. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1864–1870 (2012)
Ding, R.X., Du, D.K., Huang, Z.H., Li, Z.M., Shang, K.: Variational feature representation-based classification for face recognition with single sample per person. J. Vis. Commun. Image Represent. 30, 35–45 (2015)
Yang, M., Van Gool, L., Zhang, L.: Sparse variation dictionary learning for face recognition with a single training sample per person. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 689–696 (2013)
Gao, Q.X., Zhang, L., Zhang, D.: Face recognition using FLDA with single training image per person. Appl. Math. Comput. 205(2), 726–734 (2008)
Liu, F., Tang, J., Song, Y., Bi, Y., Yang, S.: Local structure based multi-phase collaborative representation for face recognition with single sample per person. Inf. Sci. 346, 198–215 (2016)
Hu, C., Ye, M., Ji, S., Zeng, W., Lu, X.: A new face recognition method based on image decomposition for single sample per person problem. Neurocomputing 160, 287–299 (2015)
He, L., Li, H., Zhang, Q., Sun, Z.: Dynamic feature matching for partial face recognition. IEEE Trans. Image Process. 28(2), 791–802 (2018)
Wright, J., Ma, Y.: Dense error correction via l1 minimization. IEEE Trans. Inf. Theory 56(7), 3540–3560 (2010)
Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE PAMI 31(2), 210–227 (2009)
Yang, M., Hang, L.: Gabor feature based sparse representation for face recognition with Gabor occlusion dictionary. In: ECCV (2010)
Gao, S.H., Tsang, I.W-H., Chia, L.-T.: Kernel sparse representation for image classification and face recognition. In: ECCV (2010)
Yang, M., Zhang, L.: Gabor Feature based sparse representation for face recognition with Gabor occlusion dictionary. In: ECCV (2010)
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR (2009)
Sun, Y., Chen, Y., Wang, X., Tang, X.: Deep learning face representation by joint identification-verification. In: NIPS (2014)
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: DeepFace: closing the gap to human-level performance in face verification. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch. CoRR, vol. abs/1411.7923 (2014)
Schroff, F., Kalenichenko, D., Philbin, J.F.: A unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Chu, Y., Zhao, L., Ahma, T.: Multiple feature subspaces analysis for single sample per person face recognition. Vis. Comput. 35(2), 239–256 (2019)
Deng, W., Hu, J., Zhou, X., Guo, J.: Equidistant prototypes embedding for single sample based face recognition with generic learning and incremental learning. Pattern Recognit. 47(12), 3738–3749 (2014)
Lu, J., Tan, Y.P., Wang, G.: Discriminative multimanifold analysis for face recognition from a single training sample per person. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 39–51 (2012)
Zhu, P., Yang, M., Zhang, L., Lee, I.Y.: Local generic representation for face recognition with single sample per person. In: Asian Conference on Computer Vision, pp. 34–50. Springer, Cham (2014)
Abdellatef, E., Ismail, N.A., Abd Elrahman, S.E.S.E., et al.: Cancelable multi-biometric recognition system based on deep learning. Vis. Comput. (2019). https://doi.org/10.1007/s00371-019-01715-5
Agrawal, A., Mittal, N.: Using CNN for facial expression recognition: a study of the effects of kernel size and number of filters on accuracy. Vis. Comput. (2019). https://doi.org/10.1007/s00371-019-01630-9
Gao, G., Yang, J., Jing, X.-Y., et al.: Learning robust and discriminative low-rank representations for face recognition with occlusion. Pattern Recognit. 66, 129–143 (2017)
Gao, G., Yu, Y., Meng, Y., et al.: Cross-resolution face recognition with pose variations via multilayer locality-constrained structural orthogonal procrustes regression. Inf. Sci. 506, 19–36 (2020)
Zhang, K., Zhang, Z., Li, Z., Qiao, Y.: Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process. Lett. 23(10), 1499–1503 (2016)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2014). arXiv preprint arXiv:1412.6980
Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch (2014). arXiv preprint arXiv:1411.7923
Martinez, A.M.: The AR face database. CVC Technical Report 24 (1998)
Lee, K.C., Ho, J., Kriegman, D.J.: Acquiring linear subspaces for face recognition under variable lighting. IEEE Trans. Pattern Anal. Mach. Intell. 5, 684–698 (2005)
Gao, S., Jia, K., Zhuang, L., Ma, Y.: Neither global nor local: regularized patch-based representation for single sample per person face recognition. Int. J. Comput. Vis. 111(3), 365–383 (2015)
Ji, H.K., Sun, Q.S., Ji, Z.X., Yuan, Y.H., Zhang, G.Q.: Collaborative probabilistic labels for face recognition from single sample per person. Pattern Recognit. 62(C), 125–134 (2017)
Shang, K., Huang, Z.-H., Liu, W., Li, Z.M.: A single gallery-based face recognition using extended joint sparse representation. Appl. Math. Comput. 320, 99–115 (2018)
Pei, T., Zhang, L., Wang, B., Li, F., Zhang, Z.: Decision pyramid classifier for face recognition under complex variations using single sample per person. Pattern Recognit. 64(C), 305–313 (2016)
Acknowledgements
Part of this research was carried out at Key Laboratory of Measurement and Control of Complex Systems of Engineering, Nanjing, China. Acknowledgements. This work was supported by the Natural National Science Foundation of China (Grant Nos. 51475092, 61462072) and Natural Science Foundation of Jiangsu Province of China (Grant Nos. BK20181269, BK20160693). This project was funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions, the Fundamental Research Funds for the Central Universities.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Du, Q., Da, F. Block dictionary learning-driven convolutional neural networks for fewshot face recognition. Vis Comput 37, 663–672 (2021). https://doi.org/10.1007/s00371-020-01802-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-020-01802-y