Nothing Special   »   [go: up one dir, main page]

Skip to main content
Log in

Block dictionary learning-driven convolutional neural networks for fewshot face recognition

  • Original Article
  • Published:
The Visual Computer Aims and scope Submit manuscript

Abstract

Fewshot face recognition (FFR) in less constrained environment is an important but challenging task due to the lack of sufficient sample information and the impact of occlusion. In this paper, a novel approach called block dictionary learning (BDL) is proposed, which combines sparse representation with convolutional neural networks to address the FFR problem. Based on the key-point locations of face images, the images are divided into four block regions for local feature extraction. Then, highly compact and discriminative features of both holistic and segmented parts are generated by CNN, which further compensates for the shortage of samples. Moreover, the sparse loss is introduced to optimize the performance of CNN by increasing the inter-class variations of features; thus, it develops a global-to-local dictionary learning algorithm to improve the robustness of BDL against complex variations. Finally, extensive experiments on AR and Extended Yale B datasets significantly demonstrate the effectiveness of BDL in comparison with other FFR methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Jafri, R., Arabnia, H.R.: A survey of face recognition techniques. J. Inf. Process. Syst. 5(2), 41–68 (2009)

    Article  Google Scholar 

  2. Jolliffe, I.: Principal Component Analysis, pp. 1094–1096. Springer, Berlin (2011)

    Google Scholar 

  3. Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2008)

    Article  Google Scholar 

  4. Zhang, L., Yang, M., Feng, X.: Sparse representation or collaborative representation: Which helps face recognition?. In: 2011 International Conference on Computer Vision, pp. 471–478. IEEE (2011)

  5. Kan, M., Shan, S., Su, Y., Chen, X., Gao, W.: Adaptive discriminant analysis for face recognition from single sample per person. In: Face and Gesture 2011, pp. 193–199. IEEE (2011)

  6. Deng, W., Hu, J., Guo, J.: Extended SRC: undersampled face recognition via intraclass variant dictionary. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1864–1870 (2012)

    Article  Google Scholar 

  7. Ding, R.X., Du, D.K., Huang, Z.H., Li, Z.M., Shang, K.: Variational feature representation-based classification for face recognition with single sample per person. J. Vis. Commun. Image Represent. 30, 35–45 (2015)

    Article  Google Scholar 

  8. Yang, M., Van Gool, L., Zhang, L.: Sparse variation dictionary learning for face recognition with a single training sample per person. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 689–696 (2013)

  9. Gao, Q.X., Zhang, L., Zhang, D.: Face recognition using FLDA with single training image per person. Appl. Math. Comput. 205(2), 726–734 (2008)

    MATH  Google Scholar 

  10. Liu, F., Tang, J., Song, Y., Bi, Y., Yang, S.: Local structure based multi-phase collaborative representation for face recognition with single sample per person. Inf. Sci. 346, 198–215 (2016)

    Google Scholar 

  11. Hu, C., Ye, M., Ji, S., Zeng, W., Lu, X.: A new face recognition method based on image decomposition for single sample per person problem. Neurocomputing 160, 287–299 (2015)

    Article  Google Scholar 

  12. He, L., Li, H., Zhang, Q., Sun, Z.: Dynamic feature matching for partial face recognition. IEEE Trans. Image Process. 28(2), 791–802 (2018)

    Article  MathSciNet  Google Scholar 

  13. Wright, J., Ma, Y.: Dense error correction via l1 minimization. IEEE Trans. Inf. Theory 56(7), 3540–3560 (2010)

    Article  Google Scholar 

  14. Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE PAMI 31(2), 210–227 (2009)

    Article  Google Scholar 

  15. Yang, M., Hang, L.: Gabor feature based sparse representation for face recognition with Gabor occlusion dictionary. In: ECCV (2010)

  16. Gao, S.H., Tsang, I.W-H., Chia, L.-T.: Kernel sparse representation for image classification and face recognition. In: ECCV (2010)

  17. Yang, M., Zhang, L.: Gabor Feature based sparse representation for face recognition with Gabor occlusion dictionary. In: ECCV (2010)

  18. Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR (2009)

  19. Sun, Y., Chen, Y., Wang, X., Tang, X.: Deep learning face representation by joint identification-verification. In: NIPS (2014)

  20. Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: DeepFace: closing the gap to human-level performance in face verification. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)

  21. Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch. CoRR, vol. abs/1411.7923 (2014)

  22. Schroff, F., Kalenichenko, D., Philbin, J.F.: A unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015)

  23. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

  24. Chu, Y., Zhao, L., Ahma, T.: Multiple feature subspaces analysis for single sample per person face recognition. Vis. Comput. 35(2), 239–256 (2019)

    Article  Google Scholar 

  25. Deng, W., Hu, J., Zhou, X., Guo, J.: Equidistant prototypes embedding for single sample based face recognition with generic learning and incremental learning. Pattern Recognit. 47(12), 3738–3749 (2014)

    Article  Google Scholar 

  26. Lu, J., Tan, Y.P., Wang, G.: Discriminative multimanifold analysis for face recognition from a single training sample per person. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 39–51 (2012)

    Article  Google Scholar 

  27. Zhu, P., Yang, M., Zhang, L., Lee, I.Y.: Local generic representation for face recognition with single sample per person. In: Asian Conference on Computer Vision, pp. 34–50. Springer, Cham (2014)

  28. Abdellatef, E., Ismail, N.A., Abd Elrahman, S.E.S.E., et al.: Cancelable multi-biometric recognition system based on deep learning. Vis. Comput. (2019). https://doi.org/10.1007/s00371-019-01715-5

  29. Agrawal, A., Mittal, N.: Using CNN for facial expression recognition: a study of the effects of kernel size and number of filters on accuracy. Vis. Comput. (2019). https://doi.org/10.1007/s00371-019-01630-9

  30. Gao, G., Yang, J., Jing, X.-Y., et al.: Learning robust and discriminative low-rank representations for face recognition with occlusion. Pattern Recognit. 66, 129–143 (2017)

    Article  Google Scholar 

  31. Gao, G., Yu, Y., Meng, Y., et al.: Cross-resolution face recognition with pose variations via multilayer locality-constrained structural orthogonal procrustes regression. Inf. Sci. 506, 19–36 (2020)

    Article  MathSciNet  Google Scholar 

  32. Zhang, K., Zhang, Z., Li, Z., Qiao, Y.: Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process. Lett. 23(10), 1499–1503 (2016)

    Article  Google Scholar 

  33. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2014). arXiv preprint arXiv:1412.6980

  34. Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch (2014). arXiv preprint arXiv:1411.7923

  35. Martinez, A.M.: The AR face database. CVC Technical Report 24 (1998)

  36. Lee, K.C., Ho, J., Kriegman, D.J.: Acquiring linear subspaces for face recognition under variable lighting. IEEE Trans. Pattern Anal. Mach. Intell. 5, 684–698 (2005)

    Google Scholar 

  37. Gao, S., Jia, K., Zhuang, L., Ma, Y.: Neither global nor local: regularized patch-based representation for single sample per person face recognition. Int. J. Comput. Vis. 111(3), 365–383 (2015)

    Article  MathSciNet  Google Scholar 

  38. Ji, H.K., Sun, Q.S., Ji, Z.X., Yuan, Y.H., Zhang, G.Q.: Collaborative probabilistic labels for face recognition from single sample per person. Pattern Recognit. 62(C), 125–134 (2017)

    Article  Google Scholar 

  39. Shang, K., Huang, Z.-H., Liu, W., Li, Z.M.: A single gallery-based face recognition using extended joint sparse representation. Appl. Math. Comput. 320, 99–115 (2018)

    MATH  Google Scholar 

  40. Pei, T., Zhang, L., Wang, B., Li, F., Zhang, Z.: Decision pyramid classifier for face recognition under complex variations using single sample per person. Pattern Recognit. 64(C), 305–313 (2016)

    Google Scholar 

Download references

Acknowledgements

Part of this research was carried out at Key Laboratory of Measurement and Control of Complex Systems of Engineering, Nanjing, China. Acknowledgements. This work was supported by the Natural National Science Foundation of China (Grant Nos. 51475092, 61462072) and Natural Science Foundation of Jiangsu Province of China (Grant Nos. BK20181269, BK20160693). This project was funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions, the Fundamental Research Funds for the Central Universities.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qiao Du.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Du, Q., Da, F. Block dictionary learning-driven convolutional neural networks for fewshot face recognition. Vis Comput 37, 663–672 (2021). https://doi.org/10.1007/s00371-020-01802-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00371-020-01802-y

Keywords

Navigation