Abstract
Training data are critical in face recognition systems. Labeling a large scale dataset for a particular domain needs lots of manpower. Without dataset related to current face recognition domain, we can’t get a strong face recognition model with existing public datasets. In this paper, we propose a semi-supervised method to automatically construct strong dataset which can be trained to achieve better performance on the target domain from massive weakly labeled data. In the case of Asian face recognition, a well trained VRCN model by CASIA, which achieves 98.63% on LFW and 91.76% on YTF, only achieves 88.53% recognition rate on our test dataset of Asian faces. We collect 530,560 weakly labeled Asian face images of 7962 identities, and get a cleaned dataset of size 285,933. Model trained by the cleaned dataset with VRCN network and same strategy achieves 95.33% recognition rate on the Asian face test dataset (6.8% improved).
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
Our network used in this paper is inspired by VGG-Net, ResNet and center loss.
References
Zhao W, Chellappa R, Phillips PJ, Rosenfeld A (2003) Face recognition: a literature survey. ACM Comput Surv (CSUR) 25(4):399–458
Taigman Y, Yang M, Ranzato M, Wolf L (2014) DeepFace: closing the gap to human-level performance in face verification. In: IEEE conference on computer vision and pattern recognition, pp 1701–1708
Sun Y, Wang X, Tang X (2014) Deep learning face representation from predicting 10,000 classes, In: IEEE conference on computer vision and pattern recognition, pp 1891–1898
Sun Y, Chen Y, Wang X, Yang X (2014) Deep learning face representation by joint identification-verification. In: Advances in neural information processing systems 27, pp 1988–1996
Sun Y, Liang D, Wang X, Tang X (2015) Deepid3: Face recognition with very deep neural networks. arXiv preprint arXiv: 1502.00873
Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 815–813
Yi D, Lei Z, Liao S, Li SZ (2014) Learning face representation from scratch. arXiv preprint arXiv: 1441.7923
Huang G. B, Ramesh M, Berg T, Learned-Miller E (2007) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical report Technical Report 07-49 University of Massachusetts Amherst
Wolf L, Hassner T, Maoz I (2011) Face recognition in unconstrained videos with matched background similarity. In: IEEE conference on computer vision and pattern recognition, vol 42, pp 529–534
Sun Y, Wang X, Tang X (2013) Hybrid deep learning for face verification. In Proceedings of the IEEE international conference on computer vision, pp 1489–1496
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778
He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, pp 1026–1034
Sim T, Baker S, Bsat M (2002) The CMU pose illumination and expression (PIE) database. In: Automatic face and gesture recognition proceedings. Fifth IEEE international conference, pp 53–58
Phillips PJ, Moon H, Rizvi SA, Rauss PJ (2000) The FERET evaluation methodology for face-recognition algorithms. IEEE Trans Pattern Anal Mach Intell 22(10):1090–1104
Chen BC, Chen CS, Hsu WH (2014) Cross-age reference coding for age-invariant face recognition and retrieval. In: European conference on computer vision, pp 768–783
Chen D, Cao X, Wang L, Wen F, Sun J (2012) Bayesian face revisited: a joint formulation. In: European conference on computer vision, vol 7574, pp 566–579
Guo Y, Zhang L, Hu Y, He X, Gao J (2016) Ms-celeb-1m: a dataset and benchmark for large-scale face recognition. arXiv:1607.08221
Turk M, Pentland A (1991) Eigenfaces for recognition. J Cogn Neurosci 3(1):71
Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720
Liu C, Wechsler H (2002) Gabor feature based classification using the enhanced fisher linear discriminant model for face recognition. IEEE Trans Image Process A Publ IEEE Signal Process Soc 11(4):467–476
Li SZ, Chu RF, Liao SC, Zhang L (2007) Illumination invariant face recognition using near-infrared images. IEEE Trans Pattern Anal Mach Intell 29(4):627–639
Hadsell R, Chopra S, Lecun Y (2006) Dimensionality reduction by learning an invariant mapping. In: IEEE computer society conference on computer vision and pattern recognition, vol 2, pp 1735–1742
Ding S, Lin L, Chao H (2015) Deep feature learning with relative distance comparison for person re-identification. Pattern Recognit 48(10):2993–3003
Wen Y, Zhang K, Li Z, Qiao Y (2016) A discriminative feature learning approach for deep face recognition. In: European conference on computer vision, vol 47(9), pp 11–26
Jia Y, Shelhamer E, Donahue J, Karayev S, Long J, Girshick R, Darrell T (2014) Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM international conference on multimedia, pp 675–678
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work is supported by NSF of China under Grants 61672548, U1611461, and the Guangzhou Science and Technology Program, China, under Grant 201510010165.
Rights and permissions
About this article
Cite this article
Xu, W., Wu, J., Ding, S. et al. Enhancing Face Recognition from Massive Weakly Labeled Data of New Domains. Neural Process Lett 49, 939–950 (2019). https://doi.org/10.1007/s11063-018-9839-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-018-9839-z