Abstract
Automated lip reading from videos requires lip segmentation. Threshold-based segmentation is straightforward, but it is rarely used. This study proposes a histogram threshold based on the feedback of shape information. Both good and bad lip segmentation examples were used to train an \(\epsilon \)-support vector regression model to infer the segmentation accuracy from the region shape. The histogram threshold was optimised to minimise the segmentation error. The proposed method was tested on 895 images from 112 subjects using the AR Face Database. The proposed method, implemented in simple segmentation algorithms, reduced segmentation errors by 23.1%.
Similar content being viewed by others
References
Zhang, X., Cheng, F., Shilin, W.: Spatio-temporal fusion based convolutional sequence learning for lip reading. In: Proceedings IEEE/CVF International Conference on Computer Vision (ICCV), pp. 713–722 (2019)
Guan, C., Wang, S., Liu, G., Liew, A.W.: Lip image segmentation in mobile devices based on alternative knowledge distillation. In: Proceedings 2019 IEEE International Conference on Image Processing (ICIP), pp. 1540–1544 (2019)
Guan, C., Wang, S., Liew, A.W.: Lip image segmentation based on a fuzzy convolutional neural network. IEEE Trans. Fuzzy Syst. 28(7), 1242–1251 (2020)
Nascimento, J.C., Carneiro, G.: One shot segmentation: unifying rigid detection and non-rigid segmentation using elastic regularization. IEEE Trans. Pattern Anal. Mach. Intell. 42, 3054–3070 (2020)
Spyridonos, P., Saint, A.F., Likas, A., Gaitanis, G., Bassukas, I.: Multi-threshold lip contour detection. In: Proceedings 25th IEEE International Conference on Image Processing (ICIP), pp. 1912–1916 (2018)
Cheung, Y.M., Li, M., Peng, Q., Chen, C.P.: A cooperative learning-based clustering approach to lip segmentation without knowing segment number. IEEE Trans. Neural Netw. Learn. Syst. 28(1), 80–93 (2017)
Zheng, Z., Jiong, J., Chunjiang, D., Liu, X., Yang, J.: Facial feature localization based on an improved active shape model. Inf. Sci. 178(9), 2215–2223 (2008)
Wang, S.L., Liew, A.W.C., Lau, W.H., Leung, S.H.: An automatic lipreading system for spoken digits with limited training data. IEEE Trans. Circuits Syst. Video Technol. 18(12), 1760–1765 (2008)
Saeed, U., Dugelay, J.L.: Combining edge detection and region segmentation for lip contour extraction. In: International Conference on Articulated Motion and Deformable Objects, pp. 11–20 (2010)
Gritzman, A.D., Aharonson, V., Rubin, D.M., Pantanowitz, A.: Automatic computation of histogram threshold for lip segmentation using feedback of shape information. SIViP 10(5), 869–876 (2016)
Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Stat. Comput. 14(3), 199–222 (2004)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 1–27 (2011)
Hsu, C.W., Chang, C.C., Lin, C.J.: A practical guide to support vector classification. Technical report, Department of Computer Science, National Taiwan University (2003)
Lewis, R.M., Torczon, V., Trosset, M.W.: Direct search methods: then and now. J. Comput. Appl. Math. 124(1–2), 191–207 (2000)
Otsu, N.: A threshold selection method from gray-level histograms. Automatica 11(285–296), 23–27 (1975)
Martinez, A.M.: The AR face database. CVC Technical Report, vol. 24 (1998)
Ding, L., Martinez, A.M.: Features versus context: an approach for precise and detailed detection and delineation of faces and facial features. IEEE Trans. Pattern Anal. Mach. Intell. 32(11), 2022–2038 (2010)
Gritzman, A.D.: Adaptive threshold optimisation for colour-based lip segmentation in automatic lip-reading systems. University of the Witwatersrand, Johannesburg, Ph.D. thesis (2016)
Acknowledgements
This study has been supported by the National Research Foundation of South Africa Grant Numbers 97742 and 127102. It has been based on a Doctoral project [18].
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gritzman, A.D., Postema, M., Rubin, D.M. et al. Threshold-based outer lip segmentation using support vector regression. SIViP 15, 1197–1202 (2021). https://doi.org/10.1007/s11760-020-01849-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-020-01849-3