Abstract
Natural language classification and determination based on a particular content and dataset is carried out using Spoken Language Identification (LID) which typically involves the extraction of valuable elements in a mature data processing procedure whereby the regular LID features had been developed using the Mel Frequency Cepstral Coefficient (MFCC), Shifted Delta Coefficient (SDC), Gaussian Mixture Model (GMM) and an i-vector framework. However, there remains a need for optimization in terms of the learning process so as to allow for all the knowledge embedded in the extracted features to be captured completely. A powerful machine learning algorithm known as Extreme Learning Machine (ELM) is used for conducting regression and classification and can train single hidden layer neural networks effectively. Yet, ELM’s learning process remains under-optimized owing to the entrenched random weights selection in the input hidden layer. Based on the standard feature extraction, this current study chooses ELM as the learning model for LID. An optimized method known as the Enhanced Self-Adjusting-ELM (ESA-ELM) has been chosen as a benchmark with enhancements via the adoption of an alternate optimization approach i.e., Grey Wolf Optimisation (GWO) rather than Enhanced Ameliorated Teaching Learning-Based Optimization (EATLBO) to ensure higher performance. Ultimately, this enhanced version of the ESA-ELM is referred to as a Grey Wolf Optimisation-Extreme Learning Machine (GWO-ELM). The results generation is carried out based on LID using the exact benchmark dataset that was derived from eight separate languages. The results indicated that the GWO-ELM LID has a much superior performance than the ESA-ELM LID with respective accuracies of 100.00% for the former and merely 96.25% for the latter.
Similar content being viewed by others
References
Albadr MAA, Tiun S (2020) Spoken language identification based on particle swarm optimisation–extreme learning machine approach. Circuits Syst Signal Process 39(9):4596–4622
Albadr MAA, Tiun S, al-Dhief FT, Sammour MAM (2018) Spoken language identification based on the enhanced self-adjusting extreme learning machine approach. PLoS One 13(4):e0194770
Albadr MAA, Tiun S, Ayob M, al-Dhief FT (2019) Spoken language identification based on optimised genetic algorithm–extreme learning machine approach. Int J Speech Technol 22(3):711–727
Albadr MAA, Tiun S, Ayob M, al-Dhief FT, Omar K, Hamzah FA (2020) Optimised genetic algorithm-extreme learning machine approach for automatic COVID-19 detection. PLoS One 15(12):e0242899
Albadr MA, Tiun S, Ayob M, al-Dhief F (2020) Genetic algorithm based on natural selection theory for optimization problems. Symmetry 12(11):1758
Albadr MAA et al (2021) Extreme learning machine for automatic language identification utilizing emotion speech data. In: 2021 international conference on electrical, communication, and computer engineering (ICECCE). IEEE
Albadr MAA, Tiun S, Ayob M, Mohammed M, al-Dhief FT (2021) Mel-frequency cepstral coefficient features based on standard deviation and principal component analysis for language identification systems. Cogn Comput 13(5):1136–1153
Albadr MA et al (2022. In Press) Speech emotion recognition using optimized genetic algorithm-extreme learning machine. Multimed Tools Appl 81:23963–23989
Albadra MAA, Tiuna S (2017) Extreme learning machine: a review. Int J Appl Eng Res 12(14):4610–4623
AL-Dhief FT et al (2020) Voice pathology detection using machine learning technique. In: 2020 IEEE 5th international symposium on telecommunication technologies (ISTT). IEEE
Al-Dhief FT et al (2020) A survey of voice pathology surveillance systems based on internet of things and machine learning algorithms. IEEE Access 8:64514–64533
Al-Dhief FT et al (2021) Voice pathology detection and classification by adopting online sequential extreme learning machine. IEEE Access 9:77293–77306
AL-Dhief FT et al (2021) Voice pathology detection using support vector machine based on different number of voice signals. In: 2021 26th IEEE Asia-Pacific conference on communications (APCC). IEEE, pp 1–6
Alexander V, Annamalai P (2016) An Elitist Genetic Algorithm Based Extreme Learning Machine. In: Computational Intelligence, Cyber Security and Computational Models. Springer, pp 301–309
Ambikairajah E, Li H, Wang L, Yin B, Sethu V (2011) Language identification: a tutorial. IEEE Circuits Syst Mag 11(2):82–108
Ben-Reuven E, Goldberger J (2016) A semisupervised approach for language identification based on ladder networks. arXiv preprint arXiv:1604.00317
Deng C, Huang GB, Xu J, Tang JX (2015) Extreme learning machines: new trends and applications. Sci China Inf Sci 58(2):1–16
Faris H, Aljarah I, al-Betar MA, Mirjalili S (2018) Grey wolf optimizer: a review of recent variants and applications. Neural Comput & Applic 30(2):413–435
Faris H, Mirjalili S, Aljarah I (2019) Automatic selection of hidden neurons and weights in neural networks using grey wolf optimizer based on a hybrid encoding scheme. Int J Mach Learn Cybern 10(10):2901–2920
Ganapathy S et al (2014) Robust language identification using convolutional neural network features. In: INTERSPEECH
Gao R et al (2019) Extreme learning machine ensemble for CSI based device-free indoor localization. In: 2019 28th wireless and optical communications conference (WOCC). IEEE
Garg A, Gupta V, Jindal M (2014) A survey of language identification techniques and applications. J Emerg Technol Web Intell 6(4):388–400
Gazeau V, Varol C (2018) Automatic spoken language recognition with neural networks. Int J Inf Technol Comput Sci (IJITCS) 10(8):11–17
Hafen RP, Henry MJ (2012) Speech information retrieval: a review. Multimed Syst 18(6):499–518
Han K, Yu D, Tashev I (2014) Speech emotion recognition using deep neural network and extreme learning machine. In: Fifteenth Annual Conference of the International Speech Communication Association
Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1–3):489–501
Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501
Huang G-B, Chen L, Siew CK (2006) Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Trans Neural Netw 17(4):879–892
Huang G-B et al (2011) Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybern Part B Cybern 42(2):513–529
Jiang B, Song Y, Wei S, Liu JH, McLoughlin IV, Dai LR (2014) Deep bottleneck features for spoken language identification. PLoS One 9(7):e100795
Kaya H, Karpov AA (2018) Efficient and effective strategies for cross-corpus acoustic emotion recognition. Neurocomputing 275:1028–1034
Lan Y, Hu Z, Soh YC, Huang GB (2013) An extreme learning machine approach for speaker recognition. Neural Comput & Applic 22(3–4):417–425
Lee KA et al (2016) The 2015 NIST language recognition evaluation: the shared view of I2R, Fantastic4 and SingaMS
Li J et al (2015) LSTM time and frequency recurrence for automatic speech recognition. In: Automatic speech recognition and understanding (ASRU), 2015 IEEE workshop on. IEEE
Liang N-Y et al (2006) A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans Neural Netw 17(6):1411–1423
Lopez-Moreno I, Gonzalez-Dominguez J, Martinez D, Plchot O, Gonzalez-Rodriguez J, Moreno PJ (2016) On the use of deep feedforward neural networks for automatic language identification. Comput Speech Lang 40:46–59
Malik H, Roy N (2019) Extreme Learning Machine-Based Image Classification Model Using Handwritten Digit Database. In: Applications of Artificial Intelligence Techniques in Engineering. Springer, pp 607–618
Minhas R, Baradarani A, Seifzadeh S, Jonathan Wu QM (2010) Human action recognition using extreme learning machine based on visual vocabularies. Neurocomputing 73(10):1906–1917
Mirjalili S, Mirjalili SM, Lewis A (2014) Grey wolf optimizer. Adv Eng Softw 69:46–61
Mohammed AA, Minhas R, Jonathan Wu QM, Sid-Ahmed MA (2011) Human face recognition based on multidimensional PCA and extreme learning machine. Pattern Recogn 44(10):2588–2597
Muthusamy H, Polat K, Yaacob S (2015) Improved emotion recognition using Gaussian mixture model and extreme learning machine in speech and glottal signals. Math Probl Eng 2015:1–13
Nayak P et al (2016) Comparison of modified teaching–learning-based optimization and extreme learning machine for classification of multiple power signal disturbances. Neural Comput & Applic 27(7):2107–2122
Naz A, Javaid N, Javaid S (2018) Enhanced recurrent extreme learning machine using gray wolf optimization for load forecasting. In: 2018 IEEE 21st international multi-topic conference (INMIC). IEEE
Niu P, Ma Y, Li M, Yan S, Li G (2016) A kind of parameters self-adjusting extreme learning machine. Neural Process Lett 44(3):813–830
Pal M, Maxwell AE, Warner TA (2013) Kernel-based extreme learning machine for remote-sensing image classification. Remote Sens Lett 4(9):853–862
Peng Y, Wang S, Long X, Lu BL (2015) Discriminative graph regularized extreme learning machine and its application to face recognition. Neurocomputing 149:340–353
Singh G, Sharma S, Kumar V, Kaur M, Baz M, Masud M (2021) Spoken language identification using deep learning. Comput Intell Neurosci 2021:1–12
Sokolova M, Japkowicz N, Szpakowicz S (2006) Beyond accuracy, F-score and ROC: a family of discriminant measures for performance evaluation. In: Australasian joint conference on artificial intelligence. Springer
van Heeswijk M (2015) Advances in extreme learning machines
Wang Y, Cao F, Yuan Y (2011) A study on effectiveness of extreme learning machine. Neurocomputing 74(16):2483–2490
Wang M, Chen H, Li H, Cai Z, Zhao X, Tong C, Li J, Xu X (2017) Grey wolf optimization evolving kernel extreme learning machine: application to bankruptcy prediction. Eng Appl Artif Intell 63:54–68
Wang Z et al (2019) Breast Cancer detection using extreme learning machine based on feature fusion with CNN deep features. IEEE Access
Wang W, Song W, Chen C, Zhang Z, Xin Y (2019) I-vector features and deep neural network modeling for language recognition. Procedia Comput Sci 147:36–43
Xu J et al (2015) Regularized minimum class variance extreme learning machine for language recognition. EURASIP J Audio Speech Music Process 2015(1):22
Yang Z, Zhang T, Zhang D (2016) A novel algorithm with differential evolution and coral reef optimization for extreme learning machine training. Cogn Neurodyn 10(1):73–83
Zazo R, Lozano-Diez A, Gonzalez-Dominguez J, T. Toledano D, Gonzalez-Rodriguez J (2016) Language identification in short utterances using long short-term memory (LSTM) recurrent neural networks. PLoS One 11(1):e0146917
Zhou Z, Wang C, Zhu Z, Wang Y, Yang D (2019) Sliding mode control based on a hybrid grey-wolf-optimized extreme learning machine for robot manipulators. Optik 185:364–380
Acknowledgments
This project was funded by the Universiti Kebangsaan Malaysia under Dana Impak Perdana grant (Research code: GUP-2020-063).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Albadr, M.A.A., Tiun, S., Ayob, M. et al. Grey wolf optimization-extreme learning machine for automatic spoken language identification. Multimed Tools Appl 82, 27165–27191 (2023). https://doi.org/10.1007/s11042-023-14473-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14473-3