default search action
International Journal of Speech Technology, Volume 20
Volume 20, Number 1, March 2017
- M. K. Prasanna Kumar, R. Kumaraswamy:
An unsupervised approach for co-channel speech separation using Hilbert-Huang transform and Fuzzy C-Means clustering. 1-13 - Arijul Haque, Krothapalli Sreenivasa Rao:
Modification of energy spectra, epoch parameters and prosody for emotion conversion in speech. 15-25 - Na Yang, Jianbo Yuan, Yun Zhou, Ilker Demirkol, Zhiyao Duan, Wendi B. Heinzelman, Melissa Sturge-Apple:
Enhanced multiclass SVM with thresholding fusion for speech-based emotion classification. 27-41 - S. Jothilakshmi, J. Sangeetha, R. Brindha:
Speech based automatic personality perception using spectral features. 43-50 - Salwa M. Serag Eldin:
Voice recognition package for ERTU's cloud. 51-67 - Jan Holub, Hakob Avetisyan, Scott Isabelle:
Subjective speech quality measurement repeatability: comparison of laboratory test results. 69-74 - Talbi Mourad:
Speech enhancement based on stationary bionic wavelet transform and maximum a posterior estimator of magnitude-squared spectrum. 75-88 - Nasir Saleem:
Single channel noise reduction system in low SNR. 89-98 - Mohammad Soleymanpour, Hossein Marvi:
Text-independent speaker identification based on selection of the most similar feature vectors. 99-108 - M. K. Prasanna Kumar, R. Kumaraswamy:
Single-channel speech separation using empirical mode decomposition and multi pitch information with estimation of number of speakers. 109-125 - K. Ramesh, S. R. M. Prasanna:
Glottal opening instants detection using zero frequency resonator. 127-141 - Abdelilah Jilbab, Achraf Benba, Ahmed Hammouch:
Quantification system of Parkinson's disease. 143-150 - Prasad Nizampatnam, Kishore Kumar Tappeta:
Bandwidth extension of telephone speech using magnitude spectrum data hiding. 151-162 - Emad Mossa:
Security enhancement for AES encrypted speech in communications. 163-169 - Yanhua Long, Yijie Li, Hone Ye, Hongwei Mao:
Domain adaptation of lattice-free MMI based TDNN models for speech recognition. 171-178 - Elmehdi Benmalek, Jamal Elmhamdi, Abdelilah Jilbab:
Multiclass classification of Parkinson's disease using different classifiers and LLBFS feature selection algorithm. 179-184 - Rajeev Rajan, Manaswi Misra, Hema A. Murthy:
Melody extraction from music using modified group delay functions. 185-204
Volume 20, Number 2, June 2017
- Azzedine Touazi, Mohamed Debyeche:
An experimental framework for Arabic digits speech recognition in noisy environments. 205-224 - Jihen Zeremdini, Mohamed Anouar Ben Messaoud, Aïcha Bouzid:
Multi-pitch estimation based on multi-scale product analysis, improved comb filter and dynamic programming. 225-237 - Fatemeh Noroozi, Tomasz Sapinski, Dorota Kaminska, Gholamreza Anbarjafari:
Vocal-based emotion recognition using random forests and decision tree. 239-246 - Ahilan Kanagasundaram, David Dean, Sridha Sridharan, Houman Ghaemmaghami, Clinton Fookes:
A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems. 247-259 - Mohammad A. M. Abushariah:
TAMEEM V1.0: speakers and text independent Arabic automatic continuous speech recognizer. 261-280 - Achraf Benba, Abdelilah Jilbab, Ahmed Hammouch:
Detecting multiple system atrophy, Parkinson and other neurological disorders using voice analysis. 281-288 - Ali Benabdallah, Mohammed Alaeddine Abderrahim, Mohammed El Amine Abderrahim:
Extraction of terms and semantic relationships from Arabic texts for automatic construction of an ontology. 289-296 - Yogesh Kumar, Navdeep Singh:
An automatic speech recognition system for spontaneous Punjabi speech corpus. 297-303 - Shambhu Nath Saha, Shyamal Kr. Das Mandal:
Discourse prosody planning in native (L1) and nonnative (L2) (L1-Bengali) English: a comparative study. 305-326 - D. Pravena, D. Govind:
Development of simulated emotion speech database for excitation source analysis. 327-338 - Wided Bakari, Patrice Bellot, Mahmoud Neji:
A logical representation of Arabic questions toward automatic passage extraction from the Web. 339-353 - Michael Osigbemeh, Cletus Ohaneme, Hyacinth Inyiama:
An algorithm for characterizing pre-fuzzified linguistic nuance using artificial neural network. 355-362 - Hamza Meguehout, Tahar Bouhadada, Mohamed Tayeb Laskri:
Semantic role labeling for Arabic language using case-based reasoning approach. 363-372 - Ravikumar Kandagatla, Venkata Subbaiah Potluri:
Speech enhancement using MMSE estimation under phase uncertainty. 373-385 - Sourjya Sarkar, K. Sreenivasa Rao:
Supervector-based approaches in a discriminative framework for speaker verification in noisy environments. 387-416 - Nassim Asbai, Abderrahmane Amrouche:
A novel scores fusion approach applied on speaker verification under noisy environments. 417-429
Volume 20, Number 3, September 2017
- Salwa M. Serag Eldin:
Encrypted gray image transmission over OFDM channel for TV cloud computing. 431-442 - V. Sunnydayal, N. Siva Prasad, S. Ravishankar, Sudeep Surendran, N. K. Ragesh:
Sparse NMF based speech enhancement with bases update. 443-454 - Yeh Huann Goh, Yann-Ling Goh, Yoon-Ket Lee, Ying-Hao Ko:
Robust speech recognition system using multi-parameter bidirectional Kalman filter. 455-463 - B. Bharathi:
Speaker-specific-text based speaker verification system using spectral and phase based features. 465-474 - Flavio J. Reyes Díaz, Gabriel Hernández Sierra, José Ramón Calvo de Lara:
Two-space variability compensation technique for speaker verification in short length and reverberant environments. 475-485 - Vo Ngoc Phu, Vo Thi Ngoc Chau, Vo Thi Ngoc Tran:
SVM for English semantic classification in parallel environment. 487-508 - Vo Ngoc Phu, Vo Thi Ngoc Chau, Vo Thi Ngoc Tran:
Shifting semantic values of English phrases for classification. 509-533 - Roshahliza M. Ramli, Ali O. Abid Noor, Salina Abdul Samad:
Noise cancellation using selectable adaptive algorithm for speech in variable noise environment. 535-542 - Sumanlata Gautam, Latika Singh:
Development of spectro-temporal features of speech in children. 543-551 - Zeinab Farhoudi, Saeed Setayeshi, Azam Rabiee:
Using learning automata in brain emotional learning for speech emotion recognition. 553-562 - Hamza Frihia, Halima Bahi:
HMM/SVM segmentation and labelling of Arabic speech for speech recognition applications. 563-573 - Dhekra Najar, Slim Mesfar:
Opinion mining and sentiment analysis for Arabic on-line texts: application on the political domain. 575-585 - Abolghasem Sayadian, Fatemeh Mozaffari:
A novel method for voice conversion based on non-parallel corpus. 587-592 - Vo Ngoc Phu, Vo Thi Ngoc Tran, Vo Thi Ngoc Chau, Nguyen Duy Dat, Khanh Ly Doan Duy:
A decision tree using ID3 algorithm for English semantic analysis. 593-613 - Nikunj Tahilramani, Ninad Bhatt:
Proposed modifications in ITU-T G.729 8 kbps CS-ACELP speech codec and its overall performance analysis. 615-628 - Asmaa Etman, A. A. (Louis) Beex:
The effect of pitch tracking on automatic dialect identification. 629-634 - Thimmaraja Yadava G., Haradagere Siddaramaiah Jayanna:
A spoken query system for the agricultural commodity prices and weather information access in Kannada language. 635-644 - Zied Sakka, Elhem Techini, Mohamed Salim Bouhlel:
Using geometric spectral subtraction approach for feature extraction for DSR front-end Arabic system. 645-650 - Linhui Sun, Min Su, Zhenzhen Yang:
An adaptive speech endpoint detection method in low SNR environments. 651-658 - Randhir Singh, Ajay Kumar, Parveen Lehana:
Effect of bandwidth modifications on the quality of speech imitated by Alexandrine and Indian Ringneck parrots. 659-672 - Achraf Benba, Abdelilah Jilbab, Ahmed Hammouch:
Voice assessments for detecting patients with neurological diseases using PCA and NPCA. 673-683 - Aïssa Belmeguenaï, Zahir Ahmida, Salim Ouchtati, Rafik Djemili:
A novel approach based on stream cipher for selective speech encryption. 685-698 - Barbara Schuppler:
Rethinking classification results based on read speech, or: why improvements do not always transfer to other speaking styles. 699-713 - Fawaz S. Al-Anzi, Dia AbuZeina:
The impact of phonological rules on Arabic speech recognition. 715-723 - Fatma-Zohra Chelali, Amar Djeradi:
Text dependant speaker recognition using MFCC, LPC and DWT. 725-740 - Jafar Ramadhan Mohammed:
Development of two-input adaptive noise canceller for wideband and narrowband noise signals. 741-751
Volume 20, Number 4, December 2017
- Yan Zhang, Yanhua Long, Xiangrong Shen, Haoran Wei, Min Yang, Hong Ye, Hongwei Mao:
Articulatory movement features for short-duration text-dependent speaker verification. 753-759 - Virender Kadyan, Archana Mantri, R. K. Aggarwal:
A heterogeneous speech feature vectors generation approach with hybrid hmm classifiers. 761-769 - Hassan Satori, Ouissam Zealouk, Khalid Satori, Fatima El Haoussi:
Voice comparison between smokers and non-smokers using HMM speech recognition system. 771-777 - Buadat Karibayeva, Salima S. Kunanbayeva:
Power distance and verbal index in Kazakh business discourse. 779-785 - D. Pravena, D. Govind:
Significance of incorporating excitation source parameters for improved emotion recognition from speech and electroglottographic signals. 787-797 - Seçkin Uluskan, Abhijeet Sangwan, John H. L. Hansen:
Phoneme class based feature adaptation for mismatch acoustic modeling and recognition of distant noisy speech. 799-811 - Anirban Bhowmick, Mahesh Chandra, Astik Biswas:
Speech enhancement using Teager energy operated ERB-like perceptual wavelet packet decomposition. 813-827 - Ardalan Ghasemzadeh, Elham Esmaeili:
A novel method in audio message encryption based on a mixture of chaos function. 829-837 - Banriskhem K. Khonglah, Ramesh K. Bhukya, S. R. Mahadeva Prasanna:
Processing degraded speech for text dependent speaker verification. 839-850 - Hala Shawky, Mohammed Abd-Elnaby, Mohamed Rihan, Mohamed Abd-Elsalam Nassar, Adel S. El-Fishawy, Fathi E. Abd El-Samie:
Efficient compression and reconstruction of speech signals using compressed sensing. 851-857 - Shakti P. Rath:
Factored front-end CMLLR for joint speaker and environment normalization under DNN-HMM. 859-867 - Karim Tahiry, Badia Mounir, Ilham Mounir, Laila Elmazouzi, Abdelmajid Farchi:
Arabic stop consonants characterisation and classification using the normalized energy in frequency bands. 869-880 - Isah Abdullahi Lawal:
Spoken character classification using abductive network. 881-890 - Mohamed Hesham Farouk:
On the application of quantum clustering on speech data. 891-896 - Agnes Jacob:
Modelling speech emotion recognition using logistic regression and decision trees. 897-905 - Sopon Wiriyarattanakul, Nawapak Eua-anant:
Pitch segmentation of speech signals based on short-time energy waveform. 907-917 - Gábor Kiss, Klára Vicsi:
Mono- and multi-lingual depression prediction based on speech processing. 919-935 - Mohamed O. M. Khelifa, Yahya O. M. ElHadj, Abdellah Yousfi, Mostafa Belkasmi:
Constructing accurate and robust HMM/GMM models for an Arabic speech recognition system. 937-949 - A. R. Elshazly, Mohammad Nasr Esfahani, M. M. Fouad, F. S. Abdel-Samie:
High payload multi-channel dual audio watermarking algorithm based on discrete wavelet transform and singular value decomposition. 951-958 - Soumya Priyadarsini Panda, Ajit Kumar Nayak:
A waveform concatenation technique for text-to-speech synthesis. 959-976 - Naglaa F. Soliman, Zhraa Mostfa, Fathi E. Abd El-Samie, Mahmoud I. Abdalla:
Performance enhancement of speaker identification systems using speech encryption and cancelable features. 977-1004 - Shashidhar G. Koolagudi, Akash Bharadwaj, Vishnu Srinivasa Murthy Yarlagadda, Nishaanth H. Reddy, Priya Rao:
Dravidian language classification from speech signal using spectral and prosodic features. 1005-1016 - Yu Zhang:
Research on English machine translation system based on the internet. 1017-1022 - Banriskhem K. Khonglah, S. R. Mahadeva Prasanna:
Clean speech/speech with background music classification using HNGD spectrum. 1023-1036 - M. K. Prasanna Kumar, R. Kumaraswamy:
Single-channel speech separation using combined EMD and speech-specific information. 1037-1047 - Shima Tabibian:
A voice command detection system for aerospace applications. 1049-1061 - Shabnam Ghaffarzadegan, Hynek Boril, John H. L. Hansen:
Deep neural network training for whispered speech recognition using small databases and generative model sampling. 1063-1075
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.