Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2808719.2812222acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Voice signal features analysis and classification: looking for new diseases related parameters

Published: 09 September 2015 Publication History

Abstract

Dysphonia is an often-underestimated vocal tract problem caused by irregular vocal cords vibration. It has been proved that dysphonia can be considered as a symptom of vocal tract diseases; also, voice anomalies represent neurological or neurodegenerative diseases triggers. Indeed, dysarthric voice patterns are studied as early detection for Parkinson's syndrome. Voice acoustic features can thus be extracted from simple vocalism (or short reading) and considered as diseases indicators. The interest is also improved thank to the possibility of using mobile and/or portable technologies to analyze voice parameters in a background manner. Nevertheless, voice can be considered as a personal signature, thus the identification of a general purpose features identification is still an open issue.
In this work we focus on identifying voice signal features by using a comparative study of different classifiers. The target is to identify frequencies values and noise indexes (or combination of them) to be used as general purpose indicators, aiming to discriminate between healthy and pathological voices. The work is part of a larger project on voice analysis aiming to define a mobile and reliable based system allowing the acquisition and features extraction of vocal signals to be used as healthy monitor.

References

[1]
F. Amato, M. Cannataro, C. Cosentino, A. Garozzo, N. Lombardo, C. Manfredi, F. Montefusco, G. Tradigo, and P. Veltri. Early detection of voice diseases via a web-based system. Biomedical Signal Processing and Control, 4:206--211, 2009.
[2]
R. J. Baken and R. F. Orlikoff. Clinical Measurement of Speech and Voice. Cengage Learning, 2000.
[3]
M. M. Baki, G. Wood, M. Alston, P. Ratcliffe, G. Sandhu, J. S. Rubin, and M. Birchall. Reliability of operavox against multidimensional voice program (MDVP). Clinical Otolaryngology, 40(1):22--28, 2015.
[4]
P. Boersma. Accurate short-term analysis of the fundamental frequency and the harmonics to noise ratio of a sampled sound. Technical report, Institute of Phonetic Sciences, University of Amsterdam, 1993.
[5]
B. Boyanov and S. Hadjitodorov. Acoustic analysis of pathological voices. IEEE Engineering in Medicine and Biology, 16(4):74--82, 1997.
[6]
M. Brockmann, M. J. Drinnan, C. Storck, and P. N. Carding. Reliable jitter and shimmer measurements in voice clinics: The relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task. Journal of Voice, 25(1):44--53, 2011.
[7]
A. Carullo, A. Vallan, and A. Astolfi. A low-cost platform for voice monitoring. In Instrumentation and Measurement Technology Conference (I2MTC), 2013 IEEE International, pages 67--72, 2013.
[8]
P. Cinaglia, M. Macrí, B. Calabrese, P. Vizza, G. Tradigo, and P. Veltri. A system for ubiquitous distributed acquisition of voice alteration samples through a mobile application. In Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, pages 813--818, 2014.
[9]
C. Cortes and V. Vapnik. Support-vector networks. Machine Learning, 20(3):273--297, 1995.
[10]
G. Cybenko. Approximation by superpositions of a sigmoidal function. Mathematics of control, signals and systems, 2(4):303--314, 1989.
[11]
E. S. Fonseca, R. C. Guido, P. R. Scalassara, C. D. Maciel, and J. C. Pereira. Wavelet time-frequency analysis and least squares support vector machines for the identification of voice disorders. Computers in Biology and Medicine, 37(4):571--578, 2007.
[12]
M. J. Handel. mhealth (Mobile Health) - Using apps for health and wellness. EXPLORE: The Journal of Science and Healing, 7(4):256--261, 2011.
[13]
K. Hornik, M. Stinchcombe, and H. White. Multilayer feedforward networks are universal approximators. Neural Networks, 78(7):44--46, 2009.
[14]
B. H. Jacobson, A. Johnson, C. Grywalski, A. Silbergleit, G. Jacobson, M. S. Benninger, and C. W. Newman. The voice handicap index (VHI) development and validation. American Journal of Speech-Language Pathology, 6(3):66--70, 1997.
[15]
J. Jiang, Y. Zhang, J. MacCallum, A. Sprecher, and L. Zhou. Objective acoustic analysis of pathological voices from patients with vocal nodules and polyps. Folia Phoniatr Logop, 61:342--349, 2009.
[16]
Z. N. Karam, E. M. Provost, S. Singh, J. Montgomery, C. Archer, G. Harrington, and Melvi. Ecologically valid long-term mood monitoring of individuals with bipolar disorder using speech. In ICASSP 2014--2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014.
[17]
H. Kasuya, S. Ogawa, K. Mashima, and S. Ebihara. Normalized noise energy as an acoustic measure to evaluate pathologic voice. The Journal of the Acoustical Society of America, 80(5):1329--1334, 1986.
[18]
R. Linder, A. Albers, M. Hess, S. Poppl, and R. Schonweiler. Artificial neural network-based classification to screen for dysphonia using psychoacoustic scaling of acoustic voice features. Journal of Voice, 22(2):155--163, 2008.
[19]
M. A. Little, P. E. McSharry, E. J. Hunter, J. Spielman, and L. O. Ramig. Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease. IEEE Transactions on Biomedical Engineering, 56(4):1015--1022, Apr. 2009.
[20]
C. Manfredi. Adaptive noise energy estimation in pathological speech signals. Biomedical Engineering, IEEE Transactions on, 47(11):1538--1543, 2000.
[21]
C. Manfredi, L. Bocchi, and G. Cantarella. A multipurpose user-friendly tool for voice analysis: application to pathological adult voices. Biomedical Signal Processing and Control, 4(3):212--220, 2009.
[22]
C. Manfredi, T. Bruschi, A. Dallai, A. Ferri, and P. Tortoli. Voice quality monitoring: a portable device prototype. In Proc. of the 30th Annual International IEEE EMBS Conference, pages 997--1000, 2008.
[23]
D. D. Mehta, M. Zanartu, S. W. Feng, H. A. Cheyne, and R. E. Hillman. Mobile voice health monitoring using a wearable accelerometer sensor and a smartphone platform. Biomedical Engineering, 59(11):3090--3096, 2012.
[24]
A. Millar, I. J. Deary, J. A. Wilson, and K. MacKenzie. Is an organic/functional distinction psychologically meaningful in patients with dysphonia? Journal of psychosomatic research, 46(6):497--505, 1999.
[25]
A. Palumbo, P. Veltri, B. Calabrese, P. Vizza, M. Cannataro, A. Garozzo, N. Lombardo, and F. Amato. Experiences of using a dsp-based device for vocal signal analysis. In Proc. of Models and Analysis of Vocal Emissions for Biomedical Applications: 7th International Workshop, pages 187--189, August.
[26]
V. Parsa and D. G. Jamieson. Acoustic discrimination of pathological voice sustained vowels versus continuous speech. Journal of Speech, Language, and Hearing Research, 44(2):327--339, 2001.
[27]
J. Pepper, A. Zhang, R. Li, and X. H. Wang. Usage results of a mobile app for managing urinary incontinence. The Journal of Urology, 193(4):1292--1297, 2015.
[28]
J. C. Platt. Sequential minimal optimization: A fast algorithm for training support vector machines. Advances in Kernel Methods - Support Vector Learning, 1998.
[29]
Y. Qi and R. E. Hillman. Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals. The Journal of the Acoustical Society of America, 102(1):537--543, 1997.
[30]
J. R. Quinlan. Programs for machine learning. Machine Learning, 16(3):235--240, 1994.
[31]
I. Rish. An empirical study of the naive bayes classifier. In IJCAI 2001 workshop on empirical methods in artificial intelligence, 2001.
[32]
R. Ritchings, M. McGillion, and C. Moore. Pathological voice quality assessment using artificial neural networks. Medical Engineering & Physics, 2002.
[33]
C. Rosen, A. Lee, J. Osborne, T. Zullo, and T. Murry. Development and validation of the voice handicap index-10. Laryngoscope, 2004.
[34]
F. Rosenblatt. Principles of neurodynamics. perceptrons and the theory of brain mechanisms. Cornell Aeronautical Lab. Inc., 1961.
[35]
B. Schneider-Stickler, C. Knell, B. Aichstill, and W. Jocher. Biofeedback on voice use in call center agents in order to prevent occupational voice disorders. Journal of Voice, 26(1):51--62, 2012.
[36]
V. Sellam and J. Jagadeesan. Classification of normal and pathological voice using svm and rbfnn. Journal of Signal and Information Processing, (5):1--7, 2014.
[37]
K. Shama, A. Krishna, and N. U. Cholayya. Study of harmonics-to-noise ratio and critical-band energy spectrum of speech as acoustic indicators of laryngeal and voice pathology. EURASIP Journal on Applied Signal Processing, 2007(1):50, 2007.
[38]
S. Skodda. Acoustic analysis of speech as a promising instrument for monitoring and differential diagnosis of parkinson disease. In Proc. of Models and Analysis of Vocal Emissions for Biomedical Applications: 7th International Workshop, pages 177--180, August.
[39]
J. C. Stemple, N. Roy, and B. K. Klaben. Clinical Voice Pathology Theory and Management. Plural Publishing Inc., 2014.
[40]
M. K. Sunwoo, J. Y. Hong, J. E. Lee, H. S. Lee, P. H. Lee, and Y. H. Sohn. Depression and voice handicap in Parkinson disease. Journal of the Neurological Sciences, 346:112--115, 2014.
[41]
J. P. Teixeira and P. O. Fernandesa. Jitter, shimmer and hnr classification within gender, tones and vowels in healthy voices. Procedia Technology, 16:1228--1237, 2014.
[42]
A. Tsanas, M. A. Little, P. E. McSharry, J. Spielman, and L. O. Ramig. Novel speech signal processing algorithms for high-accuracy classification of Parkinson's disease. IEEE Transactions on Biomedical Engineering, 59(5):1264--1271, 2012.
[43]
K. Umapathy, S. Krishnan, V. Parsa, and D. G. Jamieson. Discrimination of pathological voices using a time-frequency approach. IEEE Transactions on Biomedical Engineering, 52(3):421--430, 2005.
[44]
J. Wang and C. Jo. Vocal folds disorder detection using pattern recognition methods. In Proc. of 29th Annual International Conference of the IEEE EMBS, 2007.
[45]
C. R. Watts and M. Vanryckeghem. Laryngeal dysfunction in amyotrophic lateral sclerosis: a review and case report. JBMC Ear, Nose and Throat Disorders, 1(1), 2001.
[46]
W. L. Zuo, Z. Y. Wang, T. Liu, and H. L. Chen. Effective detection of Parkinson's disease using an adaptive fuzzy k-nearest neighbor approach. Biomedical Signal Processing and Control, 8(4):364--373, 2013.

Cited By

View all
  • (2024)Noise signature identification using mobile phones for indoor localizationMultimedia Tools and Applications10.1007/s11042-023-17885-3Online publication date: 16-Jan-2024
  • (2023)Voice signal-based disease diagnosis using IoT and learning algorithms for healthcareImplementation of Smart Healthcare Systems using AI, IoT, and Blockchain10.1016/B978-0-323-91916-6.00005-9(59-81)Online publication date: 2023
  • (2021)Speech outcome in tongue cancer surgery: objective evaluation by acoustic analysis softwareRomanian Journal of Rhinology10.2478/rjr-2021-002511:44(143-152)Online publication date: 3-Nov-2021
  • Show More Cited By

Index Terms

  1. Voice signal features analysis and classification: looking for new diseases related parameters

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        BCB '15: Proceedings of the 6th ACM Conference on Bioinformatics, Computational Biology and Health Informatics
        September 2015
        683 pages
        ISBN:9781450338530
        DOI:10.1145/2808719
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 09 September 2015

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. acoustic features extraction
        2. classification
        3. dysphonia
        4. machine learning
        5. voice monitoring

        Qualifiers

        • Research-article

        Conference

        BCB '15
        Sponsor:

        Acceptance Rates

        BCB '15 Paper Acceptance Rate 48 of 141 submissions, 34%;
        Overall Acceptance Rate 254 of 885 submissions, 29%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)6
        • Downloads (Last 6 weeks)1
        Reflects downloads up to 13 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Noise signature identification using mobile phones for indoor localizationMultimedia Tools and Applications10.1007/s11042-023-17885-3Online publication date: 16-Jan-2024
        • (2023)Voice signal-based disease diagnosis using IoT and learning algorithms for healthcareImplementation of Smart Healthcare Systems using AI, IoT, and Blockchain10.1016/B978-0-323-91916-6.00005-9(59-81)Online publication date: 2023
        • (2021)Speech outcome in tongue cancer surgery: objective evaluation by acoustic analysis softwareRomanian Journal of Rhinology10.2478/rjr-2021-002511:44(143-152)Online publication date: 3-Nov-2021
        • (2018)A Voice-Aware System for Vocal WellnessProceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/3233547.3233729(693-698)Online publication date: 15-Aug-2018
        • (2018)An intelligent healthcare system for detection and classification to discriminate vocal fold disordersFuture Generation Computer Systems10.1016/j.future.2018.02.02185(19-28)Online publication date: Aug-2018
        • (undefined)Sdi: A Tool for Speech Differentiation in User AuthenticationSSRN Electronic Journal10.2139/ssrn.4047897

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media