Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3316615.3316664acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicscaConference Proceedingsconference-collections
research-article

Meta-Analysis of Acoustic Feature Extraction for Machine Listening Systems

Published: 19 February 2019 Publication History

Abstract

Generally, the concentration of the study and research in the understanding of sounds revolves around the speech and music area, on the contrary, there are few in environmental and non-speech recognition. This paper carries out a meta-analysis of the acoustic transformation and feature set extraction of the environmental sound raw signal form into a parametric type representation in handling analysis, perception, and labeling for audio analysis of sound identification systems. We evaluated and analyzed the various contemporary methods and feature algorithms surveyed for the acoustic identification and perception of surrounding sounds, the Gammatone spectral coefficients (GSTC) and Mel Filterbank (FBEs) then the acoustic signal classification the Convolutional Neural Network (ConvNet) was applied. The outcome demonstrates that GSTC accomplished better as a feature in contrast to FBEs, but FBEs tend to improve performance when merge or incorporated with other feature. The analysis demonstrates that merging or incorporating with other features set is encouraging in achieving a much better accuracy in contrast to a single feature in classifying environmental sounds that is useful in the advancement of the intelligent machine listening frameworks.

References

[1]
Piczak K. J. ESC: Dataset for Environmental Sound Classification. In Proceedings of the 23rd ACM international conference on Multimedia, pp. 1015--1018, ACM, 2015.
[2]
Stowell A D., D. Giannoulis, E. Benetos, M. Lagrange, M.D. Plumbley, "Detection and classification of acoustic scenes and events", IEEE Transactions on Multimedia, vol. 17, no. 10, pp. 1733--1746, Oct 2015.
[3]
Imoto K., "Introduction to Acoustic Event and Scene Analysis," Acoustical Science and Technology, Vol. 39, No. 3, pp. 182--188, 2018.
[4]
Lyon, R. "Machine hearing: an emerging field," IEEE Signal Processing Magazine, vol. 42, pp. 1414-- 1416, 201.
[5]
Tak R.N., Agrawal D.M., Patil H.A. (2017) Novel Phase Encoded Mel Filterbank Energies for Environmental Sound Classification. In: Shankar B., Ghosh K., Mandal D., Ray S., Zhang D., Pal S. (eds) Pattern Recognition and Machine Intelligence. PReMI 2017. Lecture Notes in Computer Science, vol 10597. Springer, Cham
[6]
Valero X. and F. Alias, "Gammatone cepstral coefficients: Biologically inspired features for non-speech audio classification," IEEE Trans. on Multimedia, vol. 14, no. 6, pp. 1684--1689, 2012.
[7]
Carney L. H. and T. Yin, "Temporal coding of resonances by low-frequency auditory nerve fibers: single-fiber responses and a population model," Journal of Neurophysiology, vol. 60, no. 5, pp. 1653--1677, 1988.
[8]
Agrawal D. M., H. B. Sailor, M. H. Soni, and H. A. Patil, "Novel teo based gammatone features for environmental sound classification," in European Signal Processing Conference, 2017, pp. 1809--1813
[9]
Sailor H B, Dharmesh M Agrawal, and Hemant A Patil. Unsupervised filterbank learning using convolutional restricted Boltzmann machine for environmental sound classification. Proc. Interspeech 2017, pages 3107--3111, 2017.
[10]
Piczak K. J., "Environmental sound classification with convolutional neural networks," in 25th Int. Workshop on Machine Learning for Signal Processing (MLSP), Boston, MA, USA, 2015, pp. 1--6.

Cited By

View all
  • (2021)Computer Discriminative Acoustic Tool for Reading Enhancement and Diagnostic: Development and Pilot Test2021 Second International Conference on Innovative Technology Convergence (CITC)10.1109/CITC54365.2021.00019(63-68)Online publication date: Dec-2021

Index Terms

  1. Meta-Analysis of Acoustic Feature Extraction for Machine Listening Systems

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICSCA '19: Proceedings of the 2019 8th International Conference on Software and Computer Applications
    February 2019
    611 pages
    ISBN:9781450365734
    DOI:10.1145/3316615
    © 2019 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    In-Cooperation

    • University of New Brunswick: University of New Brunswick

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 February 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. ConvNets
    2. Feature extraction
    3. acoustic transformation
    4. machine listening system

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICSCA '19

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 19 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)Computer Discriminative Acoustic Tool for Reading Enhancement and Diagnostic: Development and Pilot Test2021 Second International Conference on Innovative Technology Convergence (CITC)10.1109/CITC54365.2021.00019(63-68)Online publication date: Dec-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media