Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3321408.3326655acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesacm-turcConference Proceedingsconference-collections
research-article

Acoustic event detection with two-stage judgement in the noisy environment

Published: 17 May 2019 Publication History

Abstract

Aiming at the problem of inaccurate event location in noisy environment by existing acoustic even detection technology, this paper presents an acoustic event detection algorithm based on two-stage judgement. Firstly, the acoustic events existing in the audio signal are located by the two-stage judgement detection method, both of the distance of the Mel Frequency Cepstral Coefficients (MFCC) and the short-time energy between each audio signal frame and the noise average are calculated, respectively. The MFCC distance in the frequency domain which can produce fine but incomplete results is the first judgement; the energy distance in the time domain is the second judgement, which is used to supplement the first judgment. Studies have shown that the Gammatone filter bank is biologically closer to the human ear structure than the Mel filter bank. The Gammatone Frequency Cepstral Coefficients (GFCC) of the detected acoustic events were then extracted. The detected acoustic events are classified by the Gaussian Mixture Model (GMM). By analyzing the experimental results, the algorithm can solve the problem that the sound feature information is insufficient and the noise segment boundary is not clear. This system is more suitable for the situation where a variety of acoustic events should be analyzed.

References

[1]
Andery Temko, Robert Malkin and Christian Zieger. 2012: 311--322. CLEAR Evaluation of Acoustic Event Detection and Classification Systems. International Evaluation Conference on Classification of Events, Activities and Relationships. Springer, Berlin.
[2]
Woohyun Choi and Jinsang Rho. 2012: 118--123. Selective Background Adaptation Based Abnormal Acoustic Event Recognition for Audio Surveillance. IEEE Ninth International Conference on Advanced Video and Signal-based Surveillance. IEEE, Beijing.
[3]
Pasquale Foggia. 2015. Audio Surveillance of Roads: A System for Detecting Anomalous Sounds. IEEE Transactions on Intelligent Transportation Systems, 17(1):279--288
[4]
Seongkyu Mun. 2017: 796--800. Deep Neural Network based learning and transferring mid-level audio features for acoustic scene classification. IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, New Orleans.
[5]
Jayanthi kumari T.R. 2015:1--6. Comparison of LPCC and MFCC features and GMM and GMM-UBM modeling for limited data speaker verification. IEEE International Conference on Computational Intelligence & Computing Research. IEEE, Madurai.
[6]
Jimmy Ludena-Choez and Ascension Gallardo-Antolin. 2016. Acoustic Event Classification using spectral band selection and Non-Negative Matrix Factorization-based features. Expert Systems with Applications, 46:77--86.
[7]
Rene Grzeszick, Axel Plinge and Gernot A. Fink. 2017. Bag-of-Features Methods for Acoustic Event Detection and Classification. IEEE/ACM Transactions on Audio Speech & Language Processing, 25(6):1242--1252.
[8]
Eric Nowak, Frederic Jurie and Bill Triggs. 2006: 490--503. Sampling Strategies for Bag-of-Features Image Classification. European Conference on Computer Vision. Springer. Springer, Berlin.
[9]
Zhao Zhao, Sai-hua Zhang, and Zhi-yong Xu. 2017. Automated bird acoustic event detection and robust species classification. Ecological Informatics, 39:99--108.
[10]
Annamaria Mesaros, Toni Heittola and Tuomas Virtanen. 2016: 1128--1132. TUT database for acoustic scene classification and sound event detection. Signal Processing Conference. IEEE, Budapest.
[11]
Johannesma P I M. 1972: 58--69. The pre-response stimulus ensemble of neurons in the cochlear nucleus. Proc of the Symposium on Hearing Theory. IPO, Eindhoven, Netherlands.
[12]
Xiaoyun Lv. 2010. Research on abnormal audio recognition algorithm based on MFCC and GMM. Southwest Jiaotong University, Chengdu, China.
[13]
Lukas Burget, Pavel Matejka, Petr Schwaz, Ondrej Glembek and Jan "Honza" Cernocky. 2007. Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System. IEEE Transactions on Audio Speech & Language Processing, 15(7):1979--1986.
[14]
Annamaria Mesaros, Toni Heittola and Tuomas Virtanen. 2016. Metrics for polyphonic sound event detection. Applied Sciences, 6(6):162.
  1. Acoustic event detection with two-stage judgement in the noisy environment

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ACM TURC '19: Proceedings of the ACM Turing Celebration Conference - China
    May 2019
    963 pages
    ISBN:9781450371582
    DOI:10.1145/3321408
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 May 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. GFCC
    2. acoustic event classification
    3. acoustic event detection
    4. two-stage judgement

    Qualifiers

    • Research-article

    Conference

    ACM TURC 2019

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 39
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 24 Nov 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media