Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3242969.3243682acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesicmi-mlmiConference Proceedingsconference-collections
research-article

Functional-Based Acoustic Group Feature Selection for Automatic Recognition of Eating Condition

Published: 02 October 2018 Publication History

Abstract

This paper presents the novel Functional-based acoustic Group Feature Selection (FGFS) method for automatic eating condition recognition addressed in the ICMI 2018 Eating Analysis and Tracking Challenge's Food-type Sub-Challenge. The Food-type Sub-Challenge employs the audiovisual iHEARu-EAT database and attempts to classify which of six food types, or none, is being consumed by subjects while speaking. The approach proposed by the FGFS method uses the audio mode and considers the acoustic feature space in groups rather than individually. Each group is comprised of acoustic features generated by the application of a statistical functional to a specified set of the low-level descriptors of the audio data. The FGFS method provides information about the degree of relevance of the statistical functionals to the task. In addition, the partitioning of features into groups allows for more rapid processing of the official Sub-Challenge's large acoustic feature set. The FGFS-based system achieves 2.8% relative Unweighted Average Recall performance improvement over the official Food-type Sub-Challenge baseline on iHEARu-EAT test data.

References

[1]
Leo Breiman . 2001. Random Forests. Machine Learning Vol. 45, 1 (01 Oct . 2001), 5--32.
[2]
Corinna Cortes and Vladimir Vapnik . 1995. Support-Vector Networks. Machine Learning Vol. 20, 3 (1995), 273--297.
[3]
Geoff Dougherty . 2013. Feature Extraction and Selection. In Pattern Recognition and Classification: An Introduction. Springer, 123--141.
[4]
Florian Eyben . 2016. Real-time Speech and Music Classification by Large Audio Feature Space Extraction. Springer.
[5]
Florian Eyben, Felix Weninger, Florian Groß, and Björn Schuller . 2013. Recent developments in openSMILE, the munich open-source multimedia feature extractor. In Proceedings of the 21st ACM international conference on Multimedia. ACM, 835--838.
[6]
Martin Gutlein, Eibe Frank, Mark Hall, and Andreas Karwath . 2009. Large-scale attribute selection using wrappers. In Computational Intelligence and Data Mining, 2009. CIDM'09. IEEE Symposium on. IEEE, 332--339.
[7]
Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H Witten. 2009. The WEKA Data Mining Software: An Update. ACM SIGKDD explorations newsletter Vol. 11, 1 (2009), 10--18.
[8]
Simone Hantke, Maximilian Schmitt, Panagiotis Tzirakis, and Björn Schuller. 2018. EAT -- The ICMI 2018 Eating Analysis and Tracking Challenge Proceedings of 20th ACM International Conference on Multimodal Interaction (ICMI'18). ACM, Boulder, Colorado, USA.
[9]
Simone Hantke, Felix Weninger, Richard Kurle, Fabien Ringeval, Anton Batliner, Amr El-Desoky Mousa, and Björn Schuller . 2016. I Hear You Eat and Speak: Automatic Recognition of Eating Condition and Food Type, Use-Cases, and Impact on ASR Performance. PloS ONE Vol. 11, 5 (2016), e0154486.
[10]
Ron Kohavi and George H John. 1997. Wrappers for feature subset selection. Artificial Intelligence Vol. 97, 1 (1997), 273--324.
[11]
Dara Pir and Theodore Brown. 2015. Acoustic Group Feature Selection Using Wrapper Method for Automatic Eating Condition Recognition. In INTERSPEECH 2015 -- 16textsuperscriptth Annual Conference of the International Speech Communication Association, September 6--10, 2015, Dresden, Germany, Proceedings. 894--898.
[12]
Dara Pir, Theodore Brown, and Jarek Krajewski . 2017. Automatic Driver Sleepiness Detection Using Wrapper-Based Acoustic Between-Groups, Within-Groups, and Individual Feature Selection Proceedings of the 3rd International Conference on Vehicle Technology and Intelligent Transport Systems - Volume 1: VEHITS,. INSTICC, SciTePress, 196--202.
[13]
John C Platt . 1999. Fast Training of Support Vector Machines Using Sequential Minimal Optimization. In Advances in Kernel Methods: Support Vector Learning. MIT Press.
[14]
Björn Schuller and Anton Batliner . 2014. Computational Paralinguistics: Emotion, Affect and Personality in Speech and Language Processing. John Wiley & Sons.
[15]
Björn Schuller, Anton Batliner, Dino Seppi, Stefan Steidl, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Noam Amir, Loic Kessous, and Vered Aharonson. 2007. The Relevance of Feature Type for the Automatic Classification of Emotional User States: Low Level Descriptors and Functionals Interspeech 2007 -- 8th Annual Conference of the International Speech Communication Association, August 27--31, Antwerp, Belgium, Proceedings. 2253--2256.
[16]
Björn Schuller, Stefan Steidl, and Anton Batliner. 2009. The INTERSPEECH 2009 Emotion Challenge. In Interspeech 2009 -- 10th Annual Conference of the International Speech Communication Association, September 6--10, 2009, Brighton, UK, Proceedings. 312--315.
[17]
Björn Schuller, Stefan Steidl, Anton Batliner, Simone Hantke, Florian Hönig, Juan Rafael Orozco-Arroyave, Elmar Nöth, Yue Zhang, and Felix Weninger. 2015. The INTERSPEECH 2015 Computational Paralinguistics Challenge: Nativeness, Parkinson's & Eating Condition. In Interspeech 2015 -- 16th Annual Conference of the International Speech Communication Association, September 6--10, Dresden, Germany, Proceedings. 478--482.
[18]
Björn Schuller, Stefan Steidl, Anton Batliner, Alessandro Vinciarelli, Klaus Scherer, Fabien Ringeval, Mohamed Chetouani, Felix Weninger, Florian Eyben, Erik Marchi, Marcello Mortillaro, Hugues Salamin, Anna Polychroniou, Fabio Valente, and Samuel Kim. 2013. The INTERSPEECH 2013 Computational Paralinguistics Challenge: Social Signals, Conflict, Emotion, Autism. In Interspeech 2013 -- 14th Annual Conference of the International Speech Communication Association, August 25--29, Lyon, France, Proceedings. 148--152.
[19]
Marc Sumner, Eibe Frank, and Mark Hall. 2005. Speeding Up Logistic Model Tree Induction. Springer Berlin Heidelberg, Berlin, Heidelberg, 675--683.
[20]
Felix Weninger, Florian Eyben, Björn Schuller, Marcello Mortillaro, and Klaus Scherer. 2013. On the acoustics of emotion in audio: what speech, music, and sound have in common. Frontiers in Psychology Vol. 4 (2013), 292.

Cited By

View all
  • (2020)SMOPredT4SE: An Effective Prediction of Bacterial Type IV Secreted Effectors Using SVM Training With SMOIEEE Access10.1109/ACCESS.2020.29710918(25570-25578)Online publication date: 2020
  • (2019)The Challenge of Automatic Eating Behaviour Analysis and TrackingRecent Advances in Intelligent Assistive Technologies: Paradigms and Applications10.1007/978-3-030-30817-9_8(187-204)Online publication date: 8-Nov-2019

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICMI '18: Proceedings of the 20th ACM International Conference on Multimodal Interaction
October 2018
687 pages
ISBN:9781450356923
DOI:10.1145/3242969
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • SIGCHI: Specialist Interest Group in Computer-Human Interaction of the ACM

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 October 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. computational paralinguistics
  2. eating condition recognition
  3. feature selection
  4. group feature
  5. large acoustic feature set
  6. low-level descriptor
  7. statistical functional

Qualifiers

  • Research-article

Conference

ICMI '18
Sponsor:
  • SIGCHI

Acceptance Rates

ICMI '18 Paper Acceptance Rate 63 of 149 submissions, 42%;
Overall Acceptance Rate 453 of 1,080 submissions, 42%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2020)SMOPredT4SE: An Effective Prediction of Bacterial Type IV Secreted Effectors Using SVM Training With SMOIEEE Access10.1109/ACCESS.2020.29710918(25570-25578)Online publication date: 2020
  • (2019)The Challenge of Automatic Eating Behaviour Analysis and TrackingRecent Advances in Intelligent Assistive Technologies: Paradigms and Applications10.1007/978-3-030-30817-9_8(187-204)Online publication date: 8-Nov-2019

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media