Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3689062.3689081acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Exploring Multimodal Approaches and Fusion Methods for CEO Social Attribute Prediction in 2024 MuSe-Perception

Published: 28 October 2024 Publication History

Abstract

In this paper, we discuss the way we addressed the 2024 MuSe-Perception challenge, which aims to automatically recognize and quantify social attributes of CEOs using multimodal data. We investigated five different approaches: (1) optimizing a basic RNN and Transformer encoder model, (2) adopting the xLSTM architecture for improved long-term dependency modeling, (3) extracting text features using pre-trained language models, (4) grouping similar social attributes for joint learning, and (5) incorporating novel fusion methods. The experimental results on the development set showed that our different approaches excelled for different attributes, with multimodal methods generally outperforming unimodal methods, and text feature extraction notably improving the prediction of a subset of the agentive attributes. For the test set, our most effective strategy achieved a mean Pearson correlation coefficient of 0.35 by combining the highest values from four different approaches. This outcome points to the existence of a substantial gap between development and test set effectiveness, indicating limitations in model generalization and calling for further research to improve the accuracy and reliability of methods for social attribute prediction.

References

[1]
Shahin Amiriparian, Lukas Christ, Alexander Kathan, Maurice Gerczuk, Niklas Mueller, Steffen Klug, Lukas Stappen, Recoro Munich, Andreas König, Erik Cambria, Björn Schuller, and Simone Eulitz. 2024. The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition. https://doi.org/10.13140/RG.2.2.31167.32166
[2]
Maximilian Beck, Korbinian Pöppel, Markus Spanring, Andreas Auer, Oleksandra Prudnikova, Michael Kopp, Günter Klambauer, Johannes Brandstetter, and Sepp Hochreiter. 2024. xLSTM: Extended Long Short-Term Memory. arXiv preprint arXiv:2405.04517 (2024).
[3]
Lukas Biewald. 2020. Experiment Tracking with Weights and Biases. https://www.wandb.com/ Software available from wandb.com.
[4]
Aayushi Chaudhari, Chintan Bhatt, Achyut Krishna, and Pier Luigi Mazzeo. 2022. ViTFER: facial emotion recognition with vision transformers. Applied System Innovation, Vol. 5, 4 (2022), 80. https://doi.org/10.3390/asi5040080
[5]
Jonathan H. Clark, Dan Garrette, Iulia Turc, and John Wieting. 2021. CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation. CoRR, Vol. abs/2103.06874 (2021). arxiv: 2103.06874 https://arxiv.org/abs/2103.06874
[6]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR, Vol. abs/1810.04805 (2018). arxiv: 1810.04805 http://arxiv.org/abs/1810.04805
[7]
Hugging Face. 2024. Hugging Face - The AI community building the future. https://huggingface.co/. Accessed: 2024-07--22.
[8]
Rensis Likert. 1932. A technique for the measurement of attitudes. Archives of psychology (1932).
[9]
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A Robustly Optimized BERT Pretraining Approach. CoRR, Vol. abs/1907.11692 (2019). arxiv: 1907.11692 http://arxiv.org/abs/1907.11692
[10]
Ho-Min Park, Ganghyun Kim, Arnout Van Messem, and Wesley De Neve. 2023. MuSe-Personalization 2023: Feature Engineering, Hyperparameter Optimization, and Transformer-Encoder Re-discovery. In Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, Humour and Personalisation (Ottawa ONCanada) (MuSe '23). Association for Computing Machinery, New York, NY, USA, 89--97. https://doi.org/10.1145/3606039.3613104
[11]
Ho-min Park, Ilho Yun, Ajit Kumar, Ankit Kumar Singh, Bong Jun Choi, Dhananjay Singh, and Wesley De Neve. 2022. Towards Multimodal Prediction of Time-Continuous Emotion Using Pose Feature Engineering and a Transformer Encoder. In Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge (Lisboa, Portugal) (MuSe' 22). Association for Computing Machinery, New York, NY, USA, 47--54. https://doi.org/10.1145/3551876.3554807
[12]
Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. ArXiv, Vol. abs/1910.01108 (2019).
[13]
Sefik Serengil and Alper Ozpinar. 2024. A Benchmark of Facial Recognition Pipelines and Co-Usability Performances of Modules. Bilisim Teknolojileri Dergisi, Vol. 17, 2 (2024), 95--107. https://doi.org/10.17671/gazibtd.1399077
[14]
Suramya Tomar. 2006. Converting video formats with FFmpeg. Linux Journal, Vol. 2006, 146 (2006), 10.
[15]
Johannes Wagner, Andreas Triantafyllopoulos, Hendrik Wierstorf, Maximilian Schmitt, Felix Burkhardt, Florian Eyben, and Björn W. Schuller. 2023. Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap. IEEE Transactions on Pattern Analysis & Machine Intelligence, Vol. 01 (2023), 1--13. https://doi.org/10.1109/TPAMI.2023
[16]
Tal Yarkoni. 2010. Personality in 100,000 Words: A large-scale analysis of personality and word use among bloggers. Journal of Research in Personality, Vol. 44, 3 (2010), 363--373. https://doi.org/10.1016/j.jrp.2010.04.001

Cited By

View all
  • (2024)MuSe '24: The 5th Multimodal Sentiment Analysis Challenge and Workshop: Social Perception & HumorProceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor10.1145/3689062.3695939(10-11)Online publication date: 28-Oct-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MuSe'24: Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor
October 2024
76 pages
ISBN:9798400711992
DOI:10.1145/3689062
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 October 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. attribute grouping
  2. fusion methods
  3. machine learning
  4. multimodal analysis
  5. social attribute prediction
  6. text features

Qualifiers

  • Research-article

Funding Sources

  • AI Excellence Global Innovative Leader Education Program, Korea
  • Information Technology Research Center
  • Ministry of Culture, Sports and Tourism, Korea
  • Ministry of Science and ICT, South Korea
  • Ghent University Global Campus, Korea

Conference

MM '24
Sponsor:
MM '24: The 32nd ACM International Conference on Multimedia
October 28 - November 1, 2024
Melbourne VIC, Australia

Acceptance Rates

Overall Acceptance Rate 14 of 17 submissions, 82%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)38
  • Downloads (Last 6 weeks)38
Reflects downloads up to 29 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)MuSe '24: The 5th Multimodal Sentiment Analysis Challenge and Workshop: Social Perception & HumorProceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor10.1145/3689062.3695939(10-11)Online publication date: 28-Oct-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media