Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.05916 (cs)

[Submitted on 9 Mar 2024 (v1), last revised 10 Apr 2024 (this version, v2)]

Title:GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing

Authors:Hao Lu, Xuesong Niu, Jiyao Wang, Yin Wang, Qingyong Hu, Jiaqi Tang, Yuting Zhang, Kaishen Yuan, Bin Huang, Zitong Yu, Dengbo He, Shuiguang Deng, Hao Chen, Yingcong Chen, Shiguang Shan

View PDF HTML (experimental)

Abstract:Multimodal large language models (MLLMs) are designed to process and integrate information from multiple sources, such as text, speech, images, and videos. Despite its success in language understanding, it is critical to evaluate the performance of downstream tasks for better human-centric applications. This paper assesses the application of MLLMs with 5 crucial abilities for affective computing, spanning from visual affective tasks and reasoning tasks. The results show that \gpt has high accuracy in facial action unit recognition and micro-expression detection while its general facial expression recognition performance is not accurate. We also highlight the challenges of achieving fine-grained micro-expression recognition and the potential for further study and demonstrate the versatility and potential of \gpt for handling advanced tasks in emotion recognition and related fields by integrating with task-related agents for more complex tasks, such as heart rate estimation through signal processing. In conclusion, this paper provides valuable insights into the potential applications and challenges of MLLMs in human-centric computing. Our interesting examples are at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.05916 [cs.CV]
	(or arXiv:2403.05916v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.05916

Submission history

From: Hao Lu [view email]
[v1] Sat, 9 Mar 2024 13:56:25 UTC (4,940 KB)
[v2] Wed, 10 Apr 2024 07:58:44 UTC (4,941 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators