Object-Based Visual Sentiment Concept Analysis and Application

Authors:

Shih-Fu Chang

MM '14: Proceedings of the 22nd ACM international conference on Multimedia

Pages 367 - 376

https://doi.org/10.1145/2647868.2654935

Published: 03 November 2014

Abstract

This paper studies the problem of modeling object-based visual concepts such as "crazy car" and "shy dog", with the goal of extracting emotion-related information from social multimedia content. We focus on detecting such adjective-noun pairs because of their strong co-occurrence with image tags about emotions. This problem is very challenging because adjectives like "crazy" and "shy" are highly subjective and the annotations are ambiguous. However, associating adjectives with concrete physical nouns makes the combined visual concepts more detectable and tractable. We propose a hierarchical system that handles concept classification in an object-specific manner and decomposes the hard problem into object localization and sentiment-related concept modeling. To resolve the ambiguity of concepts, we propose a novel classification approach that models concept similarity by leveraging an online commonsense knowledge base. The proposed framework also allows us to interpret the classifiers by discovering discriminative features. Comparisons between our method and several baselines show a large improvement in classification performance. We further demonstrate the power of the proposed system with a few novel applications, such as sentiment-aware music slide shows of personal albums.
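The two-stage decomposition described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the function names (`soft_labels`, `classify`, `toy_localizer`), the feature vectors, and the similarity values are all hypothetical, and the similarity table merely stands in for scores that the paper derives from a commonsense knowledge base. The sketch shows (a) routing an image to a noun-specific adjective model after localization, and (b) turning a hard adjective label into similarity-weighted soft targets.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical adjective-to-adjective similarity scores for one noun.
# The values are made up; the abstract only says that similarity is
# drawn from an online commonsense knowledge base.
SIMILARITY: Dict[Tuple[str, str], float] = {
    ("crazy", "crazy"): 1.0,
    ("crazy", "fast"): 0.5,
    ("crazy", "shy"): 0.0,
}


def soft_labels(true_adj: str, adjectives: List[str],
                sim: Dict[Tuple[str, str], float]) -> List[float]:
    """Spread a hard label over similar concepts; weights sum to 1."""
    weights = [sim.get((true_adj, a), sim.get((a, true_adj), 0.0))
               for a in adjectives]
    total = sum(weights)
    if total == 0:
        raise ValueError("no similarity mass for label " + true_adj)
    return [w / total for w in weights]


def classify(image: object,
             localize: Callable[[object], Tuple[str, List[float]]],
             noun_models: Dict[str, Callable[[List[float]], Dict[str, float]]]) -> str:
    """Stage 1 localizes the object (noun); stage 2 scores adjectives
    with the model specific to that noun."""
    noun, region_features = localize(image)
    scores = noun_models[noun](region_features)
    best_adj = max(scores, key=scores.get)
    return f"{best_adj} {noun}"


# Stand-in components, for illustration only.
def toy_localizer(image: object) -> Tuple[str, List[float]]:
    return "car", [0.9, 0.1]


toy_models = {"car": lambda f: {"crazy": f[0], "shy": f[1]}}

prediction = classify("photo.jpg", toy_localizer, toy_models)
targets = soft_labels("crazy", ["crazy", "fast", "shy"], SIMILARITY)
```

With these toy stand-ins, `prediction` is the adjective-noun pair chosen by the car-specific model, and `targets` places most of the label mass on "crazy", some on "fast", and none on "shy". A real system would replace the localizer with an object detector and the lambda with a trained per-noun classifier.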

Supplementary Material

suppl.mov (fp276.avi)

Supplemental video




Index Terms

Object-Based Visual Sentiment Concept Analysis and Application
1. Information systems
  1. Information retrieval


Published In

MM '14: Proceedings of the 22nd ACM international conference on Multimedia

November 2014

1310 pages

ISBN:9781450330633

DOI:10.1145/2647868

General Chairs:
Kien A. Hua (University of Central Florida, USA)
Yong Rui (Microsoft Research, China)
Ralf Steinmetz (Technische Universität Darmstadt, Germany)

Program Chairs:
Alan Hanjalic (Delft University of Technology, Netherlands)
Apostol (Paul) Natsev (Google, USA)
Wenwu Zhu (Tsinghua University, China)

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States


Qualifiers

Research-article

Funding Sources

Defense Advanced Research Projects Agency

Conference

MM '14

Sponsor:

SIGMM

MM '14: 2014 ACM Multimedia Conference

November 3 - 7, 2014

Orlando, Florida, USA

Acceptance Rates

MM '14 Paper Acceptance Rate 55 of 286 submissions, 19%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%


