Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2647868.2654935acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Object-Based Visual Sentiment Concept Analysis and Application

Published: 03 November 2014 Publication History

Abstract

This paper studies the problem of modeling object-based visual concepts such as "crazy car" and "shy dog" with a goal to extract emotion related information from social multimedia content. We focus on detecting such adjective-noun pairs because of their strong co-occurrence relation with image tags about emotions. This problem is very challenging due to the highly subjective nature of the adjectives like "crazy" and "shy" and the ambiguity associated with the annotations. However, associating adjectives with concrete physical nouns makes the combined visual concepts more detectable and tractable. We propose a hierarchical system to handle the concept classification in an object specific manner and decompose the hard problem into object localization and sentiment related concept modeling. In order to resolve the ambiguity of concepts we propose a novel classification approach by modeling the concept similarity, leveraging on online commonsense knowledgebase. The proposed framework also allows us to interpret the classifiers by discovering discriminative features. The comparisons between our method and several baselines show great improvement in classification performance. We further demonstrate the power of the proposed system with a few novel applications such as sentiment-aware music slide shows of personal albums.

Supplementary Material

suppl.mov (fp276.avi)
Supplemental video

References

[1]
Supplemental Webpage. http://www.ee.columbia.edu/dvmm/vso/supplemental.html.
[2]
Subhabrata Bhattacharya, Behnaz Nojavanasghari, Tao Chen, Dong Liu, Shih-Fu Chang, and Mubarak Shah. Towards a comprehensive computational model for aesthetic assessment of videos. In Proceedings of the 21st ACM International Conference on Multimedia. ACM, 2013.
[3]
Damian Borth, Rongrong Ji, Tao Chen, Thomas Breuel, and Shih-Fu Chang. Large-scale visual sentiment ontology and detectors using adjective noun pairs. In Proceedings of the 21st ACM International Conference on Multimedia. ACM, 2013.
[4]
Yan-Ying Chen, Tao Chen, Winston H. Hsu, Hong-Yuan Mark Liao, and Shih--Fu Chang. Predicting viewer affective comments based on image content in social media. In Proceedings of the International Conference on Multimedia Retrieval. ACM, 2014.
[5]
Ritendra Datta, Dhiraj Joshi, Jia Li, and James Z Wang. Studying aesthetics in photographic images using a computational approach. In European Conference on Computer Vision. Springer, 2006.
[6]
J. Deng, W. Dong, R. Socher, L.J. Li, K. Li, and L. Fei--Fei. ImageNet: A Large-Scale Hierarchical Image Database. In Computer Vision and Pattern Recognition, 2009.
[7]
Jia Deng, Jonathan Krause, and Li Fei--Fei. Fine--grained crowdsourcing for fine-grained recognition. In Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pages 580--587. IEEE, 2013.
[8]
Kun Duan, Devi Parikh, David Crandall, and Kristen Grauman. Discovering localized attributes for fine-grained recognition. In Computer Vision and Pattern Recognition. IEEE, 2012.
[9]
Andrea Esuli and Fabrizio Sebastiani. SentiWordNet: A publicly available lexical resource for opinion mining. In Proceedings of the Conference on Language Resources and Evaluation, volume 6, 2006.
[10]
Rong--En Fan, Kai--Wei Chang, Cho--Jui Hsieh, Xiang--Rui Wang, and Chih--Jen Lin. Liblinear: A library for large linear classification. The Journal of Machine Learning Research, 9:1871--1874, 2008.
[11]
P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9):1627--1645, 2010.
[12]
V. Ferrari and A. Zisserman. Learning visual attributes. In Neural Information Processing Systems, 2007.
[13]
Yanwei Fu, Timothy M Hospedales, Tao Xiang, and Shaogang Gong. Attribute learning for understanding unstructured social activity. In European Conference on Computer Vision. Springer, 2012.
[14]
R. B. Girshick, P. F. Felzenszwalb, and D. McAllester. Discriminatively trained deformable part models, release 5. http://people.cs.uchicago.edu/~rbg/latent-release5/.
[15]
Chang Huang, Haizhou Ai, Yuan Li, and Shihong Lao. High--performance rotation invariant multiview face detection. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(4):671--686, 2007.
[16]
P. Isola, J. Xiao, A. Torralba, and A. Oliva. What makes an image memorable? In Computer Vision and Pattern Recognition, 2011.
[17]
Jia Jia, Sen Wu, Xiaohui Wang, Peiyun Hu, Lianhong Cai, and Jie Tang. Can we understand van Gogh's mood?: Learning to infer affects from images in social networks. In Proceedings of the 20th ACM international conference on Multimedia, pages 857--860. ACM, 2012.
[18]
Dhiraj Joshi, Ritendra Datta, Elena Fedorovskaya, Quang-Tuan Luong, James Z Wang, Jia Li, and Jiebo Luo. Aesthetics and emotions in images. Signal Processing Magazine, IEEE, 28(5):94--115, 2011.
[19]
Cyril Laurier and Perfecto Herrera. Mood Cloud: A real-time music mood visualization tool. In Proceedings of the 2008 Computers in Music Modeling and Retrieval Conference, 2008.
[20]
Hugo Liu and Push Singh. ConceptNet -- a practical commonsense reasoning tool-kit. BT technology journal, 22(4):211--226, 2004.
[21]
J. Machajdik and A. Hanbury. Affective image classification using features inspired by psychology and art theory. In Proceedings of ACM Multimedia, pages 83--92, 2010.
[22]
L. Marchesotti, F. Perronnin, D. Larlus, and G. Csurka. Assessing the aesthetic quality of photographs using generic image descriptors. In Proceedings of the International Conference on Computer Vision, 2011.
[23]
M. Naphade, J. Smith, J. Tesic, S.-F. Chang, W. Hsu, L. Kennedy, A. Hauptmann, and Curtis J. Large-scale concept ontology for multimedia. In IEEE Multimedia, 2006.
[24]
Bo Pang and Lillian Lee. Opinion mining and sentiment analysis. Information Retrieval, 2(1--2):1--135, 2008.
[25]
Genevieve Patterson and James Hays. Sun attribute database: Discovering, annotating, and recognizing scene attributes. In Computer Vision and Pattern Recognition. IEEE, 2012.
[26]
Mohammad Amin Sadeghi and Ali Farhadi. Recognition using visual phrases. In Computer Vision and Pattern Recognition. IEEE, 2011.
[27]
Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, and Yann LeCun. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv:1312.6229, 2013.
[28]
J.R. Smith, M. Naphade, and A. Natsev. Multimedia semantic indexing using model vectors. In International Conference on Multimedia and Expo, 2003.
[29]
Mike Thelwall, Kevan Buckley, Georgios Paltoglou, Di Cai, and Arvid Kappas. Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technology, 61(12):2544--2558, 2010.
[30]
Andranik Tumasjan, Timm O Sprenger, Philipp G Sandner, and Isabell M Welpe. Predicting elections with Twitter: What 140 characters reveal about political sentiment. In Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, 2010.
[31]
Weining Wang and Qianhua He. A survey on emotional semantic image retrieval. In Image Processing, 2008. ICIP 2008. 15th IEEE International Conference on, pages 117--120. IEEE, 2008.
[32]
Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, 2005.
[33]
V. Yanulevskaya, J. van Gemert, K. Roth, A. Herbold, N. Sebe, and J.M. Geusebroek. Emotional valence categorization using holistic image features. In Proceedings of the IEEE International Conference on Image Processing, pages 101--104, 2008.
[34]
Guangnan Ye, Dong Liu, I--Hong Jhuo, and Shih--Fu Chang. Robust late fusion with rank minimization. In Computer Vision and Pattern Recognition. IEEE, 2012.
[35]
Felix X. Yu, Dong Liu, Sanjiv Kumar, Tony Jebara, and Shih-Fu Chang. SVM for learning with label proportions. In International Conference on Machine Learning, 2013.

Cited By

View all
  • (2024)Text-image semantic relevance identification for aspect-based multimodal sentiment analysisPeerJ Computer Science10.7717/peerj-cs.190410(e1904)Online publication date: 12-Apr-2024
  • (2024)The Effect of Ad Image's Sentiment Scores and Mobile Device Attributes on Mobile Ad Response BehaviorIEEE Transactions on Engineering Management10.1109/TEM.2022.315712571(1314-1329)Online publication date: 2024
  • (2024)A comprehensive review of visual–textual sentiment analysis from social media networksJournal of Computational Social Science10.1007/s42001-024-00326-y7:3(2767-2838)Online publication date: 8-Sep-2024
  • Show More Cited By

Index Terms

  1. Object-Based Visual Sentiment Concept Analysis and Application

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '14: Proceedings of the 22nd ACM international conference on Multimedia
    November 2014
    1310 pages
    ISBN:9781450330633
    DOI:10.1145/2647868
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 November 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. affective computing
    2. social multimedia
    3. visual sentiment

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    MM '14
    Sponsor:
    MM '14: 2014 ACM Multimedia Conference
    November 3 - 7, 2014
    Florida, Orlando, USA

    Acceptance Rates

    MM '14 Paper Acceptance Rate 55 of 286 submissions, 19%;
    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)22
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Text-image semantic relevance identification for aspect-based multimodal sentiment analysisPeerJ Computer Science10.7717/peerj-cs.190410(e1904)Online publication date: 12-Apr-2024
    • (2024)The Effect of Ad Image's Sentiment Scores and Mobile Device Attributes on Mobile Ad Response BehaviorIEEE Transactions on Engineering Management10.1109/TEM.2022.315712571(1314-1329)Online publication date: 2024
    • (2024)A comprehensive review of visual–textual sentiment analysis from social media networksJournal of Computational Social Science10.1007/s42001-024-00326-y7:3(2767-2838)Online publication date: 8-Sep-2024
    • (2024)A supervised contrastive learning-based model for image emotion classificationWorld Wide Web10.1007/s11280-024-01260-927:3Online publication date: 24-Apr-2024
    • (2023)Multimodal Sentiment Analysis: A Survey of Methods, Trends, and ChallengesACM Computing Surveys10.1145/358607555:13s(1-38)Online publication date: 13-Jul-2023
    • (2023)VCMaster: Generating Diverse and Fluent Live Video Comments Based on Multimodal ContextsProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612078(4688-4696)Online publication date: 26-Oct-2023
    • (2023)Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for Visual Sentiment DistributionsIEEE Transactions on Multimedia10.1109/TMM.2022.314089225(1243-1255)Online publication date: 2023
    • (2023)Hierarchical Interactive Multimodal Transformer for Aspect-Based Multimodal Sentiment AnalysisIEEE Transactions on Affective Computing10.1109/TAFFC.2022.317109114:3(1966-1978)Online publication date: 1-Jul-2023
    • (2023)Emotional Attention Detection and Correlation Exploration for Image Emotion Distribution LearningIEEE Transactions on Affective Computing10.1109/TAFFC.2021.307113114:1(357-369)Online publication date: 1-Jan-2023
    • (2023)An Image Emotion Classification Method Based on Supervised Contrastive Learning2023 8th International Conference on Data Science in Cyberspace (DSC)10.1109/DSC59305.2023.00052(313-320)Online publication date: 18-Aug-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media