research-article

Public Access

Cats and Captions vs. Creators and the Clock: Comparing Multimodal Content to Context in Predicting Relative Popularity

Authors:

David MimnoAuthors Info & Claims

WWW '17: Proceedings of the 26th International Conference on World Wide Web

Pages 927 - 936

https://doi.org/10.1145/3038912.3052684

Published: 03 April 2017 Publication History

Abstract

The content of today's social media is becoming more and more rich, increasingly mixing text, images, videos, and audio. It is an intriguing research question to model the interplay between these different modes in attracting user attention and engagement. But in order to pursue this study of multimodal content, we must also account for context: timing effects, community preferences, and social factors (e.g., which authors are already popular) also affect the amount of feedback and reaction that social-media posts receive. In this work, we separate out the influence of these non-content factors in several ways. First, we focus on ranking pairs of submissions posted to the same community in quick succession, e.g., within 30 seconds; this framing encourages models to focus on time-agnostic and community-specific content features. Within that setting, we determine the relative performance of author vs. content features. We find that victory usually belongs to "cats and captions," as visual and textual features together tend to outperform identity-based features. Moreover, our experiments show that when considered in isolation, simple unigram text features and deep neural network visual features yield the highest accuracy individually, and that the combination of the two modalities generally leads to the best accuracies overall.

References

[1]

K. Almgren, J. Lee, and M. Kim. Predicting the future popularity of images on social networks. In MISNC, 2016.

Digital Library

[2]

T. Althoff, C. Danescu-Niculescu-Mizil, and D. Jurafsky. How to ask for a favor: A case study on the success of altruistic requests. In ICWSM, 2014.

[3]

Y. Artzi, P. Pantel, and M. Gamon. Predicting responses to microblog posts. In NAACL, 2012.

Digital Library

[4]

S. Bakhshi and E. Gilbert. Red, purple and pink: The colors of diffusion on Pinterest. PloS One, 2015.

[5]

S. Bakhshi, D. A. Shamma, and E. Gilbert. Faces engage us: Photos with faces attract more likes and comments on instagram. In CHI, 2014.

Digital Library

[6]

E. Bakshy, J. M. Hofman, W. A. Mason, and D. J. Watts. Everyone's an influencer: Quantifying influence on twitter. In WSDM, 2011.

Digital Library

[7]

R. Bandari, S. Asur, and B. A. Huberman. The pulse of news in social media: Forecasting popularity. In ICWSM, 2012.

[8]

Y. Borghol, S. Ardon, N. Carlsson, D. Eager, and A. Mahanti. The untold story of the clones: Content-agnostic factors that impact YouTube video popularity. In KDD, 2012.

Digital Library

[9]

G. Bradski. OpenCV library. Dr. Dobb's Journal of Software Tools, 2000.

[10]

C. Buciluǎ, R. Caruana, and A. Niculescu-Mizil. Model compression. In KDD, pages 535--541. ACM, 2006.

Digital Library

[11]

C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. Learning to rank using gradient descent. In ICML, 2005.

Digital Library

[12]

J. Chen. Multi-modal learning: Study on a large-scale micro-video data collection. In ACM Multimedia, 2016.

Digital Library

[13]

F. Chollet. Keras. https://github.com/fchollet/keras, 2015.

[14]

N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, 2005.

Digital Library

[15]

C. Danescu-Niculescu-Mizil, J. Cheng, J. Kleinberg, and L. Lee. You had me at hello: How phrasing affects memorability. In ACL, 2012.

Digital Library

[16]

C. Danescu-Niculescu-Mizil, R. West, D. Jurafsky, J. Leskovec, and C. Potts. No country for old members: User lifecycle and linguistic change in online communities. In WWW, 2013.

Digital Library

[17]

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.

[18]

A. Deza and D. Parikh. Understanding image virality. In CVPR, 2015.

[19]

G. Dror, D. Pelleg, O. Rokhlenko, and I. Szpektor. Churn prediction in new users of yahoo! answers. In WWW, 2012.

Digital Library

[20]

F. Figueiredo. On the prediction of popularity of trends and hits for user generated videos. In WSDM, 2013.

Digital Library

[21]

F. Figueiredo, J. M. Almeida, F. Benevenuto, and K. P. Gummadi. Does content determine information popularity in social media?: A case study of YouTube videos' content and their popularity. In CHI, pages 979--982. ACM, 2014.

Digital Library

[22]

F. Gelli, T. Uricchio, M. Bertini, A. Del Bimbo, and S.-F. Chang. Image popularity prediction in social media using sentiment and context features. In ACM Multimedia, 2015.

Digital Library

[23]

E. Gilbert. Widespread underprovision on Reddit. In CSCW, 2013.

Digital Library

[24]

A. Graves and J. Schmidhuber. Framewise phoneme classification with bidirectional LS™ and other neural network architectures. Neural Networks, 2005.

Digital Library

[25]

M. Guerini, C. Strapparava, and G. Özbal. Exploring text virality in social networks. In ICWSM, 2011.

[26]

K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In CVPR, 2016.

[27]

R. Herbrich, T. Graepel, and K. Obermayer. Large margin rank boundaries for ordinal regression. In NIPS, 1999.

[28]

J. Hessel, C. Tan, and L. Lee. Science, AskScience, and BadScience: On the coexistence of highly related communities. In ICWSM, 2016.

[29]

F. Hill, K. Cho, and A. Korhonen. Learning distributed representations of sentences from unlabelled data. In NAACL, 2016.

[30]

S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8), 1997.

Digital Library

[31]

L. Hong, O. Dan, and B. D. Davison. Predicting popular messages in Twitter. In WWW companion volume, 2011.

Digital Library

[32]

J. Hu, T. Yamasaki, and K. Aizawa. Multimodal learning for image popularity prediction on social media. In Consumer Electronics-Taiwan, 2016.

[33]

M. Iyyer, V. Manjunatha, J. Boyd-Graber, and H. Daumé III. Deep unordered composition rivals syntactic methods for text classification. In ACL, 2015.

[34]

A. Jaech, V. Zayats, H. Fang, M. Ostendorf, and H. Hajishirzi. Talking to the crowd: What do people react to in online discussions? In EMNLP, 2015.

[35]

T. Joachims. Optimizing search engines using clickthrough data. In KDD, 2002.

Digital Library

[36]

R. Khan, J. Van de Weijer, F. Shahbaz Khan, D. Muselet, C. Ducottet, and C. Barat. Discriminative color descriptors. In CVPR, 2013.

Digital Library

[37]

A. Khosla, A. Das Sarma, and R. Hamid. What makes an image popular? In WWW, 2014.

Digital Library

[38]

A. Khosla, J. Xiao, A. Torralba, and A. Oliva. Memorability of image regions. In NIPS, 2012.

Digital Library

[39]

H. Lakkaraju, J. J. McAuley, and J. Leskovec. What's in a name? understanding the interplay between titles, content, and communities in social media. In ICWSM, 2013.

[40]

C. Lampe and P. Resnick. Slash (dot) and burn: distributed moderation in a large online conversation space. In CHI, 2004.

Digital Library

[41]

M. Lee, S. H. Jin, and D. Mimno. Beyond exchangeability: The Chinese voting process. In NIPS, 2016.

[42]

K. Lerman and T. Hogg. Using a model of social dynamics to predict popularity of news. In WWW, 2010.

Digital Library

[43]

C. Lynch, K. Aryafar, and J. Attenberg. Images don't lie: Transferring deep visual semantic features to large-scale multimodal learning to rank. In KDD, 2016.

Digital Library

[44]

Z. Ma, A. Sun, and G. Cong. Will this#hashtag be popular tomorrow? In SIGIR, 2012.

Digital Library

[45]

M. Mazloom, R. Rietveld, S. Rudinac, M. Worring, and W. van Dolen. Multimodal popularity prediction of brand-related social media posts. In ACM Multimedia, 2016.

Digital Library

[46]

P. J. McParlane, Y. Moshfeghi, and J. M. Jose. Nobody comes here anymore, it's too crowded; Predicting image popularity on Flickr. In Multimedia Retrieval, 2014.

Digital Library

[47]

A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV, 2001.

Digital Library

[48]

J. Pennington, R. Socher, and C. D. Manning. GloVe: Global vectors for word representation. In EMNLP, 2014.

[49]

S. Petrovic, M. Osborne, and V. Lavrenko. RT to win! Predicting message propagation in Twitter. In ICWSM, 2011.

[50]

H. Pinto, J. M. Almeida, and M. A. Gonçalves. Using early view patterns to predict the popularity of YouTube videos. In WSDM, 2013.

Digital Library

[51]

D. M. Romero, C. Tan, and J. Ugander. On the interplay between social and topical structure. In ICWSM, 2013.

[52]

M. J. Salganik, P. S. Dodds, and D. J. Watts. Experimental study of inequality and unpredictability in an artificial cultural market. Science, 2006.

[53]

R. Schifanella, M. Redi, and L. Aiello. An image is worth more than a thousand favorites: Surfacing the hidden beauty of Flickr pictures. In ICWSM, 2015.

[54]

D. A. Shamma, J. Yew, L. Kennedy, and E. F. Churchill. Viral actions: Predicting video view counts using synchronous sharing behaviors. In ICWSM, 2011.

[55]

A. Sharif Razavian, H. Azizpour, J. Sullivan, and S. Carlsson. CNN features off-the-shelf: An astounding baseline for recognition. In CVPR workshops, 2014.

Digital Library

[56]

K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2014.

[57]

P. Singer, F. Flöck, C. Meinhart, E. Zeitfogel, and M. Strohmaier. Evolution of Reddit: From the front page of the internet to a self-referential community? In WWW, 2014.

Digital Library

[58]

H. Solomon and L. Herman. Status symbols and prosocial behavior: The effect of the victim's car on helping. The Journal of Psychology, 97(2), 1977.

[59]

G. Stoddard. Popularity dynamics and intrinsic quality in reddit and hacker news. In ICWSM, 2015.

[60]

B. Suh, L. Hong, P. Pirolli, and E. H. Chi. Want to be retweeted? Large scale analytics on factors impacting retweet in Twitter network. In SocialCom, 2010.

Digital Library

[61]

T. Sun, M. Zhang, and Q. Mei. Unexpected relevance: An empirical study of serendipity in retweets. In ICWSM, 2013.

[62]

C. Tan and L. Lee. All who wander: On the prevalence and characteristics of multi-community engagement. In WWW, 2015.

Digital Library

[63]

C. Tan, L. Lee, and B. Pang. The effect of wording on message propagation: Topic-and author-controlled natural experiments on Twitter. In ACL, 2014.

[64]

Theano Development Team. Theano: A Python framework for fast computation of mathematical expressions. arXiv e-prints, abs/1605.02688, May 2016.

[65]

O. Tsur and A. Rappoport. What's in a hashtag?: Content based prediction of the spread of ideas in microblogging communities. In WSDM, 2012.

Digital Library

[66]

T. Weninger, T. J. Johnston, and M. Glenski. Random voting effects in social-digital spaces: A case study of Reddit post submissions. In Hypertext, 2015.

Digital Library

[67]

B. Wu, W.-H. Cheng, Y. Zhang, and T. Mei. Time matters: Multi-scale temporalization of social media popularity. In ACM Multimedia, 2016.

Digital Library

[68]

K. Yamaguchi, T. L. Berg, and L. E. Ortiz. Chic or social: Visual popularity analysis in online fashion networks. In ACM Multimedia, 2014.

Digital Library

[69]

X. Yan, J. Guo, Y. Lan, and X. Cheng. A biterm topic model for short texts. In WWW, 2013.

Digital Library

[70]

S. Zakrewsky, K. Aryafar, and A. Shokoufandeh. Item popularity prediction in e-commerce using image quality feature vectors. arXiv preprint arXiv:1605.03663, 2016.

[71]

C. Zhong, D. Karamshuk, and N. Sastry. Predicting Pinterest: Automating a distributed human computation. In WWW, 2015.

Digital Library

Cited By

Kairam SBernstein MBruckman AChancellor SChandrasekharan EDe Choudhury MFiesler CLi HProferes NHorta Ribeiro MSmith CWeld GFarzan RLópez CCardoso Llach DQuercia DMustafa MNiu SWong-Villacrés M(2024)Community-Driven Models for Research on Social PlatformsCompanion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing10.1145/3678884.3687141(684-688)Online publication date: 11-Nov-2024
https://dl.acm.org/doi/10.1145/3678884.3687141
Efstratiou AChua TNgo CKa-Wei Lee RKumar RLauw H(2024)Deliberate Exposure to Opposing Views and Its Association with Behavior and Rewards on Political CommunitiesProceedings of the ACM Web Conference 202410.1145/3589334.3645375(2347-2358)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589334.3645375
Jeong DSon HChoi YKim K(2024)Enhancing social media post popularity prediction with visual contentJournal of the Korean Statistical Society10.1007/s42952-024-00270-753:3(844-882)Online publication date: 21-May-2024
https://doi.org/10.1007/s42952-024-00270-7
Show More Cited By

Index Terms

Cats and Captions vs. Creators and the Clock: Comparing Multimodal Content to Context in Predicting Relative Popularity
1. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Image search
  2. World Wide Web
    1. Web applications
      1. Social networks
    2. Web searching and information discovery
      1. Content ranking
      2. Social recommendation

Recommendations

Detecting and Characterizing Mental Health Related Self-Disclosure in Social Media
CHI EA '15: Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems

Self-disclosure is an important element facilitating improved psychological wellbeing in individuals with mental illness. As social media is increasingly adopted in health related discourse, we examine how these new platforms might be allowing honest ...
How attachment influences users willingness to donate to content creators in social media

As a relatively new behavior, donation to content creators in social media has become very popular in the last few years. Different from traditional donation to nonprofit organization or victims, donation to content creators in social media has received ...
Predicting influence in an online community of creators
CHI '10: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

This paper introduces the concept of Online Communities of Creators (OCOCs), which are a subset of social network sites in which the core activity is sharing personal, original creations. Next it defines two distinct types of influence, Project ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '17: Proceedings of the 26th International Conference on World Wide Web

April 2017

1678 pages

ISBN:9781450349130

General Chairs:
Rick Barrett
W3Events
,
Rick Cummings
Murdoch University
,
Program Chairs:
Eugene Agichtein
Emory University
,
Evgeniy Gabrilovich
Google Research

Copyright © 2017 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

Sponsors

IW3C2: International World Wide Web Conference Committee

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 03 April 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Science Foundation
Yahoo Inc.

Conference

WWW '17

Sponsor:

IW3C2

WWW '17: 26th International World Wide Web Conference

April 3 - 7, 2017

Perth, Australia

Acceptance Rates

WWW '17 Paper Acceptance Rate 164 of 966 submissions, 17%;

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
681
Total Downloads

Downloads (Last 12 months)112
Downloads (Last 6 weeks)29

Reflects downloads up to 19 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Kairam SBernstein MBruckman AChancellor SChandrasekharan EDe Choudhury MFiesler CLi HProferes NHorta Ribeiro MSmith CWeld GFarzan RLópez CCardoso Llach DQuercia DMustafa MNiu SWong-Villacrés M(2024)Community-Driven Models for Research on Social PlatformsCompanion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing10.1145/3678884.3687141(684-688)Online publication date: 11-Nov-2024
https://dl.acm.org/doi/10.1145/3678884.3687141
Efstratiou AChua TNgo CKa-Wei Lee RKumar RLauw H(2024)Deliberate Exposure to Opposing Views and Its Association with Behavior and Rewards on Political CommunitiesProceedings of the ACM Web Conference 202410.1145/3589334.3645375(2347-2358)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589334.3645375
Jeong DSon HChoi YKim K(2024)Enhancing social media post popularity prediction with visual contentJournal of the Korean Statistical Society10.1007/s42952-024-00270-753:3(844-882)Online publication date: 21-May-2024
https://doi.org/10.1007/s42952-024-00270-7
Henn TPosegga O(2023)Attention-grabbing news coverage: Violent images of the Black Lives Matter movement and how they attract user attention on RedditPLOS ONE10.1371/journal.pone.028896218:8(e0288962)Online publication date: 9-Aug-2023
https://doi.org/10.1371/journal.pone.0288962
Litterer S(2023)Conservative Anger and Police Misconduct: Exploring Conservative Discussion of Police on Social MediaDeviant Behavior10.1080/01639625.2023.2284284(1-15)Online publication date: 20-Nov-2023
https://doi.org/10.1080/01639625.2023.2284284
Mudgal PLiu F(2022)Are High-quality Photos More Popular Than Low-quality Ones? A Quantitative Study2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP)10.1109/MMSP55362.2022.9948731(1-5)Online publication date: 26-Sep-2022
https://doi.org/10.1109/MMSP55362.2022.9948731
Lin HLin JOple JChen JHua K(2022)Social Media Popularity Prediction Based on Multi-Modal Self-Attention MechanismsIEEE Access10.1109/ACCESS.2021.313655210(4448-4455)Online publication date: 2022
https://doi.org/10.1109/ACCESS.2021.3136552
Nezhad NMirtaheri SShahbazian R(2022)Popular image generation based on popularity measures by generative adversarial networksMultimedia Tools and Applications10.1007/s11042-022-14090-682:14(20873-20897)Online publication date: 7-Nov-2022
https://doi.org/10.1007/s11042-022-14090-6
Xia YZhu HLu TZhang PGu N(2020)Exploring Antecedents and Consequences of Toxicity in Online DiscussionsProceedings of the ACM on Human-Computer Interaction10.1145/34151794:CSCW2(1-23)Online publication date: 15-Oct-2020
https://dl.acm.org/doi/10.1145/3415179
Wang LLiu RVosoughi SGurrin CÞór Jónsson BKando NSchoeffmann KChen PO'Connor N(2020)Salienteye: Maximizing Engagement While Maintaining Artistic Style on Instagram Using Deep Neural NetworksProceedings of the 2020 International Conference on Multimedia Retrieval10.1145/3372278.3390736(331-335)Online publication date: 8-Jun-2020
https://dl.acm.org/doi/10.1145/3372278.3390736
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents