Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3038912.3052684acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Public Access

Cats and Captions vs. Creators and the Clock: Comparing Multimodal Content to Context in Predicting Relative Popularity

Published: 03 April 2017 Publication History

Abstract

The content of today's social media is becoming more and more rich, increasingly mixing text, images, videos, and audio. It is an intriguing research question to model the interplay between these different modes in attracting user attention and engagement. But in order to pursue this study of multimodal content, we must also account for context: timing effects, community preferences, and social factors (e.g., which authors are already popular) also affect the amount of feedback and reaction that social-media posts receive. In this work, we separate out the influence of these non-content factors in several ways. First, we focus on ranking pairs of submissions posted to the same community in quick succession, e.g., within 30 seconds; this framing encourages models to focus on time-agnostic and community-specific content features. Within that setting, we determine the relative performance of author vs. content features. We find that victory usually belongs to "cats and captions," as visual and textual features together tend to outperform identity-based features. Moreover, our experiments show that when considered in isolation, simple unigram text features and deep neural network visual features yield the highest accuracy individually, and that the combination of the two modalities generally leads to the best accuracies overall.

References

[1]
K. Almgren, J. Lee, and M. Kim. Predicting the future popularity of images on social networks. In MISNC, 2016.
[2]
T. Althoff, C. Danescu-Niculescu-Mizil, and D. Jurafsky. How to ask for a favor: A case study on the success of altruistic requests. In ICWSM, 2014.
[3]
Y. Artzi, P. Pantel, and M. Gamon. Predicting responses to microblog posts. In NAACL, 2012.
[4]
S. Bakhshi and E. Gilbert. Red, purple and pink: The colors of diffusion on Pinterest. PloS One, 2015.
[5]
S. Bakhshi, D. A. Shamma, and E. Gilbert. Faces engage us: Photos with faces attract more likes and comments on instagram. In CHI, 2014.
[6]
E. Bakshy, J. M. Hofman, W. A. Mason, and D. J. Watts. Everyone's an influencer: Quantifying influence on twitter. In WSDM, 2011.
[7]
R. Bandari, S. Asur, and B. A. Huberman. The pulse of news in social media: Forecasting popularity. In ICWSM, 2012.
[8]
Y. Borghol, S. Ardon, N. Carlsson, D. Eager, and A. Mahanti. The untold story of the clones: Content-agnostic factors that impact YouTube video popularity. In KDD, 2012.
[9]
G. Bradski. OpenCV library. Dr. Dobb's Journal of Software Tools, 2000.
[10]
C. Buciluǎ, R. Caruana, and A. Niculescu-Mizil. Model compression. In KDD, pages 535--541. ACM, 2006.
[11]
C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. Learning to rank using gradient descent. In ICML, 2005.
[12]
J. Chen. Multi-modal learning: Study on a large-scale micro-video data collection. In ACM Multimedia, 2016.
[13]
F. Chollet. Keras. https://github.com/fchollet/keras, 2015.
[14]
N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, 2005.
[15]
C. Danescu-Niculescu-Mizil, J. Cheng, J. Kleinberg, and L. Lee. You had me at hello: How phrasing affects memorability. In ACL, 2012.
[16]
C. Danescu-Niculescu-Mizil, R. West, D. Jurafsky, J. Leskovec, and C. Potts. No country for old members: User lifecycle and linguistic change in online communities. In WWW, 2013.
[17]
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
[18]
A. Deza and D. Parikh. Understanding image virality. In CVPR, 2015.
[19]
G. Dror, D. Pelleg, O. Rokhlenko, and I. Szpektor. Churn prediction in new users of yahoo! answers. In WWW, 2012.
[20]
F. Figueiredo. On the prediction of popularity of trends and hits for user generated videos. In WSDM, 2013.
[21]
F. Figueiredo, J. M. Almeida, F. Benevenuto, and K. P. Gummadi. Does content determine information popularity in social media?: A case study of YouTube videos' content and their popularity. In CHI, pages 979--982. ACM, 2014.
[22]
F. Gelli, T. Uricchio, M. Bertini, A. Del Bimbo, and S.-F. Chang. Image popularity prediction in social media using sentiment and context features. In ACM Multimedia, 2015.
[23]
E. Gilbert. Widespread underprovision on Reddit. In CSCW, 2013.
[24]
A. Graves and J. Schmidhuber. Framewise phoneme classification with bidirectional LS™ and other neural network architectures. Neural Networks, 2005.
[25]
M. Guerini, C. Strapparava, and G. Özbal. Exploring text virality in social networks. In ICWSM, 2011.
[26]
K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In CVPR, 2016.
[27]
R. Herbrich, T. Graepel, and K. Obermayer. Large margin rank boundaries for ordinal regression. In NIPS, 1999.
[28]
J. Hessel, C. Tan, and L. Lee. Science, AskScience, and BadScience: On the coexistence of highly related communities. In ICWSM, 2016.
[29]
F. Hill, K. Cho, and A. Korhonen. Learning distributed representations of sentences from unlabelled data. In NAACL, 2016.
[30]
S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8), 1997.
[31]
L. Hong, O. Dan, and B. D. Davison. Predicting popular messages in Twitter. In WWW companion volume, 2011.
[32]
J. Hu, T. Yamasaki, and K. Aizawa. Multimodal learning for image popularity prediction on social media. In Consumer Electronics-Taiwan, 2016.
[33]
M. Iyyer, V. Manjunatha, J. Boyd-Graber, and H. Daumé III. Deep unordered composition rivals syntactic methods for text classification. In ACL, 2015.
[34]
A. Jaech, V. Zayats, H. Fang, M. Ostendorf, and H. Hajishirzi. Talking to the crowd: What do people react to in online discussions? In EMNLP, 2015.
[35]
T. Joachims. Optimizing search engines using clickthrough data. In KDD, 2002.
[36]
R. Khan, J. Van de Weijer, F. Shahbaz Khan, D. Muselet, C. Ducottet, and C. Barat. Discriminative color descriptors. In CVPR, 2013.
[37]
A. Khosla, A. Das Sarma, and R. Hamid. What makes an image popular? In WWW, 2014.
[38]
A. Khosla, J. Xiao, A. Torralba, and A. Oliva. Memorability of image regions. In NIPS, 2012.
[39]
H. Lakkaraju, J. J. McAuley, and J. Leskovec. What's in a name? understanding the interplay between titles, content, and communities in social media. In ICWSM, 2013.
[40]
C. Lampe and P. Resnick. Slash (dot) and burn: distributed moderation in a large online conversation space. In CHI, 2004.
[41]
M. Lee, S. H. Jin, and D. Mimno. Beyond exchangeability: The Chinese voting process. In NIPS, 2016.
[42]
K. Lerman and T. Hogg. Using a model of social dynamics to predict popularity of news. In WWW, 2010.
[43]
C. Lynch, K. Aryafar, and J. Attenberg. Images don't lie: Transferring deep visual semantic features to large-scale multimodal learning to rank. In KDD, 2016.
[44]
Z. Ma, A. Sun, and G. Cong. Will this#hashtag be popular tomorrow? In SIGIR, 2012.
[45]
M. Mazloom, R. Rietveld, S. Rudinac, M. Worring, and W. van Dolen. Multimodal popularity prediction of brand-related social media posts. In ACM Multimedia, 2016.
[46]
P. J. McParlane, Y. Moshfeghi, and J. M. Jose. Nobody comes here anymore, it's too crowded; Predicting image popularity on Flickr. In Multimedia Retrieval, 2014.
[47]
A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV, 2001.
[48]
J. Pennington, R. Socher, and C. D. Manning. GloVe: Global vectors for word representation. In EMNLP, 2014.
[49]
S. Petrovic, M. Osborne, and V. Lavrenko. RT to win! Predicting message propagation in Twitter. In ICWSM, 2011.
[50]
H. Pinto, J. M. Almeida, and M. A. Gonçalves. Using early view patterns to predict the popularity of YouTube videos. In WSDM, 2013.
[51]
D. M. Romero, C. Tan, and J. Ugander. On the interplay between social and topical structure. In ICWSM, 2013.
[52]
M. J. Salganik, P. S. Dodds, and D. J. Watts. Experimental study of inequality and unpredictability in an artificial cultural market. Science, 2006.
[53]
R. Schifanella, M. Redi, and L. Aiello. An image is worth more than a thousand favorites: Surfacing the hidden beauty of Flickr pictures. In ICWSM, 2015.
[54]
D. A. Shamma, J. Yew, L. Kennedy, and E. F. Churchill. Viral actions: Predicting video view counts using synchronous sharing behaviors. In ICWSM, 2011.
[55]
A. Sharif Razavian, H. Azizpour, J. Sullivan, and S. Carlsson. CNN features off-the-shelf: An astounding baseline for recognition. In CVPR workshops, 2014.
[56]
K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2014.
[57]
P. Singer, F. Flöck, C. Meinhart, E. Zeitfogel, and M. Strohmaier. Evolution of Reddit: From the front page of the internet to a self-referential community? In WWW, 2014.
[58]
H. Solomon and L. Herman. Status symbols and prosocial behavior: The effect of the victim's car on helping. The Journal of Psychology, 97(2), 1977.
[59]
G. Stoddard. Popularity dynamics and intrinsic quality in reddit and hacker news. In ICWSM, 2015.
[60]
B. Suh, L. Hong, P. Pirolli, and E. H. Chi. Want to be retweeted? Large scale analytics on factors impacting retweet in Twitter network. In SocialCom, 2010.
[61]
T. Sun, M. Zhang, and Q. Mei. Unexpected relevance: An empirical study of serendipity in retweets. In ICWSM, 2013.
[62]
C. Tan and L. Lee. All who wander: On the prevalence and characteristics of multi-community engagement. In WWW, 2015.
[63]
C. Tan, L. Lee, and B. Pang. The effect of wording on message propagation: Topic-and author-controlled natural experiments on Twitter. In ACL, 2014.
[64]
Theano Development Team. Theano: A Python framework for fast computation of mathematical expressions. arXiv e-prints, abs/1605.02688, May 2016.
[65]
O. Tsur and A. Rappoport. What's in a hashtag?: Content based prediction of the spread of ideas in microblogging communities. In WSDM, 2012.
[66]
T. Weninger, T. J. Johnston, and M. Glenski. Random voting effects in social-digital spaces: A case study of Reddit post submissions. In Hypertext, 2015.
[67]
B. Wu, W.-H. Cheng, Y. Zhang, and T. Mei. Time matters: Multi-scale temporalization of social media popularity. In ACM Multimedia, 2016.
[68]
K. Yamaguchi, T. L. Berg, and L. E. Ortiz. Chic or social: Visual popularity analysis in online fashion networks. In ACM Multimedia, 2014.
[69]
X. Yan, J. Guo, Y. Lan, and X. Cheng. A biterm topic model for short texts. In WWW, 2013.
[70]
S. Zakrewsky, K. Aryafar, and A. Shokoufandeh. Item popularity prediction in e-commerce using image quality feature vectors. arXiv preprint arXiv:1605.03663, 2016.
[71]
C. Zhong, D. Karamshuk, and N. Sastry. Predicting Pinterest: Automating a distributed human computation. In WWW, 2015.

Cited By

View all
  • (2024)Community-Driven Models for Research on Social PlatformsCompanion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing10.1145/3678884.3687141(684-688)Online publication date: 11-Nov-2024
  • (2024)Deliberate Exposure to Opposing Views and Its Association with Behavior and Rewards on Political CommunitiesProceedings of the ACM Web Conference 202410.1145/3589334.3645375(2347-2358)Online publication date: 13-May-2024
  • (2024)Enhancing social media post popularity prediction with visual contentJournal of the Korean Statistical Society10.1007/s42952-024-00270-753:3(844-882)Online publication date: 21-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '17: Proceedings of the 26th International Conference on World Wide Web
April 2017
1678 pages
ISBN:9781450349130

Sponsors

  • IW3C2: International World Wide Web Conference Committee

In-Cooperation

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 03 April 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. image processing
  2. language modeling
  3. multimodal
  4. reddit
  5. social media

Qualifiers

  • Research-article

Funding Sources

Conference

WWW '17
Sponsor:
  • IW3C2

Acceptance Rates

WWW '17 Paper Acceptance Rate 164 of 966 submissions, 17%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)112
  • Downloads (Last 6 weeks)29
Reflects downloads up to 19 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Community-Driven Models for Research on Social PlatformsCompanion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing10.1145/3678884.3687141(684-688)Online publication date: 11-Nov-2024
  • (2024)Deliberate Exposure to Opposing Views and Its Association with Behavior and Rewards on Political CommunitiesProceedings of the ACM Web Conference 202410.1145/3589334.3645375(2347-2358)Online publication date: 13-May-2024
  • (2024)Enhancing social media post popularity prediction with visual contentJournal of the Korean Statistical Society10.1007/s42952-024-00270-753:3(844-882)Online publication date: 21-May-2024
  • (2023)Attention-grabbing news coverage: Violent images of the Black Lives Matter movement and how they attract user attention on RedditPLOS ONE10.1371/journal.pone.028896218:8(e0288962)Online publication date: 9-Aug-2023
  • (2023)Conservative Anger and Police Misconduct: Exploring Conservative Discussion of Police on Social MediaDeviant Behavior10.1080/01639625.2023.2284284(1-15)Online publication date: 20-Nov-2023
  • (2022)Are High-quality Photos More Popular Than Low-quality Ones? A Quantitative Study2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP)10.1109/MMSP55362.2022.9948731(1-5)Online publication date: 26-Sep-2022
  • (2022)Social Media Popularity Prediction Based on Multi-Modal Self-Attention MechanismsIEEE Access10.1109/ACCESS.2021.313655210(4448-4455)Online publication date: 2022
  • (2022)Popular image generation based on popularity measures by generative adversarial networksMultimedia Tools and Applications10.1007/s11042-022-14090-682:14(20873-20897)Online publication date: 7-Nov-2022
  • (2020)Exploring Antecedents and Consequences of Toxicity in Online DiscussionsProceedings of the ACM on Human-Computer Interaction10.1145/34151794:CSCW2(1-23)Online publication date: 15-Oct-2020
  • (2020)Salienteye: Maximizing Engagement While Maintaining Artistic Style on Instagram Using Deep Neural NetworksProceedings of the 2020 International Conference on Multimedia Retrieval10.1145/3372278.3390736(331-335)Online publication date: 8-Jun-2020
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media