Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2528394.2528396acmotherconferencesArticle/Chapter ViewAbstractPublication PageshcirConference Proceedingsconference-collections
research-article

Are Some Tweets More Interesting Than Others? #HardQuestion

Published: 03 October 2013 Publication History

Abstract

Twitter has evolved into a significant communication nexus, coupling personal and highly contextual utterances with local news, memes, celebrity gossip, headlines, and other microblogging subgenres. If we take Twitter as a large and varied dynamic collection, how can we predict which tweets will be interesting to a broad audience in advance of lagging social indicators of interest such as retweets? The telegraphic form of tweets, coupled with the subjective notion of interestingness, makes it difficult for human judges to agree on which tweets are indeed interesting.
In this paper, we address two questions: Can we develop a reliable strategy that results in high-quality labels for a collection of tweets, and can we use this labeled collection to predict a tweet's interestingness? To answer the first question, we performed a series of studies using crowdsourcing to reach a diverse set of workers who served as a proxy for an audience with variable interests and perspectives. This method allowed us to explore different labeling strategies, including varying the judges, the labels they applied, the datasets, and other aspects of the task. To address the second question, we used crowdsourcing to assemble a set of tweets rated as interesting or not; we scored these tweets using textual and contextual features; and we used these scores as inputs to a binary classifier. We were able to achieve moderate agreement (κ = 0.52) between the best classifier and the human assessments, a figure which reflects the challenges of the judgment task.

References

[1]
Alonso, O., Carson, C., Gerster, D., Ji, X., and Nabar, S. U. Detecting Uninteresting Content in Text Streams. In CSE (July 2010), 39--42.
[2]
André, P., Bernstein, M. S., and Luther, K. Who gives a tweet?: evaluating microblog content value. In CSCW (2012), 471--474.
[3]
Aroyo, L., and Welty, C. Harnessing disagreement in crowdsourcing a relation extraction gold standard. Tech. Rep. RC25371 (WAT1304-058), IBM Research, 2013.
[4]
Bowker, G. C., and Star, S. L. Sorting Things Out: Classification and Its Consequences. MIT Press, 1999.
[5]
Boyd, D., Golder, S., and Lotan, G. Tweet, tweet, retweet: Conversational aspects of retweeting on twitter. In HICSS (2010), 1--10.
[6]
Choudhary, A. N., Hendrix, W., Lee, K., Palsetia, D., and Liao, W.-K. Social media evolution of the Egyptian revolution. CACM 55, 5 (2012), 74--80.
[7]
Colton, S., and Bundy, A. On the notion of interestingness in automated mathematical creativity. In AISB (1999), 82--91.
[8]
De Choudhury, M., Diakopoulos, N., and Naaman, M. Unfolding the event landscape on twitter: classification and exploration of user categories. In CSCW (2012), 241--244.
[9]
Duan, Y., Jiang, L., Qin, T., Zhou, M., and Shum, H.-Y. An empirical study on learning to rank of tweets. In COLING (2010), 295--303.
[10]
Erdelez, S. Information encountering: a conceptual framework for accidental information discovery. In ISIC (1997), 412--421.
[11]
Fleiss, J. L. Measuring nominal scale agreement among many raters. Psychol.l Bull. 76, 5 (1971), 378--382.
[12]
Gaffney, D. #iranelection: quantifying online activism. In WebSci10 (2010).
[13]
Gayo-Avello, D. Nepotistic relationships in twitter and their impact on rank prestige algorithms. IPM (2013), 1250--1280.
[14]
Hughes, A. L., and Palen, L. Twitter adoption and use in mass convergence and emergency events. IJEM 6, 3/4 (2009), 248--260.
[15]
Hurlock, J., and Wilson, M. L. Searching twitter: Separating the tweet from the chaff. In ICWSM (2011).
[16]
Krippendorff, K. Content Analysis, 2nd ed. Sage Publications, 2003.
[17]
Lempel, R., and Moran, S. Salsa: the stochastic approach for link-structure analysis. TOIS 19, 2 (2001), 131--160.
[18]
Lewis, P. Reading the riots: Investigating England's summer of disorder. The Guardian, September 5, 2011.
[19]
Lin, T., Oren, Etzioni, and Fogarty, J. Identifying interesting assertions from the web. In CIKM (2009), 1787--1790.
[20]
Marcus, A., Bernstein, M., Badar, O., Karger, D., Madden, S., and Miller, R. Twitinfo: aggregating and visualizing microblogs for event exploration. In CHI (2011), 227--236.
[21]
Metzler, D., and Cai, C. USC/ISI at TREC 2011: Microblog track. In TREC (2011).
[22]
Momeni, E., Tao, K., Haslhofer, B., and Houben, G.-J. Identification of useful user comments in social media: A case study on Flickr commons. In JCDL (2013), 1--10.
[23]
Morris, M. R., Counts, S., Roseway, A., Hoff, A., and Schwarz, J. Tweeting is believing?: understanding microblog credibility perceptions. In CSCW (2012), 441--450.
[24]
Naaman, M., Becker, H., and Gravano, L. Hip and trendy: characterizing emerging trends on twitter. JASIST 62, 5 (2011), 902--918.
[25]
Poblete, B., Garcia, R., Mendoza, M., and Jaimes, A. Do all birds tweet the same? characterizing twitter around the world. In CIKM (2011), 1025--1030.
[26]
Robertson, S. E., Walker, S., Jones, S., Hancock-Beaulieu, M., and Gatford, M. Okapi at TREC-3. In TREC (1994).
[27]
Sakaki, T., Okazaki, M., and Matsuo, Y. Earthquake shakes twitter users: real-time event detection by social sensors. In WWW (2010), 851--860.
[28]
Silvia, P. J. What is interesting? exploring the appraisal structure of interest. Emotion 5 (2005), 89--102.
[29]
Strauss, A., and Corbin, J. Basics of Qualitative Research. Sage Publications, 1998.
[30]
Uysal, I., and Croft, W. B. User oriented tweet ranking: a filtering approach to microblogs. In CIKM (2011), 2261--2264.
[31]
Yardi, S., Romero, D., Schoenebeck, G., and Boyd, D. Detecting spam in a twitter network. First Monday 15, 1 (2010).
[32]
Zhao, D., and Rosson, M. B. How and why people twitter: the role that micro-blogging plays in informal communication at work. In GROUP (2009), 243--252.
[33]
Zubiaga, A., Spina, D., Fresno, V., and Martínez, R. Classifying trending topics: a typology of conversation triggers on twitter. In CIKM (2011), 2461--2464.

Cited By

View all
  • (2024)The State of Pilot Study Reporting in Crowdsourcing: A Reflection on Best Practices and GuidelinesProceedings of the ACM on Human-Computer Interaction10.1145/36410238:CSCW1(1-45)Online publication date: 26-Apr-2024
  • (2021)Accelerating Deductive Coding of Qualitative Data: An Experimental Study on the Applicability of CrowdsourcingProceedings of Mensch und Computer 202110.1145/3473856.3473873(432-443)Online publication date: 5-Sep-2021
  • (2020)Connecting Mayors: The Content and Formation of Twitter Information NetworksUrban Affairs Review10.1177/107808742094718258:1(33-67)Online publication date: 10-Aug-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
HCIR '13: Proceedings of the Symposium on Human-Computer Interaction and Information Retrieval
October 2013
52 pages
ISBN:9781450325707
DOI:10.1145/2528394
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

In-Cooperation

  • FX Palo Alto Laboratory, Inc.: FX Palo Alto Laboratory, Inc.
  • Microsoft Research: Microsoft Research
  • ACM SIGIR: Special Interest Group on Information Retrieval
  • GRAND: Animation and NewMedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 October 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Twitter
  2. crowdsourcing
  3. interestingness
  4. label quality

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

HCIR '13

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)The State of Pilot Study Reporting in Crowdsourcing: A Reflection on Best Practices and GuidelinesProceedings of the ACM on Human-Computer Interaction10.1145/36410238:CSCW1(1-45)Online publication date: 26-Apr-2024
  • (2021)Accelerating Deductive Coding of Qualitative Data: An Experimental Study on the Applicability of CrowdsourcingProceedings of Mensch und Computer 202110.1145/3473856.3473873(432-443)Online publication date: 5-Sep-2021
  • (2020)Connecting Mayors: The Content and Formation of Twitter Information NetworksUrban Affairs Review10.1177/107808742094718258:1(33-67)Online publication date: 10-Aug-2020
  • (2019)The Practice of CrowdsourcingSynthesis Lectures on Information Concepts, Retrieval, and Services10.2200/S00904ED1V01Y201903ICR06611:1(1-149)Online publication date: 28-May-2019
  • (2018)Identifying Retweetable Tweets with a Personalized Global ClassifierProceedings of the 10th Hellenic Conference on Artificial Intelligence10.1145/3200947.3201019(1-8)Online publication date: 9-Jul-2018
  • (2018)Microblog Analysis as a Program of WorkACM Transactions on Social Computing10.1145/31629561:1(1-40)Online publication date: 18-Jan-2018
  • (2018)ConsensUsACM Transactions on Social Computing10.1145/31596491:1(1-26)Online publication date: 18-Jan-2018
  • (2018)Quantifying Controversy on Social MediaACM Transactions on Social Computing10.1145/31405651:1(1-27)Online publication date: 18-Jan-2018
  • (2018)Online Sequencing of Non-Decomposable Macrotasks in Expert CrowdsourcingACM Transactions on Social Computing10.1145/31404591:1(1-33)Online publication date: 10-Jan-2018
  • (2018)Dissemination of evidence in paediatric emergency medicine: a quantitative descriptive evaluation of a 16-week social media promotionBMJ Open10.1136/bmjopen-2018-0222988:6(e022298)Online publication date: 6-Jun-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media