Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2783258.2783319acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Leveraging Social Context for Modeling Topic Evolution

Published: 10 August 2015 Publication History

Abstract

Topic discovery and evolution (TDE) has been a problem which has gained long standing interest in the research community. The goal in topic discovery is to identify groups of keywords from large corpora so that the information in those corpora are summarized succinctly. The nature of text corpora has changed dramatically in the past few years with the advent of social media. Social media services allow users to constantly share, follow and comment on posts from other users. Hence, such services have given a new dimension to the traditional text corpus. The new dimension being that today's corpora have a social context embedded in them in terms of the community of users interested in a particular post, their profiles etc. We wish to harness this social context that comes along with the textual content for TDE. In particular, our goal is to both qualitatively and quantitatively analyze when social context actually helps with TDE. Methodologically, we approach the problem of TDE by a proposing non-negative matrix factorization (NMF) based model that incorporates both the textual information and social context information. We perform experiments on large scale real world dataset of news articles, and use Twitter as the platform providing information about the social context of these news articles. We compare with and outperform several state-of-the-art baselines. Our conclusion is that using the social context information is most useful when faced with topics that are particularly difficult to detect.

Supplementary Material

MP4 File (p517.mp4)

References

[1]
What's in a hashtag?: content based prediction of the spread of ideas in microblogging communities. In WSDM, pages 643--652. ACM, 2012.
[2]
Loulwah AlSumait, Daniel Barbará, and Carlotta Domeniconi. Online-lda. ICDM '08, 2008.
[3]
Hila Becker, Mor Naaman, and Luis Gravano. Event identification in social media. In WebDB, 2009.
[4]
David M. Blei and Michael I. Jordan. Modeling annotated data. SIGIR '03, 2003.
[5]
David M. Blei and John D. Lafferty. Dynamic topic models. ICML '06, 2006.
[6]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, March 2003.
[7]
Jonathan Chang and David M Blei. Relational topic models for document networks. In AISTATS, 2009.
[8]
Guiguang Ding, Yuchen Guo, and Jile Zhou. Collective matrix factorization hashing for multimodal data. In CVPR 2014, pages 2083--2090, June 2014.
[9]
Khalid El-Arini, Min Xu, Emily B. Fox, and Carlos Guestrin. Representing documents through their readers. KDD '13, 2013.
[10]
Elena Erosheva, Stephen Fienberg, and John Lafferty. Mixed-membership models of scientific publications. Proceedings of the National Academy of Sciences, 101(suppl 1):5220--5227, 2004.
[11]
Noriaki Kawamae. Trend analysis model: Trend consists of temporal words, topics, and timestamps. WSDM '11, 2011.
[12]
Daniel D Lee and H Sebastian Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755):788--791, 1999.
[13]
Daniel D. Lee and H. Sebastian Seung. Algorithms for non-negative matrix factorization. In NIPS, 2000.
[14]
Julien Mairal, Francis Bach, Jean Ponce, and Guillermo Sapiro. Online learning for matrix factorization and sparse coding. J. Mach. Learn. Res., 11:19--60, March 2010.
[15]
Andrew McCallum, Xuerui Wang, and Andrés Corrada-Emmanuel. Topic and role discovery in social networks with experiments on enron and academic email. J. Artif. Int. Res., 30(1):249--272, October 2007.
[16]
Ramesh M. Nallapati, Amr Ahmed, Eric P. Xing, and William W. Cohen. Joint latent topic models for text and citations. KDD '08, 2008.
[17]
David Newman, Jey Han Lau, Karl Grieser, and Timothy Baldwin. Automatic evaluation of topic coherence. In Association for Computational Linguistics, 2010.
[18]
Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y Ng. Multimodal deep learning. In ICML-11, 2011.
[19]
Michal Rosen-Zvi, Thomas Griffiths, Mark Steyvers, and Padhraic Smyth. The author-topic model for authors and documents. UAI '04, 2004.
[20]
Diego Saez-Trumper, Carlos Castillo, and Mounia Lalmas. Social media news communities: Gatekeeping, coverage, and statement bias. CIKM '13, 2013.
[21]
Ankan Saha and Vikas Sindhwani. Learning evolving and emerging topics in social media: A dynamic nmf approach with temporal regularization. WSDM '12, 2012.
[22]
Martin Saveski and Amin Mantrach. Item cold-start recommendations: Learning local collective embeddings. RecSys '14, 2014.
[23]
Hassan Sayyadi, Matthew Hurst, and Alexey Maykov. Event detection and tracking in social streams. In ICWSM, 2009.
[24]
Y. Sekiguchi, H. Kawashima, H. Okuda, and M. Oku. In MDM 2006.
[25]
Ajit P. Singh and Geoffrey J. Gordon. Relational learning via collective matrix factorization. KDD '08, 2008.
[26]
Carmen K. Vaca, Amin Mantrach, Alejandro Jaimes, and Marco Saerens. A time-based collective factorization for topic discovery and monitoring in news. WWW '14, 2014.
[27]
Xuerui Wang and Andrew McCallum. Topics over time: A non-markov continuous-time model of topical trends. KDD '06, 2006.
[28]
Yu Wang, Eugene Agichtein, and Michele Benzi. Tm-lda: Efficient online modeling of latent topic transitions in social media. KDD '12, 2012.

Cited By

View all
  • (2024)Topicality boosts popularity: a comparative analysis of NYT articles and Reddit memesSocial Network Analysis and Mining10.1007/s13278-024-01272-314:1Online publication date: 23-Jun-2024
  • (2023)Real-Time Detection of COVID-19 Events From Twitter: A Spatial-Temporally Bursty-Aware MethodIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.316974210:2(656-672)Online publication date: Apr-2023
  • (2023)An embedding approach for analyzing the evolution of research topics with a case study on computer science subdomainsScientometrics10.1007/s11192-023-04642-4128:3(1567-1582)Online publication date: 31-Jan-2023
  • Show More Cited By

Index Terms

  1. Leveraging Social Context for Modeling Topic Evolution

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
    August 2015
    2378 pages
    ISBN:9781450336642
    DOI:10.1145/2783258
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 August 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. collective factorization
    2. social networks
    3. topic discovery
    4. topic monitoring
    5. topic tracking

    Qualifiers

    • Research-article

    Funding Sources

    • EC SUPER
    • National Science Foundation

    Conference

    KDD '15
    Sponsor:

    Acceptance Rates

    KDD '15 Paper Acceptance Rate 160 of 819 submissions, 20%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)21
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 23 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Topicality boosts popularity: a comparative analysis of NYT articles and Reddit memesSocial Network Analysis and Mining10.1007/s13278-024-01272-314:1Online publication date: 23-Jun-2024
    • (2023)Real-Time Detection of COVID-19 Events From Twitter: A Spatial-Temporally Bursty-Aware MethodIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.316974210:2(656-672)Online publication date: Apr-2023
    • (2023)An embedding approach for analyzing the evolution of research topics with a case study on computer science subdomainsScientometrics10.1007/s11192-023-04642-4128:3(1567-1582)Online publication date: 31-Jan-2023
    • (2022)Social Media Event Prediction using DNN with Feedback MechanismACM Transactions on Management Information Systems10.1145/352275913:3(1-24)Online publication date: 14-May-2022
    • (2021)BiTTM: A Core Biterms-Based Topic Model for Targeted AnalysisApplied Sciences10.3390/app11211016211:21(10162)Online publication date: 29-Oct-2021
    • (2021)Towards Study of Research Topics Evolution in Artificial Intelligence based on Topic Embedding2021 11th International Conference on Computer Engineering and Knowledge (ICCKE)10.1109/ICCKE54056.2021.9721503(406-411)Online publication date: 28-Oct-2021
    • (2021)Online Latent Dirichlet Allocation Model Based on Sentiment Polarity Time SeriesWuhan University Journal of Natural Sciences10.1051/wujns/202126646426:6(464-472)Online publication date: 17-Dec-2021
    • (2021)Global Agendas: Detection of Agenda Shifts in Cross-National Discussions Using Neural-Network Text Summarization for TwitterSocial Computing and Social Media: Experience Design and Social Network Analysis10.1007/978-3-030-77626-8_15(221-239)Online publication date: 3-Jul-2021
    • (2020)TermBall: Tracking and Predicting Evolution Types of Research Topics by Using Knowledge Structures in Scholarly Big DataIEEE Access10.1109/ACCESS.2020.30009488(108514-108529)Online publication date: 2020
    • (2020)Understanding hierarchical structural evolution in a scientific discipline: A case study of artificial intelligenceJournal of Informetrics10.1016/j.joi.2020.10104714:3(101047)Online publication date: Aug-2020
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media