Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3405962.3405998acmotherconferencesArticle/Chapter ViewAbstractPublication PageswimsConference Proceedingsconference-collections
research-article

TRIAGE: Temporal Twitter attribute graph patterns

Published: 24 August 2020 Publication History

Abstract

Given a node-attributed network of Twitter users, can we capture their posting behavior over time and identify patterns that could probably describe, model or predict their activity? Based on the assumption that the posts of these users are topic-specific, can we identify temporal connectivity patterns that emerge from the use of specific attributes? More challengingly, are there any particular attribute usage patterns which indicate an inherent anomaly either for users or attributes? Our study attempts to provide solid answers to all the above questions, extending previous work on other social networks and attribute types. We propose TRIAGE, a pipeline of methods which: (a) identify temporal behavioral patterns in individual attribute distributions, (b) model the temporal evolution of attribute induced graphs and (c) detect irregular attributes and users based on the patterns identified earlier; More specifically, we model the attribute distributions using the log-Odds ratio, we provide explanations with respect to the attribute induced subgraph patterns and we observe the structural differences of attribute induced subgraphs based on these patterns. Experimental results show that: most of the individual attribute distributions remain stable over time following mostly power laws norm; the temporal evolution of attribute induced graphs obey certain laws and deviations are outliers; finally, we discover that we can indeed identify the structure of each subgraph, based on the emerging patterns. Real dataset experiments on 50K Twitter users activities and attributes has successfully proven that TRIAGE has effectively identified Twitter user and attribute behavioral patterns and can identify irregular activities for users and anomalous graph structures for attribute induced subgraphs.

References

[1]
Leman Akoglu, Rishi Chandy, and Christos Faloutsos. 2013. Opinion Fraud Detection in Online Reviews by Network Effects. In ICWSM. The AAAI Press.
[2]
Leman Akoglu, Mary McGlohon, and Christos Faloutsos. 2010. oddball: Spotting Anomalies in Weighted Graphs. In PAKDD (2) (Lecture Notes in Computer Science, Vol. 6119). Springer, 410--421.
[3]
Leman Akoglu, Hanghang Tong, and Danai Koutra. 2015. Graph based anomaly detection and description: a survey. Data mining and knowledge discovery 29, 3 (2015), 626--688.
[4]
Faiyaz Al Zamal, Wendy Liu, and Derek Ruths. 2012. Homophily and latent attribute inference: Inferring latent attributes of twitter users from neighbors. In Sixth International AAAI Conference on Weblogs and Social Media.
[5]
Albert-Laszlo Barabasi. 2005. The origin of bursts and heavy tails in human dynamics. Nature 435, 7039 (2005), 207--211.
[6]
David R Bild, Yue Liu, Robert P Dick, Z Morley Mao, and Dan S Wallach. 2015. Aggregate characterization of user behavior in twitter and analysis of the retweet graph. ACM Transactions on Internet Technology (TOIT) 15, 1 (2015), 1--24.
[7]
PV Bindu, Rahul Mishra, and P Santhi Thilagam. 2018. Discovering spammer communities in Twitter. Journal of Intelligent Information Systems 51, 3 (2018), 503--527.
[8]
Deepayan Chakrabarti and Christos Faloutsos. 2006. Graph mining: Laws, generators, and algorithms. ACM computing surveys (CSUR) 38, 1 (2006), 2-es.
[9]
Nikan Chavoshi, Hossein Hamooni, and Abdullah Mueen. 2016. DeBot: Twitter Bot Detection via Warped Correlation. In ICDM. 817--822.
[10]
Daniel YT Chino, Alceu F Costa, Agma JM Traina, and Christos Faloutsos. 2017. VolTime: Unsupervised Anomaly Detection on Users' Online Activity Volume. In Proceedings of the 2017 SIAM International Conference on Data Mining. SIAM, 108--116.
[11]
Aaron Clauset, Cosma Rohilla Shalizi, and Mark EJ Newman. 2009. Power-law distributions in empirical data. SIAM review 51, 4 (2009), 661--703.
[12]
David Crandall, Dan Cosley, Daniel Huttenlocher, Jon Kleinberg, and Siddharth Suri. 2008. Feedback effects between similarity and social influence in online communities. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 160--168.
[13]
Munmun De Choudhury, Scott Counts, and Eric Horvitz. 2013. Major life changes and behavioral markers in social media: case of childbirth. In Proceedings of the 2013 conference on Computer supported cooperative work. 1431--1442.
[14]
Munmun De Choudhury, Scott Counts, Eric J Horvitz, and Aaron Hoff. 2014. Characterizing and predicting postpartum depression from shared facebook data. In Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing. 626--638.
[15]
Pedro OS Vaz De Melo, Leman Akoglu, Christos Faloutsos, and Antonio AF Loureiro. 2010. Surprising patterns for the call duration distribution of mobile phone users. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 354--369.
[16]
Pravallika Devineni, Danai Koutra, Michalis Faloutsos, and Christos Faloutsos. 2015. If walls could talk: Patterns and anomalies in Facebook wallposts. In ASONAM. ACM, 367--374.
[17]
Dhivya Eswaran, Reihaneh Rabbany, Artur W Dubrawski, and Christos Faloutsos. [n.d.]. Social-Affiliation Networks: Patterns and the SOAR Model. ([n. d.]).
[18]
Michalis Faloutsos, Petros Faloutsos, and Christos Faloutsos. 1999. On power-law relationships of the internet topology. ACM SIGCOMM computer communication review 29, 4 (1999), 251--262.
[19]
Emilio Ferrara, Onur Varol, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2016. The rise of social bots. Commun. ACM 59, 7 (2016), 96--104.
[20]
Saptarshi Ghosh, Bimal Viswanath, Farshad Kooti, Naveen Kumar Sharma, Gautam Korlam, Fabricio Benevenuto, Niloy Ganguly, and Krishna Phani Gummadi. 2012. Understanding and combating link farming in the twitter social network. In Proceedings of the 21st international conference on World Wide Web. ACM, 61--70.
[21]
Neil Zhenqiang Gong, Ameet Talwalkar, Lester Mackey, Ling Huang, Eui Chul Richard Shin, Emil Stefanov, Elaine Shi, and Dawn Song. 2014. Joint link prediction and attribute inference using a social-attribute network. ACM Transactions on Intelligent Systems and Technology (TIST) 5, 2 (2014), 1--20.
[22]
Lei Guo, Enhua Tan, Songqing Chen, Xiaodong Zhang, and Yihong Zhao. 2009. Analyzing patterns of user content generation in online social networks. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. 369--378.
[23]
Bryan Hooi, Neil Shah, Alex Beutel, Stephan Günnemann, Leman Akoglu, Mohit Kumar, Disha Makhija, and Christos Faloutsos. 2016. Birdnest: Bayesian inference for ratings-fraud detection. In Proceedings of the 2016 SIAM International Conference on Data Mining. SIAM, 495--503.
[24]
Meng Jiang, Peng Cui, Alex Beutel, Christos Faloutsos, and Shiqiang Yang. 2014. CatchSync: catching synchronized behavior in large directed graphs. In KDD. ACM, 941--950.
[25]
Myunghwan Kim and Jure Leskovec. 2011. Modeling social networks with node attributes using the multiplicative attribute graph model. arXiv preprint arXiv:1106.5053 (2011).
[26]
Myunghwan Kim and Jure Leskovec. 2012. Multiplicative attribute graph model of real-world networks. Internet mathematics 8, 1-2 (2012), 113--160.
[27]
Danai Koutra, Vasileios Koutras, B Aditya Prakash, and Christos Faloutsos. 2013. Patterns amongst competing task frequencies: Super-linearities, and the almonddg model. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 201--212.
[28]
Timothy La Fond and Jennifer Neville. 2010. Randomization tests for distinguishing social influence and homophily effects. In Proceedings of the 19th international conference on World wide web. ACM, 601--610.
[29]
Jure Leskovec, Deepayan Chakrabarti, Jon Kleinberg, Christos Faloutsos, and Zoubin Ghahramani. 2010. Kronecker graphs: An approach to modeling networks. Journal of Machine Learning Research 11, Feb (2010), 985--1042.
[30]
Jure Leskovec, Jon M. Kleinberg, and Christos Faloutsos. 2005. Graphs over time: densification laws, shrinking diameters and possible explanations. In KDD. ACM, 177--187.
[31]
Tetyana Lokot and Nicholas Diakopoulos. 2016. News Bots: Automating news and information dissemination on Twitter. Digital Journalism 4, 6 (2016), 682--699.
[32]
Bryan Perozzi and Leman Akoglu. 2016. Scalable anomaly ranking of attributed neighborhoods. In Proceedings of the 2016 SIAM International Conference on Data Mining. SIAM, 207--215.
[33]
Joseph J Pfeiffer III, Sebastian Moreno, Timothy La Fond, Jennifer Neville, and Brian Gallagher. 2014. Attributed graph models: Modeling network structure with correlated attributes. In Proceedings of the 23rd international conference on World wide web. 831--842.
[34]
Venkata Krishna Pillutla, Zhanpeng Fang, Pravallika Devineni, Christos Faloutsos, Danai Koutra, and Jie Tang. 2016. On Skewed Multi-dimensional Distributions: the FusionRP Model, Algorithms, and Discoveries. In Proceedings of the 2016 SIAM International Conference on Data Mining. SIAM, 783--791.
[35]
Mukund Seshadri, Sridhar Machiraju, Ashwin Sridharan, Jean Bolot, Christos Faloutsos, and Jure Leskove. 2008. Mobile call graphs: beyond power-law and lognormal distributions. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining. 596--604.
[36]
Neil Shah, Alex Beutel, Brian Gallagher, and Christos Faloutsos. 2014. Spotting Suspicious Link Behavior with fBox: An Adversarial Perspective. CoRR abs/1410.3915 (2014).
[37]
Kurt Thomas, Damon McCoy, Chris Grier, Alek Kolcz, and Vern Paxson. 2013. Trafficking Fraudulent Accounts: The Role of the Underground Market in Twitter Spam and Abuse. In USENIX Security Symposium. 195--210.
[38]
Onur Varol, Emilio Ferrara, Clayton A Davis, Filippo Menczer, and Alessandro Flammini. 2017. Online human-bot interactions: Detection, estimation, and characterization. In Eleventh international AAAI conference on web and social media.
[39]
Yuan Wang, Jie Liu, Jishi Qu, Yalou Huang, Jimeng Chen, and Xia Feng. 2014. Hashtag graph based topic model for tweet mining. In 2014 IEEE International Conference on Data Mining. IEEE, 1025--1030.
[40]
Chao Michael Zhang and Vern Paxson. 2011. Detecting and analyzing automated activity on twitter. In International Conference on Passive and Active Network Measurement. Springer, 102--111.
[41]
Elena Zheleva, Hossam Sharara, and Lise Getoor. 2009. Co-evolution of social and affiliation networks. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 1007--1016.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WIMS 2020: Proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics
June 2020
279 pages
ISBN:9781450375429
DOI:10.1145/3405962
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Anomaly Detection
  2. Graph mining
  3. Network Modelling
  4. Social networks
  5. Twitter

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

WIMS 2020

Acceptance Rates

WIMS 2020 Paper Acceptance Rate 35 of 63 submissions, 56%;
Overall Acceptance Rate 140 of 278 submissions, 50%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 102
    Total Downloads
  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)1
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media