research-article

Spotting fake reviewer groups in consumer reviews

Authors:

Arjun Mukherjee,

Natalie GlanceAuthors Info & Claims

WWW '12: Proceedings of the 21st international conference on World Wide Web

Pages 191 - 200

https://doi.org/10.1145/2187836.2187863

Published: 16 April 2012 Publication History

Abstract

Opinionated social media such as product reviews are now widely used by individuals and organizations for their decision making. However, due to the reason of profit or fame, people try to game the system by opinion spamming (e.g., writing fake reviews) to promote or demote some target products. For reviews to reflect genuine user experiences and opinions, such spam reviews should be detected. Prior works on opinion spam focused on detecting fake reviews and individual fake reviewers. However, a fake reviewer group (a group of reviewers who work collaboratively to write fake reviews) is even more damaging as they can take total control of the sentiment on the target product due to its size. This paper studies spam detection in the collaborative setting, i.e., to discover fake reviewer groups. The proposed method first uses a frequent itemset mining method to find a set of candidate groups. It then uses several behavioral models derived from the collusion phenomenon among fake reviewers and relation models based on the relationships among groups, individual reviewers, and products they reviewed to detect fake reviewer groups. Additionally, we also built a labeled dataset of fake reviewer groups. Although labeling individual fake reviews and reviewers is very hard, to our surprise labeling fake reviewer groups is much easier. We also note that the proposed technique departs from the traditional supervised learning approach for spam detection because of the inherent nature of our problem which makes the classic supervised learning approach less effective. Experimental results show that the proposed method outperforms multiple strong baselines including the state-of-the-art supervised classification, regression, and learning to rank algorithms.

References

[1]

Agrawal, R. and Srikant, R. Fast algorithms for mining association rules. VLDB. 1994.

Digital Library

[2]

Benevenuto, F., Rodrigues, T., Almeida, V., Almeida, J, Gonvalves, M. A. Detecting spammers and content promoters in online video social networks. SIGIR. 2009.

Digital Library

[3]

Burges, C.J.C., Shaked, T., Renshaw, E. Lazier, A. Deeds, M., Hamilton, N. Hullender., G. Learning to rank using gradient descent. ICML. 2005.

Digital Library

[4]

Castillo, C., Davison, B. Adversarial Web Search Foundations and Trends in Information Retrieval, 5, 2010.

Digital Library

[5]

Castillo, C., Donato, D., Becchetti, L., Boldi, P., Leonardi, S., Santini, M., and Vigna, S. 2006. A reference collection for web spam. SIGIR Forum 40, 2, 11--24, S. 2006.

Digital Library

[6]

Chirita, P.A., Diederich, J., and Nejdl, W. MailRank: using ranking for spam detection. CIKM. 2005.

Digital Library

[7]

Douceur, J. R. The sybil attack. IPTPS Workshop. 2002.

Digital Library

[8]

Eagle, N. and Pentland, A. Reality Mining: Sensing Complex Social Systems. Personal and Ubiquitous Computing. 2005.

Digital Library

[9]

Fayyad, U. M. and Irani, K. B. Multi-interval discretization of continuous-valued attributes for classification learning. IJCAI. 1993.

[10]

Fleiss, J. L. Measuring nominal scale agreement among many raters. Psychological Bulletin, 76(5), pp. 378--382, 1971.

[11]

Freund, Y., Iyer, R., Schapire, R. and Singer, Y. An efficient boosting algorithm for combining preference. JMLR. 2003.

Digital Library

[12]

Heath, M. T., Scientific Computing: An Introductory Survey. McGrawHill, New York. Second edition. 2002.

Digital Library

[13]

Hsu, W., Dutta, D., Helmy, A. Mining Behavioral Groups in Large Wireless LANs. MobiCom. 2007.

Digital Library

[14]

Jindal, N. and Liu, B. Opinion spam and analysis. WSDM. 2008.

Digital Library

[15]

Jindal, N., Liu, B. and Lim, E. P. Finding Unusual Review Patterns Using Unexpected Rules. CIKM. 2010.

Digital Library

[16]

Joachims, T. Making large-scale support vector machine learning practical. Advances in Kernel Methods. MIT Press. 1999.

Digital Library

[17]

Joachims, T. Optimizing Search Engines Using Clickthrough Data. KDD. 2002.

Digital Library

[18]

Kim, S.M., Pantel, P., Chklovski, T. and Pennacchiotti, M. Automatically assessing review helpfulness. EMNLP. 2006.

Digital Library

[19]

Kleinberg, J. M. Authoritative sources in a hyperlinked environment. ACM-SIAM SODA, 1998.

Digital Library

[20]

Kolari, P., Java, A., Finin, T., Oates, T., Joshi, A. Detecting Spam Blogs: A Machine Learning Approach. AAAI. 2006.

Digital Library

[21]

Koutrika, G., Effendi, F. A., Gyöngyi, Z., Heymann, P., and H. Garcia-Molina. Combating spam in tagging systems. AIRWeb. 2007.

Digital Library

[22]

Landis, J. R. and Koch, G. G. The measurement of observer agreement for categorical data. Biometrics, 33, 159--174, 1977.

[23]

Li, F., Huang, M., Yang, Y. and Zhu, X. Learning to identify review Spam. IJCAI. 2011.

Digital Library

[24]

Lim, E. Nguyen, V. A., Jindal, N., Liu, B., and Lauw, H. Detecting Product Review Spammers Using Rating Behavior. CIKM. 2010.

Digital Library

[25]

Liu, J., Cao, Y., Lin, C., Huang, Zhou, M. Low-quality product review detection in opinion summarization. EMNLP, 2007.

[26]

Liu, T-Y. Learning to Rank for Information Retrieval. Foundations and Trends in Information Retrieval 3(3): 225--331. 2009.

Digital Library

[27]

Markines, B., Cattuto, C., and Menczer, F. Social spam detection. AIRWeb. 2009.

Digital Library

[28]

Martinez-Romo, J. and Araujo, A. Web Spam Identification Through Language Model Analysis. AIRWeb. 2009.

Digital Library

[29]

Mukherjee, A., Liu, B., Wang, J., Glance, N., Jindal, N. Detecting Group Review Spam. WWW. 2011. (2-page Poster paper)

Digital Library

[30]

Ntoulas, A., Najork, M., Manasse M., Fetterly, D. Detecting Spam Web Pages through Content Analysis. WWW'2006.

Digital Library

[31]

Ott, M., Choi, Y., Cardie, C. Hancock, J. Finding Deceptive Opinion Spam by Any Stretch of the Imagination. ACL. 2011.

Digital Library

[32]

Wang, G., Xie, S., Liu, B., and Yu, P. S. Review Graph based Online Store Review Spammer Detection. ICDM. 2011.

Digital Library

[33]

Wang, Y. Ma, M. Niu, Y., Chen, H. Spam Double-Funnel: Connecting Web Spammers with Advertisers. WWW'2007.

Digital Library

[34]

Wu, G., Greene, D., Smyth, B. and Cunningham, P. 2010. Distortion as a validation criterion in the identification of suspicious reviews. Technical report, UCD-CSI-2010-04, University College Dublin.

[35]

Wu, B., Goel V. & Davison, B. D. Topical TrustRank: using topicality to combat Web spam. WWW. 2006.

Digital Library

[36]

Yan, F., Jiang, J., Lu, Y., Luo, Q., Zhang, M. Community Discovery Based on Social Actors' Interests & Social Relationships. SKG. 2008.

Digital Library

[37]

Zhang, Z. and Varadarajan, B. Utility scoring of product reviews. CIKM. 2006.

Digital Library

Cited By

Han TXu WFang YDing X(2025)Large Scale Anonymous Collusion and its detection in crowdsourcingExpert Systems with Applications10.1016/j.eswa.2024.125284259(125284)Online publication date: Jan-2025
https://doi.org/10.1016/j.eswa.2024.125284
Gupta MJain CJain IBisht S. D(2024)Unlocking Sentiments: Enhancing IOCL Petrol Pump ExperiencesInternational Journal of Innovative Science and Research Technology (IJISRT)10.38124/ijisrt/IJISRT24MAY214(929-936)Online publication date: 27-May-2024
https://doi.org/10.38124/ijisrt/IJISRT24MAY214
Ashraf SJaved ABellary SBala PPanigrahi P(2024)Leveraging Stacking Framework for Fake Review Detection in the Hospitality SectorJournal of Theoretical and Applied Electronic Commerce Research10.3390/jtaer1902007519:2(1517-1558)Online publication date: 15-Jun-2024
https://doi.org/10.3390/jtaer19020075
Show More Cited By

Index Terms

Spotting fake reviewer groups in consumer reviews
1. Applied computing
  1. Law, social and behavioral sciences

Recommendations

Spotting opinion spammers using behavioral footprints
KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Opinionated social media such as product reviews are now widely used by individuals and organizations for their decision making. However, due to the reason of profit or fame, people try to game the system by opinion spamming (e.g., writing fake reviews) ...
Simultaneously detecting fake reviews and review spammers using factor graph model
WebSci '13: Proceedings of the 5th Annual ACM Web Science Conference

Review spamming is quite common on many online shopping platforms like Amazon. Previous attempts for fake review and spammer detection use features of reviewer behavior, rating, and review content. However, to the best of our knowledge, there is no work ...
Opinion spam and analysis
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining

Evaluative texts on the Web have become a valuable source of opinions on products, services, events, individuals, etc. Recently, many researchers have studied such opinion sources as product reviews, forum posts, and blogs. However, existing research ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '12: Proceedings of the 21st international conference on World Wide Web

April 2012

1078 pages

ISBN:9781450312295

DOI:10.1145/2187836

General Chairs:
Alain Mille
Université de Lyon, France
,
Fabien Gandon
INRIA, France
,
Jacques Misselis
HP, France
,
Program Chairs:
Michael Rabinovich
Case Western Reserve University, USA
,
Steffen Staab
University of Koblenz-Landau, Germany

Copyright © 2012 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Univ. de Lyon: Universite de Lyon

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 April 2012

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

WWW 2012

Sponsor:

Univ. de Lyon

WWW 2012: 21st World Wide Web Conference 2012

April 16 - 20, 2012

Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

481
Total Citations
View Citations
3,685
Total Downloads

Downloads (Last 12 months)138
Downloads (Last 6 weeks)16

Reflects downloads up to 19 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Han TXu WFang YDing X(2025)Large Scale Anonymous Collusion and its detection in crowdsourcingExpert Systems with Applications10.1016/j.eswa.2024.125284259(125284)Online publication date: Jan-2025
https://doi.org/10.1016/j.eswa.2024.125284
Gupta MJain CJain IBisht S. D(2024)Unlocking Sentiments: Enhancing IOCL Petrol Pump ExperiencesInternational Journal of Innovative Science and Research Technology (IJISRT)10.38124/ijisrt/IJISRT24MAY214(929-936)Online publication date: 27-May-2024
https://doi.org/10.38124/ijisrt/IJISRT24MAY214
Ashraf SJaved ABellary SBala PPanigrahi P(2024)Leveraging Stacking Framework for Fake Review Detection in the Hospitality SectorJournal of Theoretical and Applied Electronic Commerce Research10.3390/jtaer1902007519:2(1517-1558)Online publication date: 15-Jun-2024
https://doi.org/10.3390/jtaer19020075
Ye SLiu GLin YLin ZShi YHuang Z(2024)Research on the negative effect of product scarcity appeals on the purchase intention of green products and its mechanismFrontiers in Psychology10.3389/fpsyg.2024.122501115Online publication date: 8-Apr-2024
https://doi.org/10.3389/fpsyg.2024.1225011
Zhang YWang HStavrou A(2024)A multiview clustering framework for detecting deceptive reviewsJournal of Computer Security10.3233/JCS-22000132:1(31-52)Online publication date: 2-Feb-2024
https://dl.acm.org/doi/10.3233/JCS-220001
Zhao YLi TYuan QDeng S(2024)How to detect fake online physician reviews: A deep learning approachDIGITAL HEALTH10.1177/2055207624127717110Online publication date: 30-Aug-2024
https://doi.org/10.1177/20552076241277171
Olsson EEriksson BPicazo-Sanchez PAndersson LSabelfeld AQuek TGao DZhou JCardenas A(2024)FakeX: A Framework for Detecting Fake Reviews of Browser ExtensionsProceedings of the 19th ACM Asia Conference on Computer and Communications Security10.1145/3634737.3656999(769-784)Online publication date: 1-Jul-2024
https://dl.acm.org/doi/10.1145/3634737.3656999
Devan KMala G(2024)An Efficient Deep Learning Mechanism for Predicting Fake News/Reviews in Twitter DataInternational Journal on Artificial Intelligence Tools10.1142/S021821302450006433:06Online publication date: 16-Oct-2024
https://doi.org/10.1142/S0218213024500064
Shehnepoor STogneri RLiu WBennamoun M(2024)Spatio-Temporal Graph Representation Learning for Fraudster Group DetectionIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.321200135:5(6628-6642)Online publication date: May-2024
https://doi.org/10.1109/TNNLS.2022.3212001
Rout JSahoo KDalmia ABakshi SBilal MSong H(2024)Understanding Large-Scale Network Effects in Detecting Review SpammersIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.324313911:4(4994-5004)Online publication date: Aug-2024
https://doi.org/10.1109/TCSS.2023.3243139
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents