Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2187836.2187863acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Spotting fake reviewer groups in consumer reviews

Published: 16 April 2012 Publication History

Abstract

Opinionated social media such as product reviews are now widely used by individuals and organizations for their decision making. However, due to the reason of profit or fame, people try to game the system by opinion spamming (e.g., writing fake reviews) to promote or demote some target products. For reviews to reflect genuine user experiences and opinions, such spam reviews should be detected. Prior works on opinion spam focused on detecting fake reviews and individual fake reviewers. However, a fake reviewer group (a group of reviewers who work collaboratively to write fake reviews) is even more damaging as they can take total control of the sentiment on the target product due to its size. This paper studies spam detection in the collaborative setting, i.e., to discover fake reviewer groups. The proposed method first uses a frequent itemset mining method to find a set of candidate groups. It then uses several behavioral models derived from the collusion phenomenon among fake reviewers and relation models based on the relationships among groups, individual reviewers, and products they reviewed to detect fake reviewer groups. Additionally, we also built a labeled dataset of fake reviewer groups. Although labeling individual fake reviews and reviewers is very hard, to our surprise labeling fake reviewer groups is much easier. We also note that the proposed technique departs from the traditional supervised learning approach for spam detection because of the inherent nature of our problem which makes the classic supervised learning approach less effective. Experimental results show that the proposed method outperforms multiple strong baselines including the state-of-the-art supervised classification, regression, and learning to rank algorithms.

References

[1]
Agrawal, R. and Srikant, R. Fast algorithms for mining association rules. VLDB. 1994.
[2]
Benevenuto, F., Rodrigues, T., Almeida, V., Almeida, J, Gonvalves, M. A. Detecting spammers and content promoters in online video social networks. SIGIR. 2009.
[3]
Burges, C.J.C., Shaked, T., Renshaw, E. Lazier, A. Deeds, M., Hamilton, N. Hullender., G. Learning to rank using gradient descent. ICML. 2005.
[4]
Castillo, C., Davison, B. Adversarial Web Search Foundations and Trends in Information Retrieval, 5, 2010.
[5]
Castillo, C., Donato, D., Becchetti, L., Boldi, P., Leonardi, S., Santini, M., and Vigna, S. 2006. A reference collection for web spam. SIGIR Forum 40, 2, 11--24, S. 2006.
[6]
Chirita, P.A., Diederich, J., and Nejdl, W. MailRank: using ranking for spam detection. CIKM. 2005.
[7]
Douceur, J. R. The sybil attack. IPTPS Workshop. 2002.
[8]
Eagle, N. and Pentland, A. Reality Mining: Sensing Complex Social Systems. Personal and Ubiquitous Computing. 2005.
[9]
Fayyad, U. M. and Irani, K. B. Multi-interval discretization of continuous-valued attributes for classification learning. IJCAI. 1993.
[10]
Fleiss, J. L. Measuring nominal scale agreement among many raters. Psychological Bulletin, 76(5), pp. 378--382, 1971.
[11]
Freund, Y., Iyer, R., Schapire, R. and Singer, Y. An efficient boosting algorithm for combining preference. JMLR. 2003.
[12]
Heath, M. T., Scientific Computing: An Introductory Survey. McGrawHill, New York. Second edition. 2002.
[13]
Hsu, W., Dutta, D., Helmy, A. Mining Behavioral Groups in Large Wireless LANs. MobiCom. 2007.
[14]
Jindal, N. and Liu, B. Opinion spam and analysis. WSDM. 2008.
[15]
Jindal, N., Liu, B. and Lim, E. P. Finding Unusual Review Patterns Using Unexpected Rules. CIKM. 2010.
[16]
Joachims, T. Making large-scale support vector machine learning practical. Advances in Kernel Methods. MIT Press. 1999.
[17]
Joachims, T. Optimizing Search Engines Using Clickthrough Data. KDD. 2002.
[18]
Kim, S.M., Pantel, P., Chklovski, T. and Pennacchiotti, M. Automatically assessing review helpfulness. EMNLP. 2006.
[19]
Kleinberg, J. M. Authoritative sources in a hyperlinked environment. ACM-SIAM SODA, 1998.
[20]
Kolari, P., Java, A., Finin, T., Oates, T., Joshi, A. Detecting Spam Blogs: A Machine Learning Approach. AAAI. 2006.
[21]
Koutrika, G., Effendi, F. A., Gyöngyi, Z., Heymann, P., and H. Garcia-Molina. Combating spam in tagging systems. AIRWeb. 2007.
[22]
Landis, J. R. and Koch, G. G. The measurement of observer agreement for categorical data. Biometrics, 33, 159--174, 1977.
[23]
Li, F., Huang, M., Yang, Y. and Zhu, X. Learning to identify review Spam. IJCAI. 2011.
[24]
Lim, E. Nguyen, V. A., Jindal, N., Liu, B., and Lauw, H. Detecting Product Review Spammers Using Rating Behavior. CIKM. 2010.
[25]
Liu, J., Cao, Y., Lin, C., Huang, Zhou, M. Low-quality product review detection in opinion summarization. EMNLP, 2007.
[26]
Liu, T-Y. Learning to Rank for Information Retrieval. Foundations and Trends in Information Retrieval 3(3): 225--331. 2009.
[27]
Markines, B., Cattuto, C., and Menczer, F. Social spam detection. AIRWeb. 2009.
[28]
Martinez-Romo, J. and Araujo, A. Web Spam Identification Through Language Model Analysis. AIRWeb. 2009.
[29]
Mukherjee, A., Liu, B., Wang, J., Glance, N., Jindal, N. Detecting Group Review Spam. WWW. 2011. (2-page Poster paper)
[30]
Ntoulas, A., Najork, M., Manasse M., Fetterly, D. Detecting Spam Web Pages through Content Analysis. WWW'2006.
[31]
Ott, M., Choi, Y., Cardie, C. Hancock, J. Finding Deceptive Opinion Spam by Any Stretch of the Imagination. ACL. 2011.
[32]
Wang, G., Xie, S., Liu, B., and Yu, P. S. Review Graph based Online Store Review Spammer Detection. ICDM. 2011.
[33]
Wang, Y. Ma, M. Niu, Y., Chen, H. Spam Double-Funnel: Connecting Web Spammers with Advertisers. WWW'2007.
[34]
Wu, G., Greene, D., Smyth, B. and Cunningham, P. 2010. Distortion as a validation criterion in the identification of suspicious reviews. Technical report, UCD-CSI-2010-04, University College Dublin.
[35]
Wu, B., Goel V. & Davison, B. D. Topical TrustRank: using topicality to combat Web spam. WWW. 2006.
[36]
Yan, F., Jiang, J., Lu, Y., Luo, Q., Zhang, M. Community Discovery Based on Social Actors' Interests & Social Relationships. SKG. 2008.
[37]
Zhang, Z. and Varadarajan, B. Utility scoring of product reviews. CIKM. 2006.

Cited By

View all
  • (2025)Large Scale Anonymous Collusion and its detection in crowdsourcingExpert Systems with Applications10.1016/j.eswa.2024.125284259(125284)Online publication date: Jan-2025
  • (2024)Unlocking Sentiments: Enhancing IOCL Petrol Pump ExperiencesInternational Journal of Innovative Science and Research Technology (IJISRT)10.38124/ijisrt/IJISRT24MAY214(929-936)Online publication date: 27-May-2024
  • (2024)Leveraging Stacking Framework for Fake Review Detection in the Hospitality SectorJournal of Theoretical and Applied Electronic Commerce Research10.3390/jtaer1902007519:2(1517-1558)Online publication date: 15-Jun-2024
  • Show More Cited By

Index Terms

  1. Spotting fake reviewer groups in consumer reviews

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '12: Proceedings of the 21st international conference on World Wide Web
    April 2012
    1078 pages
    ISBN:9781450312295
    DOI:10.1145/2187836
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • Univ. de Lyon: Universite de Lyon

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 April 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. fake review detection
    2. group opinion spam
    3. opinion spam

    Qualifiers

    • Research-article

    Conference

    WWW 2012
    Sponsor:
    • Univ. de Lyon
    WWW 2012: 21st World Wide Web Conference 2012
    April 16 - 20, 2012
    Lyon, France

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)138
    • Downloads (Last 6 weeks)16
    Reflects downloads up to 19 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Large Scale Anonymous Collusion and its detection in crowdsourcingExpert Systems with Applications10.1016/j.eswa.2024.125284259(125284)Online publication date: Jan-2025
    • (2024)Unlocking Sentiments: Enhancing IOCL Petrol Pump ExperiencesInternational Journal of Innovative Science and Research Technology (IJISRT)10.38124/ijisrt/IJISRT24MAY214(929-936)Online publication date: 27-May-2024
    • (2024)Leveraging Stacking Framework for Fake Review Detection in the Hospitality SectorJournal of Theoretical and Applied Electronic Commerce Research10.3390/jtaer1902007519:2(1517-1558)Online publication date: 15-Jun-2024
    • (2024)Research on the negative effect of product scarcity appeals on the purchase intention of green products and its mechanismFrontiers in Psychology10.3389/fpsyg.2024.122501115Online publication date: 8-Apr-2024
    • (2024)A multiview clustering framework for detecting deceptive reviewsJournal of Computer Security10.3233/JCS-22000132:1(31-52)Online publication date: 2-Feb-2024
    • (2024)How to detect fake online physician reviews: A deep learning approachDIGITAL HEALTH10.1177/2055207624127717110Online publication date: 30-Aug-2024
    • (2024)FakeX: A Framework for Detecting Fake Reviews of Browser ExtensionsProceedings of the 19th ACM Asia Conference on Computer and Communications Security10.1145/3634737.3656999(769-784)Online publication date: 1-Jul-2024
    • (2024)An Efficient Deep Learning Mechanism for Predicting Fake News/Reviews in Twitter DataInternational Journal on Artificial Intelligence Tools10.1142/S021821302450006433:06Online publication date: 16-Oct-2024
    • (2024)Spatio-Temporal Graph Representation Learning for Fraudster Group DetectionIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.321200135:5(6628-6642)Online publication date: May-2024
    • (2024)Understanding Large-Scale Network Effects in Detecting Review SpammersIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.324313911:4(4994-5004)Online publication date: Aug-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media