Abstract
Reviews and comments play important role in online shopping. It can help people getting more information about stores and products. The potential customers tend to make decision according to it. However, driven by profit and fame, spammers post spurious reviews to mislead the customers by promoting or demoting target store. Previous studies mainly focused on the text features and ratings to identify fake reviews. However, these studies ignore the importance of relationship between store and reviewer. This paper first proposes sentiment analysis techniques to calculate the sentiment score of reviews. Then a relationship-based method has been proposed to identify the spammers. We also present an algorithm which can detect both single-mode spammers and multi-mode spammers. A subset of highly suspicious reviewers is selected for evaluation by human judges. Experimental results show that the proposed method can find out the review spammer efficiently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bachman, C.W., Daya, M.: The role concept in data models. In: Proceedings of the Third International Conference on Very Large Data Bases, VLDB 1977, vol. 3, pp. 464–476. VLDB Endowment (1977)
Gyongyi, Z., Garcia-Molina, H.: Web spam taxonomy. In: First International Workshop on Adversarial Information Retrieval on the Web, AIRWeb 2005 (2005)
Wu, B., Davison, B.D.: Identifying link farm spam pages. In: Special Interest Tracks and Posters of the 14th International Conference on World Wide Web. ACM (2005)
Fallows, D.: Spam: How it is hurting email and degrading life on the internet. Technical report, Pew Internet and American Life Project (October 2003)
Thomason, A.: Blog Spam: A Review. In: CEAS (2007)
Jindal, N., Chang, B.L.: Review Spam Detection. In: WWW 2007 (2007)
Jindal, N., Liu, B., Lim, E.P.: Finding Unusual Review Patterns using Unexpected Rules. In: CIKM 2010 (2010)
Lim, E.P., Nguyen, V.A., Jindal, N., Liu, B., Lauw, H.W.: Detecting product review spammers using rating behaviors. In: CIKM 2010 (2010)
Wang, G., Xie, S., Liu, B., Philip, S.Y.: Identify Online Store Review Spammers via Social Review Graph. ACM Transactions on Intelligent Systems and Technology 3(4), Article No. 61 (September 2012)
Mukherjee, A., Liu, B., Glance, N.: Spotting Fake Reviewer Groups in Consumer Reviews. In: WWW 2012 (2012)
Zhang, R., Sha, C., Zhou, M., Zhou, A.: Exploiting shopping and reviewing behavior to rescore online evaluations. In: WWW 2012 (2012)
Kant, R., Sengamedu, S.H., Kumar, K.S.: Comment spam detection by sequence mining. In: WWW 2012 (2012)
Li, F., Huang, M., Yang, Y., Zhu, X.: Learning to identify review spam. In: IJCAI 2011 (2011)
Ott, M., Choi, Y., Cardie, C., Hancock, J.T.: Finding deceptive opinion spam by any stretch of the imagination. In: HLT 2011 (2011)
Xie, S., et al.: Review spam detection via temporal pattern discovery. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2012)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177. ACM Press, New York (2004)
Ding, X., Liu, B., Yu, P.S., et al.: A holistic lexicon-based approach to opinion mining. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 231–240. ACM Press, New York (2008)
Kim, S.-M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics, p. 1367. Association for Computational Linguistics, Philadelphia (2004)
Indurkhya, N., Damerau, F.J.: Handbook of natural language processing. In: 2nd. Chapman and Hall/CRC (2010)
Ohana, B., Tierney, B.: Sentiment classification of reviews using SentiWordNet. In: 9th IT & T Conference, p. 13 (2009)
Hu, X., Tang, J., Gao, H., et al.: Unsupervised sentiment analysis with emotional signals. In: Proceedings of the 22nd International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, pp. 607–618 (2013)
Pak, A., Paroubek, P.: Twitter as a Corpus for Sentiment Analysis and Opinion Mining. In: LREC (2010)
Li, N., Wu, D.D.: Using text mining and sentiment analysis for online forums hotspot detection and forecast. Decision Support Systems 48(2), 354–368 (1981, 2010)
Bermingham, A., Smeaton, A.F.: Classifying sentiment in microblogs: is brevity an advantage? In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 1833–1836. ACM (2010)
Mudinas, A., Zhang, D., Levene, M.: Combining lexicon and learning based approaches for concept-level sentiment analysis. In: Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining, p. 5. ACM (2012)
Zhang, Q., Wu, Y., Li, T., et al.: Mining product reviews based on shallow dependency parsing. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and development in Information Retrieval, pp. 726–727. ACM (2009)
Wu, Y., Zhang, Q., Huang, X., et al.: Phrase dependency parsing for opinion mining. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 3, pp. 1533–1541. Association for Computational Linguistics (2009)
Joshi, M., Penstein-Ros, C.: Generalizing dependency features for opinion mining. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp. 313–316. Association for Computational Linguistics (2009)
Zhao, W.X., et al.: Jointly modeling aspects and opinions with a MaxEnt-LDA hybrid. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics (2010)
Thet, T.T., Na, J.-C., Khoo, C.S.G.: Aspect-based sentiment analysis of movie reviews on discussion boards. Journal of Information Science 36(6), 823–848 (2010)
Mei, Q., et al.: Topic sentiment mixture: modeling facets and opinions in weblogs. In: Proceedings of the 16th International Conference on World Wide Web. ACM (2007)
Esuli, A., Sebastiani, F.: Sentiwordnet: A publicly available lexical resource for opinion mining. In: Proceedings of LREC, vol. 6 (2006)
Wilson, T., et al.: OpinionFinder: A system for subjectivity analysis. In: Proceedings of HLT/EMNLP on Interactive Demonstrations. Association for Computational Linguistics (2005)
Jindal, N., Liu, B.: Mining comparative sentences and relations. In: AAAI, vol. 22 (2006)
Fleiss, J.L., Cohen, J.: The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educational and Psychological Measurement
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Peng, Q. (2014). Store Review Spammer Detection Based on Review Relationship. In: Parsons, J., Chiu, D. (eds) Advances in Conceptual Modeling. ER 2013. Lecture Notes in Computer Science, vol 8697. Springer, Cham. https://doi.org/10.1007/978-3-319-14139-8_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-14139-8_30
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14138-1
Online ISBN: 978-3-319-14139-8
eBook Packages: Computer ScienceComputer Science (R0)