Abstract
Predicting customer purchase behavior is an interesting and challenging task. In the e-commerce context, meeting this challenge requires confronting many problems not observed in the traditional business context. Recommender system technology has been widely adopted by e-commerce websites. However, a traditional recommendation algorithm cannot perform well the predictive task in this context. This study intends to build a predictive framework for customer purchase behavior in the e-commerce context. This framework, known as CustOmer purchase pREdiction modeL (COREL), may be understood as a two-stage process. First, associations among products are investigated and exploited to predicate customer’s motivations, i.e., to build a candidate product collection. Next, customer preferences for product features are learned and subsequently used to identify the candidate products most likely to be purchased. This study investigates three categories of product features and develops methods to detect customer preferences for each of these three categories. When a product purchased by a particular consumer is submitted to COREL, the program can return the top n products most likely to be purchased by that customer in the future. Experiments conducted on a real dataset show that customer preference for particular product features plays a key role in decision-making and that COREL greatly outperforms the baseline methods.
Similar content being viewed by others
Notes
In this paper, we use the term motivation as shorthand for motivations for purchasing products.
In this paper, we use the term “product feature” to refer both to product attributes (e.g., color, size, etc.) and to the related information displayed on e-commerce websites (e.g., ratings, reviews, sales, etc.).
References
Ahn, L. V., & Dabbish, L. (2004). Labeling images with a computer game. In The Proceedings of the SIGCHI'04 Conference on Human Factors in Computing Systems (pp. 319–326).
Archak, N., Ghose, A., & Ipeirotis, P. G. (2011). Deriving the pricing power of product features by mining consumer reviews. Management Science, 57(8), 1485–1509.
Balabanovic, M., & Shoham, Y. (1997). Fab: Content-based, collaborative recommendation. Communications of the ACM, 40(3), 66–72.
Bodapati, A. V. (2008). Recommendation systems with purchases data. Journal of Marketing Research, 45(1), 77–93.
Chang, C.-C., & Lin, C.-J. (2011). LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2(3), 27:1–27:27.
Chen, J., Jin, Q., Zhao, S., Bao, S., Zhang, L., Su, Z., et al. Does product recommendation meet its waterloo in unexplored categories?: No, price comes to help. In Proceedings of the 37th International ACM Conference on Research and Development in Information Retrieval, SIGIR’14.
Chen, Z.-Y., & Fan, Z.-P. (2012). Distributed customer behavior prediction using multiplex data: A collaborative MK-SVM approach. Knowledge-Based Systems, 35, 111–119.
Guo, Y., & Barnes, S. (2011). Purchase behavior in virtual worlds: An empirical investigation in second life. Information and Management, 48(7), 303–312.
Hung, C., & Tsai, C. (2008). Market segmentation based on hierarchical self-organizing map for markets of multimedia on demand. Expert Systems with Applications, 34(1), 780–787.
Kim, E., Kim, W., & Lee, Y. (2003). Combination of multiple classifiers for the customer’s purchase behavior prediction. Decision Support Systems, 34(2), 167–175.
Koenigstein, N., & Koren, Y. (2013). Towards scalable and accurate item-oriented recommendations. In Proceedings of the 7th ACM Conference on Recommender Systems (pp. 419–422).
Li, Y., Hu, J., Zhai, C., & Chen, Y. (2010). Improving one-class collaborative filtering by incorporating rich user information. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management (pp. 959–968).
Magnus, J. R., & Neudecker, H. (1995). Matrix differential calculus with applications in statistics and econometrics. New York: Wiley.
Manning, C. D., Raghavan, P., & Schutze, H. (2009). Introduction to information retrieval. Cambridge: Cambridge University Press.
Mobasher, B., Dai, H., Luo, T., & Nakagawa, M. (2002). Using sequential and non-sequential patterns in predictive web usage mining tasks. In Proceedings of the 2002 IEEE International Conference on Data Mining (pp. 669–672).
Moon, S., & Russell, G. J. (2008). Predicting product purchase from inferred customer similarity: An autologistic model approach. Management Science, 54(1), 71–82.
Paquet, U., & Koenigstein, N. (2013). One-class collaborative filtering with random graphs. In Proceedings of the 22nd International Conference on World Wide Web (pp. 999–1008).
Parikh, N., & Sundaresan, N. (2009). Buzz-based recommender system. In Proceedings of the 18th International Conference on World Wide Web (pp. 1231–1232).
Park, S.-H., Huh, S.-Y., Oh, W., & Han, S. P. (2012). A Social network-based inference model for validating customer profile data. MIS Quarterly, 36(4), 1217–1237.
Pazzani, M., & Billsus, D. (2007). Content-based recommendation systems. The adaptive web - Methods and Strategies of Web Personalization. Lecture Notes in Computer Science, 4321, 325–341.
Ponte, J., & Croft, W. B. (1998). A language modeling approach to information retrieval. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 275–281).
Rendle, S., Freudenthaler, C., & Schmidt-Thieme, L. (2010). Factorizing personalized Markov chains for next-basket recommendation. In Proceedings of the 19th International Conference on World Wide Web (pp. 811–820).
Shi, S. W., & Zhang, J. (2014). Usage experience with decision aids and evolution of online purchase behavior. Marketing Science, 33(6), 871–882.
Snow, R., O’Connor, B., Jurafsky, D., & Ng, A. Y. (2008). Cheap and fast—But is it good?: Evaluating non-expert annotations for natural language tasks. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (pp. 254–263).
Somervuori, O., & Ravaja, N. (2013). Purchase behavior and psychophysiological responses to different price levels. Psychology and Marketing, 30(6), 479–489.
Suh, E. H., Noh, K. C., & Suh, C. K. (1999). Customer list segmentation using the combined response model. Expert Systems with Applications, 17(2), 89–97.
Vapnik, V. N. (1998). Statistical learning theory. New York: Wiley.
Wang, J., & Zhang, Y. (2011). Utilizing marginal net utility for recommendation in E-commerce. In Proceedings of the 34th International ACM Conference on Research and Development in Information Retrieval (pp. 1003–1012).
Wong, C. R., Fu, A. W., & Wang, K. (2005). Data mining for inventory item selection with cross-selling considerations. Data Mining and Knowledge Discovery, 11(1), 81–112.
Yang, S., & Allenby, G. M. (2003). Modeling interdependent consumer preferences. Journal of Marketing Research, 40(3), 282–294.
Yang, Y., Liu, H., & Cai, Y. (2013). Discovery of online shopping patterns across website. Informs Journal on Computing, 25(1), 161–176.
Zhang, Y., & Pennacchiotti, M. (2013). Predicting purchase behaviors from social media. In Proceedings of the 22th International Conference on World Wide Web (pp. 1521–1531).
Zimdars, A., Chickering, D. M., & Meek, C. (2001). Using temporal data for making recommendations. In Proceedings of the 17th Conference on Uncertainty in Artificial Intelligence (pp. 580–588).
Acknowledgments
This work was supported by Major Program of National Natural Science Foundation of China (No. 91218301), National Natural Science Foundation of China (No. 71473201) and Humanities and Social Science Foundation of Ministry of Education of China (No. 14YJAZH063).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Qiu, J., Lin, Z. & Li, Y. Predicting customer purchase behavior in the e-commerce context. Electron Commer Res 15, 427–452 (2015). https://doi.org/10.1007/s10660-015-9191-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10660-015-9191-6