Abstract
Nowadays, many e-commerce websites allow users to log in with their existing social networking accounts. When a new user arrives at an e-commerce website, it is natural to ask whether information from external social media platforms can be used to alleviate the cold-start problem. In this paper, we focus on a specific cross-site information sharing task: leveraging the text a user has posted on a social media platform (termed social text) to infer his/her purchase preferences over product categories on an e-commerce platform. A key problem in solving this task is how to represent the social text so that its information can be used on the e-commerce platform. We study two major kinds of text representation methods for predicting cross-site purchase preference: shallow textual features, and deep textual features learned by deep neural network models. We conduct extensive experiments on a large linked dataset, and the results indicate that social text is a promising signal for predicting purchase preference. In particular, the deep neural network approach shows stronger predictive ability when the number of categories becomes large.
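To make the task setup concrete, the following is a minimal sketch of a shallow-feature pipeline: social text is mapped to a TF-IDF vector, and product categories are ranked by cosine similarity to per-category centroids. This is an illustration under stated assumptions, not the method evaluated in the paper: the choice of TF-IDF with a centroid ranker, the function names (`fit_idf`, `tfidf`, `fit_centroids`, `rank_categories`), and the toy texts and category labels are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Whitespace tokenization; real social text would need proper segmentation.
    return text.lower().split()


def fit_idf(docs):
    # Smoothed inverse document frequency over the training texts.
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf(text, idf):
    # Sparse L2-normalized TF-IDF vector; out-of-vocabulary tokens are dropped.
    tf = Counter(t for t in tokenize(text) if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else {}


def cosine(a, b):
    dot = sum(v * b.get(t, 0.0) for t, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def fit_centroids(texts, labels, idf):
    # Sum the TF-IDF vectors of all texts whose author bought in each category.
    centroids = defaultdict(lambda: defaultdict(float))
    for text, label in zip(texts, labels):
        for t, v in tfidf(text, idf).items():
            centroids[label][t] += v
    return {c: dict(vec) for c, vec in centroids.items()}


def rank_categories(text, idf, centroids):
    # Rank product categories for a new user from his/her social text alone.
    query = tfidf(text, idf)
    scores = {c: cosine(query, vec) for c, vec in centroids.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical linked data: social text paired with a purchased category.
train_texts = [
    "new phone camera battery review",
    "running shoes marathon training",
    "phone screen cracked need repair",
    "gym workout protein running",
]
train_labels = ["electronics", "sports", "electronics", "sports"]

idf = fit_idf(train_texts)
centroids = fit_centroids(train_texts, train_labels, idf)
ranking = rank_categories("my phone battery died again", idf, centroids)
print(ranking)
```

The ranked list is the natural output for this task because a cold-start user has no purchase history on the e-commerce side, so the social text is the only evidence available. A deep representation would replace `tfidf` with a learned encoder while leaving the ranking interface unchanged.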
Cite this article
Bai, T., Dou, HJ., Zhao, W.X. et al. An Experimental Study of Text Representation Methods for Cross-Site Purchase Preference Prediction Using the Social Text Data. J. Comput. Sci. Technol. 32, 828–842 (2017). https://doi.org/10.1007/s11390-017-1763-6
DOI: https://doi.org/10.1007/s11390-017-1763-6