Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2488388.2488464acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Know your personalization: learning topic level personalization in online services

Published: 13 May 2013 Publication History

Abstract

Online service platforms (OSPs), such as search engines, news-websites, ad-providers, etc., serve highly personalized content to the user, based on the profile extracted from her history with the OSP. In this paper, we capture OSP's personalization for an user in a new data structure called the personalization vector (?), which is a weighted vector over a set of topics, and present efficient algorithms to learn it.
Our approach treats OSPs as black-boxes, and extracts η by mining only their output, specifically, the personalized (for an user) and vanilla (without any user information) contents served, and the differences in these content. We believe that such treatment of OSPs is a unique aspect of our work, not just enabling access to (so far hidden) profiles in OSPs, but also providing a novel and practical approach for retrieving information from OSPs by mining differences in their outputs.
We formulate a new model called Latent Topic Personalization (LTP) that captures the personalization vector in a learning framework and present efficient inference algorithms for determining it. We perform extensive experiments targeting search engine personalization, using data from both real Google users and synthetic setup. Our results indicate that LTP achieves high accuracy (R-pre = 84%) in discovering personalized topics.For Google data, our qualitative results demonstrate that the topics determined by LTP for a user correspond well to his ad-categories determined by Google.

References

[1]
P. N. Bennett, R. W. White, W. Chu, S. T. Dumais, P. Bailey, F. Borisyuk, and X. Cui. Modeling the impact of short-and long-term behavior on search personalization. In SIGIR, 2012.
[2]
J. Bischof and E. Airoldi. Summarizing topical content with word frequency and exclusivity. ICML, 2012.
[3]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 2003.
[4]
S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.
[5]
C. Buckley and E. M. Voorhees. Retrieval evaluation with incomplete information. In SIGIR, 2004.
[6]
G. Chen, H. Bai, L. Shou, K. Chen, and Y. Gao. Ups: efficient privacy protection in personalized web search. In SIGIR, 2011.
[7]
N. Craswell, A. P. de Vries, and I. Soboroff. Overview of the trec-2005 enterprise track. In TREC, 2005.
[8]
Z. Dou, R. Song, and J. R. Wen. A large-scale evaluation and analysis of personalized search strategies. In WWW, 2007.
[9]
A. Hannak, P. Sapiezynski, A. M. Kakhki, B. Krishnamurthy, D. Lazer, A. Mislove, and C. Wilson. Measuring personalization of web search. In WWW, 2013.
[10]
A. Korolova. Privacy violations using microtargeted ads: A case study. In ICDMW, 2010.
[11]
J. Lindamood, R. Heatherly, M. Kantarcioglu, and B. Thuraisingham. Inferring private information using social network data. In WWW, 2009.
[12]
R. D. Luce. Individual choice behavior: A theoretical analysis. Wiley, 1959.
[13]
J. Luxenburger, S. Elbassuoni, and G. Weikum. Matching task profiles and user needs in personalized web search. In CIKM, 2008.
[14]
A. Machanavajjhala, A. Korolova, and A. D. Sarma. Personalized social recommendations: accurate or private. VLDB Endowment, 2011.
[15]
A. Majumder and N. Shrivastava. Know your personalization: Learning topic level personalization in online services. Arxiv, abs/1212.3390, 2012.
[16]
C. L. Mallows. Non-null ranking models. Biometrika, 1957.
[17]
H. Mao, X. Shuai, and A. Kapadia. Wpes. In Proc. workshop on Privacy in the electronic society, 2011.
[18]
N. Matthijs and F. Radlinski. Personalizing web search using long term browsing history. In WSDM, 2011.
[19]
A. K. McCallum. Mallet: A machine learning for language toolkit. http://mallet.cs.umass.edu.
[20]
E. Pariser. The Filter Bubble: What the Internet Is Hiding from You. Penguin Press, 2011.
[21]
T. Qin, X. Geng, and T. Y. Liu. A new probabilistic model for rank aggregation. In NIPS, 2010.
[22]
D. Sontag, K. Collins-Thompson, P. N. Bennett, R. W. White, S. Dumais, and B. Billerbeck. Probabilistic models for personalizing web search. In WSDM, 2012.
[23]
B. Tan, X. Shen, and C. X. Zhai. Mining long-term search history to improve search accuracy. In SIGKDD, 2006.
[24]
J. Teevan, E. Adar, R. Jones, and M. Potts. History repeats itself: repeat queries in yahoo's logs. In SIGIR, 2006.
[25]
J. Teevan, S. T. Dumais, and E. Horvitz. Personalizing search via automated analysis of interests and activities. In SIGIR, 2005.
[26]
J. Teevan, S. T. Dumais, and E. Horvitz. Potential for personalization. TOCHI, 2010.
[27]
J. Teevan, S. T. Dumais, and D. J. Liebling. To personalize or not to personalize: modeling queries with variation in user intent. In SIGIR, 2008.
[28]
R. Wetzker, C. Zimmermann, and C. Bauckhage. Detecting trends in social bookmarking systems: A del.icio.us endeavor. IJDWM, 2010.
[29]
Y. Xu, K. Wang, B. Zhang, and Z. Chen. Privacy-enhancing personalized web search. In WWW, 2007.
[30]
Y. Zhu, L. Xiong, and C. Verdery. Anonymizing user profiles for personalized web search. In WWW, 2010.
[31]
Z. A. Zhu, W. Chen, T. Wan, C. Zhu, G. Wang, and Z. Chen. To divide and conquer search ranking by learning query difficulty. In CIKM, 2009.

Cited By

View all
  • (2022)Modeling User Profiles Through Multiple Types of User Interaction BehaviorsWeb Information Systems Engineering – WISE 202110.1007/978-3-030-90888-1_42(557-564)Online publication date: 1-Jan-2022
  • (2021)Towards Self-Regulated Individual Learning Path Generation Using Outcome Taxonomies and Constructive Alignment2021 IEEE International Conference on Engineering, Technology & Education (TALE)10.1109/TALE52509.2021.9678777(465-472)Online publication date: 5-Dec-2021
  • (2020)Individual Learning Effectiveness Based on Cognitive Taxonomies and Constructive Alignment2020 IEEE REGION 10 CONFERENCE (TENCON)10.1109/TENCON50793.2020.9293733(1002-1006)Online publication date: 16-Nov-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '13: Proceedings of the 22nd international conference on World Wide Web
May 2013
1628 pages
ISBN:9781450320351
DOI:10.1145/2488388

Sponsors

  • NICBR: Nucleo de Informatcao e Coordenacao do Ponto BR
  • CGIBR: Comite Gestor da Internet no Brazil

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. learning
  2. online service providers
  3. search personalization
  4. topics

Qualifiers

  • Research-article

Conference

WWW '13
Sponsor:
  • NICBR
  • CGIBR
WWW '13: 22nd International World Wide Web Conference
May 13 - 17, 2013
Rio de Janeiro, Brazil

Acceptance Rates

WWW '13 Paper Acceptance Rate 125 of 831 submissions, 15%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)1
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Modeling User Profiles Through Multiple Types of User Interaction BehaviorsWeb Information Systems Engineering – WISE 202110.1007/978-3-030-90888-1_42(557-564)Online publication date: 1-Jan-2022
  • (2021)Towards Self-Regulated Individual Learning Path Generation Using Outcome Taxonomies and Constructive Alignment2021 IEEE International Conference on Engineering, Technology & Education (TALE)10.1109/TALE52509.2021.9678777(465-472)Online publication date: 5-Dec-2021
  • (2020)Individual Learning Effectiveness Based on Cognitive Taxonomies and Constructive Alignment2020 IEEE REGION 10 CONFERENCE (TENCON)10.1109/TENCON50793.2020.9293733(1002-1006)Online publication date: 16-Nov-2020
  • (2020)Personalization in text information retrievalJournal of the Association for Information Science and Technology10.1002/asi.2423471:3(349-369)Online publication date: 28-Jan-2020
  • (2019)Large-Scale Gender/Age Prediction of Tumblr Users2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)10.1109/ICMLA.2019.00128(712-717)Online publication date: Dec-2019
  • (2018)Radikal Online - Das Internet und die Radikalisierung von Jugendlichenkommunikation@gesellschaft10.15460/kommges.2018.19.3.60619:3Online publication date: 1-Nov-2018
  • (2018)Q&RProceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3219819.3219894(139-148)Online publication date: 19-Jul-2018
  • (2018)Learning Geo-Social User Topical Profiles with Bayesian Hierarchical User FactorizationThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval10.1145/3209978.3210044(205-214)Online publication date: 27-Jun-2018
  • (2017)What Are You Known For?Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3077136.3080820(743-752)Online publication date: 7-Aug-2017
  • (2017)Leveraging Behavioral Factorization and Prior Knowledge for Community Discovery and ProfilingProceedings of the Tenth ACM International Conference on Web Search and Data Mining10.1145/3018661.3018693(71-79)Online publication date: 2-Feb-2017
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media