Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1963405.1963411acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Sparse hidden-dynamics conditional random fields for user intent understanding

Published: 28 March 2011 Publication History

Abstract

Understanding user intent from her sequential search behaviors, i.e. predicting the intent of each user query in a search session, is crucial for modern Web search engines. However, due to the huge number of user behavior variables and coarse level intent labels defined by human editors, it is very difficult to directly model user behavioral dynamics or user intent dynamics in user search sessions. In this paper, we propose a novel Sparse Hidden-Dynamic Conditional Random Fields (SHDCRF) model for user intent learning from their search sessions. Through incorporating the proposed hidden state variables, SHDCRF aims to learn a substructure, i.e. a set of related hidden variables, for each intent label and they are used to model the intermediate dynamics between user intent labels and user behavioral variables. In addition, SHDCRF learns a sparse relation between the hidden variables and intent labels to make the hidden state variables explainable. Extensive experiment results, on real user search sessions from a popular commercial search engine show that the proposed SHDCRF model significantly outperforms in terms of intent prediction results that those classical solutions such as Support Vector Machine (SVM), Conditional Random Field (CRF) and Latnet-Dynamic Conditional Random Field (LDCRF).

References

[1]
E. Agichtein, Eric Brill and S. Dumais. Improving Web search ranking by incorporating user behavior information. In SIGIR' 06, pp. 19--26.
[2]
E. Agichtein, Eric Brill, S. Dumais and R. Ragno. Learning User Interaction Models for Predicting Web Search Result Preferences, In SIGIR'06, pp. 3--10.
[3]
Z. B. Andrei, M. Fontoura, E. Gabrilovich, A. Joshi, V. Josifovski and T. Zhang. Robust Classification of Rare Queries Using Web Knowledge. In SIGIR'07, pp. 231--238.
[4]
H. Cao, D. Jiang, J. Pei, Q. He and Z. Liao, E. Chen and H. Li. Context-Aware Query Suggestion By Mining Click-Through and session Data. In SIGKDD'08.
[5]
H. Cao, D. Jiang, J. Pei, E. Chen and H. Li. Towards context-aware search by learning a very large variable length hidden markov model from search logs. In WWW'09.
[6]
H. Cao, D. H. Hu, D. Shen, D. Jiang, J. T. Sun, E. Chen and Q. Yang. Context-aware query classification. In SIGIR'09.
[7]
Z. Cheng, B. Gao and T. Y. Liu. Actively predicting diverse search intent from user browsing behaviors. In WWW'10.
[8]
C. Cortes and V. Vapnik. Support-Vector Networks. Machine Learning, 1995, Vol. 20, pp.273--297.
[9]
E. R. Daniel and Danny Levinson. Understanding User Goals in Web Search. In WWW'04, pp. 13--19.
[10]
G. David, D. Nichols, M. Brain and O. D. Terry. Using collaborative filtering to weave an information tapestry. Communications of the ACM, 1992, vol.12, pp. 61--70.
[11]
V. Ganti, A.K. Christian and X. Li. Precomputing search features for fast and accurate query classification. In WSDM'10.
[12]
Gunawardana, M. Milind, A. Alex, and J.C. Platt. Hidden Conditional Random Fields for Phone Classification. International Conference on Speech Communication and Technology(ICSCT), 2005.
[13]
Hassan, R. Jones, and K. L. Klinkner. Beyond DCG, User Behavior as a Predictor of a Successful Search. In WSDM'10.
[14]
Q. He, D. Jiang, Z. Liao, C. H. Steven, K. Chang, E. P. Lim and H. Li. Web Query Recommendation via Sequential Query Prediction. In ICDE'09.
[15]
J. Hu, G. Wang, F. Lochovsky, J. T. Sun and Z. Chen. Understanding user's query intent with wikipedia. In WWW'09.
[16]
D. H. Hu, D. Shen, J. T. Sun, Q. Yang and Z. Chen. Context-Aware Online Commercial Intention Detection. LNCS(5829), 2009, pp. 135--149.
[17]
D. Jeffrey and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In Operating Systems Design and Implementation(OSDI), 2004, pp. 137--150.
[18]
T. Joachims. SVM-light Support Vector Machine. http://svmlight.joachims.org/
[19]
I. H. Kang, G. C. Kim. Query Type Classification for web document Retrieval. In SIGIR'03.
[20]
J. Lafferty, A. McCallum and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In ICML' 01.
[21]
Lewis and David. Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval. In ECML'98, pp.4--15.
[22]
X. Li, Y. Y. W., Alex Acero. Learning Query Intent from Regularized Click Graphs. In SIGIR'08.
[23]
D. C. Liu and J. Nocedal. On the Limited Memory Method for Large Scale Optimization. Mathematical Programming, 1989. pp. 503--528.
[24]
S. Martigny and T. Artieres. Neural conditional random fields. In AISTATS'10, pp.177--184.
[25]
H. Nguyen. Capturing User Intent For Information Retrieval. In AAAI'04.
[26]
N. Okazaki. CRFsuit. A fast implementation of Conditional Random Fields. http://www.chokkan.org/software/crfsuite/.
[27]
J. Pearl. Probabilistic Reasoning in Intellignet Systems: Networks of Plausible Inference. Morgan Kaufmann, 1998.
[28]
J. Peng, L. F. Bo and J. B. Xu. Conditional Neural Fields. In NIPS'09.
[29]
L. M. Philippe, A. Quattoni and T. Darrell. Latent-Dynamic Discriminative Models for Continuous Gesture Recognition. In CVPR'07.
[30]
Quattoni, M. Collins, and T. Darrell. Conditional random fields for object recognition. In NIPS'04.
[31]
B. Y. Ricardo, C. Hurtado and M. Mendoza. Query Clustering for Boosting Web Page Ranking. LNCS(3034), 2004, pp.164--175.
[32]
S. Sarawagi and William W. Cohen. Semi-Markov conditional random fields for information extraction. In NIPS'04.
[33]
K. Seymore, A. McCallum and R. Rosenfeld. Learning Hidden Markov Model Structure for Information Extraction. In AAAI'99 Workshop on Machine Learning for Information Extraction, 2009.
[34]
D. Shen, J. T. Sun, Q. Yang, and Z. Chen. Building Bridges for Web Query Classification. In SIGIR'06.
[35]
D. Shen and R. Pan. Query Enrichment for Web-Query Classification. ACM Transactions on Information System 2006.
[36]
M. B. Steven, C. J. Eric, F. Ophir, D. L. David, C. Abdur, K. Aleksander. Improving Automatic Query Classification via Semi-Supervised Learning. In ICDM'05.
[37]
M. B. Steven and C. J. Eric. Automatic Classification of Web Queries Using Very Large Unlabeled Query Logs. In TOIS'06, vol. 24, pp. 320--352.
[38]
Sutton and A. McCallum. Collective Segmentation and Labeling of Distant Entities in Information Extraction. In ICML workshop on Statistical Relational Learning, 2004.
[39]
Sutton, K. Rohanimanesh and A. McCallum. Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence Data. In ICML'04.
[40]
Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory, 2003, Vol.13, pp. 260--269.
[41]
J. Wang, A. de Vries and M. Reinders. A User-Item Relevance Model for Log-Based Collaborative Filtering. LNCS3936 (January 2006), pp. 37--48.
[42]
S. B. Wang, A. Quattini, L. P. Morency and D. Demirdjian. Hidden Conditional Random Fields for Gesture Recognition. In CVPR'06.
[43]
Yu, L. Deng, and S. Wang. Learning in the Deep-Structured Conditional Random Fields. In NIPS'09.
[44]
X. H. Zhang. Building Maximum Entropy Text Classifier Using Semi-Supervised Learning. PhD thesis, NUS, 2004.
[45]
Thomas G. Dietterich. Machine Learning for Sequential Data: A Review. LNCS(2396), 2002, pp. 15--30.
[46]
H. Zou, T. Hastie and R. Tibshirani. Sparse Principal Component Analysis. Journal of Computational and Graphical Statistics, 2006.
[47]
J. Mairal, F. Bach, J. Ponce and G. Sapiro. Online dictionary learning for sparse coding. In ICML'09.

Cited By

View all
  • (2024)Identifying Shopping Intent in Product QA for Proactive RecommendationsAdvances on Graph-Based Approaches in Information Retrieval10.1007/978-3-031-71382-8_3(25-40)Online publication date: 10-Oct-2024
  • (2022)A novel temporal recommendation method based on user query topic evolutionKnowledge-Based Systems10.1016/j.knosys.2022.108239241(108239)Online publication date: Apr-2022
  • (2021)Dataset of Natural Language Queries for E-CommerceProceedings of the 2021 Conference on Human Information Interaction and Retrieval10.1145/3406522.3446043(307-311)Online publication date: 14-Mar-2021
  • Show More Cited By

Index Terms

  1. Sparse hidden-dynamics conditional random fields for user intent understanding

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '11: Proceedings of the 20th international conference on World wide web
    March 2011
    840 pages
    ISBN:9781450306324
    DOI:10.1145/1963405
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 March 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. conditional random field
    2. hidden variable
    3. sparse hidden-dynamic
    4. user intent
    5. user search session

    Qualifiers

    • Research-article

    Conference

    WWW '11
    WWW '11: 20th International World Wide Web Conference
    March 28 - April 1, 2011
    Hyderabad, India

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 20 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Identifying Shopping Intent in Product QA for Proactive RecommendationsAdvances on Graph-Based Approaches in Information Retrieval10.1007/978-3-031-71382-8_3(25-40)Online publication date: 10-Oct-2024
    • (2022)A novel temporal recommendation method based on user query topic evolutionKnowledge-Based Systems10.1016/j.knosys.2022.108239241(108239)Online publication date: Apr-2022
    • (2021)Dataset of Natural Language Queries for E-CommerceProceedings of the 2021 Conference on Human Information Interaction and Retrieval10.1145/3406522.3446043(307-311)Online publication date: 14-Mar-2021
    • (2021)A Framework for Comparative Analysis of Intention Mining ApproachesResearch Challenges in Information Science10.1007/978-3-030-75018-3_2(20-37)Online publication date: 8-May-2021
    • (2018)A Semantic Approach for Estimating Consumer Content Preferences from Online Search QueriesMarketing Science10.1287/mksc.2018.111237:6(930-952)Online publication date: 1-Nov-2018
    • (2017)Multi-task deep learning for user intention understanding in speech interaction systemsProceedings of the Thirty-First AAAI Conference on Artificial Intelligence10.5555/3298239.3298264(161-167)Online publication date: 4-Feb-2017
    • (2017)Survey on challenges of Question Answering in the Semantic WebSemantic Web10.3233/SW-1602478:6(895-920)Online publication date: 1-Jan-2017
    • (2017)A statistical analysis approach to predict user's changing requirements for software service evolutionJournal of Systems and Software10.1016/j.jss.2017.06.071132:C(147-164)Online publication date: 1-Oct-2017
    • (2016)Learning to Filter User Explicit Intents in Online Vietnamese Social Media TextsIntelligent Information and Database Systems10.1007/978-3-662-49390-8_2(13-24)Online publication date: 2016
    • (2015)Examining Personalization in Academic Web SearchProceedings of the 26th ACM Conference on Hypertext & Social Media10.1145/2700171.2791039(103-111)Online publication date: 24-Aug-2015
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media