Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3269206.3271808acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

RIN: Reformulation Inference Network for Context-Aware Query Suggestion

Published: 17 October 2018 Publication History

Abstract

Search engine users always endeavor to reformulate queries during search sessions for articulating their information needs because it is not always easy to articulate the search intents. To further ameliorate the reformulation process, search engines may provide some query suggestions based on previous queries. In this paper, we propose Reformulation Inference Network (RIN) to learn how users reformulate queries, thereby benefiting context-aware query suggestion. Instead of categorizing reformulations into predefined patterns, we represent queries and reformulations in a homomorphic hidden space through heterogeneous network embedding. To capture the structure of the session context, a recurrent neural network (RNN) with the attention mechanism is employed to encode the search session by reading the homomorphic query and reformulation embeddings. It enables the model to explicitly captures the former reformulation for each query in the search session and directly learn user reformulation behaviors, from which query suggestion may benefit as shown in previous studies. To generate query suggestions, a binary classifier and an RNN-based decoder are introduced as the query discriminator and the query generator. Inspired by the intuition that model accurately predicting the next reformulation can also correctly infer the next intended query, a reformulation inferencer is then designed for inferring the next reformulation in the latent space of homomorphic embeddings. Therefore, both question suggestion and reformulation prediction can be simultaneously optimized by multi-task learning. Extensive experiments are conducted on publicly available AOL search engine logs. The experimental results demonstrate that RIN outperforms competitive baselines across various situations for both discriminative and generative tasks of context-aware query suggestion.

References

[1]
Mart'in Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et almbox. 2016. TensorFlow: A System for Large-Scale Machine Learning. In OSDI '16, Vol. 16. 265--283.
[2]
Jun-ichi Akahani, Kaoru Hiramatsu, and Kiyoshi Kogure. 2002. Coordinating heterogeneous information services based on approximate ontology translation. In Proceedings of AAMAS-2002 Workshop on Agentcities: Challenges in Open Agent Systems. Citeseer, 10--14.
[3]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).
[4]
Ziv Bar-Yossef and Naama Kraus. 2011. Context-sensitive query auto-completion. In Proceedings of the 20th international conference on World wide web. ACM, 107--116.
[5]
Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, and Sebastiano Vigna. 2009b. Query suggestions using query-flow graphs. In Proceedings of the 2009 workshop on Web Search Click Data. ACM, 56--63.
[6]
Paolo Boldi, Francesco Bonchi, Carlos Castillo, and Sebastiano Vigna. 2009a. From" dango" to" japanese cakes": Query reformulation models and patterns. In Web Intelligence and Intelligent Agent Technologies, 2009. WI-IAT'09. IEEE/WIC/ACM International Joint Conferences on, Vol. 1. IEEE, 183--190.
[7]
Christopher JC Burges. 2010. From ranknet to lambdarank to lambdamart: An overview. Learning, Vol. 11, 23--581 (2010), 81.
[8]
Huanhuan Cao, Daxin Jiang, Jian Pei, Enhong Chen, and Hang Li. 2009. Towards context-aware search by learning a very large variable length hidden markov model from search logs. In Proceedings of the 18th international conference on World wide web. ACM, 191--200.
[9]
Huanhuan Cao, Daxin Jiang, Jian Pei, Qi He, Zhen Liao, Enhong Chen, and Hang Li. 2008. Context-aware query suggestion by mining click-through and session data. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 875--883.
[10]
Surajit Chaudhuri and Raghav Kaushik. 2009. Extending autocompletion to tolerate errors. In Proceedings of the 2009 ACM SIGMOD International Conference on Management of data. ACM, 707--718.
[11]
Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014).
[12]
Stéphane Clinchant and Florent Perronnin. 2013. Aggregating continuous word embeddings for information retrieval. In CVSC '13 . 100--109.
[13]
Van Dang and Bruce W Croft. 2010. Query reformulation using anchor text. In Proceedings of the third ACM international conference on Web search and data mining. ACM, 41--50.
[14]
Mostafa Dehghani, Sascha Rothe, Enrique Alfonseca, and Pascal Fleury. 2017. Learning to Attend, Copy, and Generate for Session-Based Query Suggestion. In CIKM '17. ACM, 1747--1756.
[15]
Bruno M Fonseca, Paulo Golgher, Bruno Pôssas, Berthier Ribeiro-Neto, and Nivio Ziviani. 2005. Concept-based interactive query expansion. In CIKM '05. ACM, 696--703.
[16]
Bruno M Fonseca, Paulo Braz Golgher, Edleno Silva de Moura, and Nivio Ziviani. 2003. Using association rules to discover search engines related queries. In Web Congress '03. IEEE, 66--71.
[17]
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. KDD '16. ACM, 855--864.
[18]
Jiafeng Guo, Xueqi Cheng, Gu Xu, and Xiaofei Zhu. 2011. Intent-aware query similarity. In Proceedings of the 20th ACM international conference on Information and knowledge management. ACM, 259--268.
[19]
Jun Han and Claudio Moraga. 1995. The Influence of the Sigmoid Function Parameters on the Speed of Backpropagation Learning. In Proceedings of the International Workshop on Artificial Neural Networks: From Natural to Artificial Neural Computation. Springer-Verlag, 195--201.
[20]
Qi He, Daxin Jiang, Zhen Liao, Steven CH Hoi, Kuiyu Chang, Ee-Peng Lim, and Hang Li. 2009. Web query recommendation via sequential query prediction. In Data Engineering, 2009. ICDE'09. IEEE 25th International Conference on. IEEE, 1443--1454.
[21]
Geoffrey E Hinton and Ruslan R Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. science, Vol. 313, 5786 (2006), 504--507.
[22]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation, Vol. 9, 8 (1997), 1735--1780.
[23]
Chien-Kang Huang, Lee-Feng Chien, and Yen-Jen Oyang. 2003. Relevant term suggestion in interactive web search based on contextual information in query session logs. Journal of the Association for Information Science and Technology, Vol. 54, 7 (2003), 638--649.
[24]
Jeff Huang and Efthimis N Efthimiadis. 2009. Analyzing and evaluating query reformulation strategies in web search logs. In Proceedings of the 18th ACM conference on Information and knowledge management. ACM, 77--86.
[25]
Di Jiang, Kenneth Wai-Ting Leung, and Wilfred Ng. 2011. Context-aware search personalization with concept preference. In Proceedings of the 20th ACM international conference on Information and knowledge management. ACM, 563--572.
[26]
Jyun-Yu Jiang and Pu-Jen Cheng. 2016. Classifying User Search Intents for Query Auto-Completion. In ICTIR '16. ACM, 49--58.
[27]
Jyun-Yu Jiang, Yen-Yu Ke, Pao-Yu Chien, and Pu-Jen Cheng. 2014. Learning user reformulation behavior for query auto-completion. In SIGIR '14. ACM, 445--454.
[28]
Jyun-Yu Jiang, Jing Liu, Chin-Yew Lin, and Pu-Jen Cheng. 2015. Improving ranking consistency for web search by leveraging a knowledge base and search logs. In CIKM '15. ACM, 1441--1450.
[29]
Rosie Jones, Benjamin Rey, Omid Madani, and Wiley Greiner. 2006. Generating query substitutions. In WWW '06. ACM, 387--396.
[30]
Rafal Jozefowicz, Wojciech Zaremba, and Ilya Sutskever. 2015. An empirical exploration of recurrent network architectures. In ICML '15 . 2342--2350.
[31]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[32]
Quoc Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In ICML '14 . 1188--1196.
[33]
Chia-Jung Lee, Ruey-Cheng Chen, Shao-Hang Kao, and Pu-Jen Cheng. 2009. A term dependency-based approach for query terms ranking. In CIKM '09. ACM, 1267--1276.
[34]
Zhen Liao, Daxin Jiang, Enhong Chen, Jian Pei, Huanhuan Cao, and Hang Li. 2011. Mining concept sequences from large-scale search logs for context-aware query suggestion. TIST, Vol. 3, 1 (2011), 17.
[35]
Qiaozhu Mei, Dengyong Zhou, and Kenneth Church. 2008. Query suggestion using hitting time. In CIKM '08. ACM, 469--478.
[36]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In NIPS '13 . 3111--3119.
[37]
Bhaskar Mitra. 2015. Exploring session context using distributed representations of queries and reformulations. In SIGIR '15. ACM, 3--12.
[38]
Michael Naehrig, Kristin Lauter, and Vinod Vaikuntanathan. 2011. Can homomorphic encryption be practical?. In Proceedings of the 3rd ACM workshop on Cloud computing security workshop. ACM, 113--124.
[39]
Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve restricted boltzmann machines. In ICML '10. 807--814.
[40]
Umut Ozertem, Olivier Chapelle, Pinar Donmez, and Emre Velipasaoglu. 2012. Learning to suggest: a machine learning framework for ranking query suggestions. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval. ACM, 25--34.
[41]
Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) . 1532--1543.
[42]
Gary Ren, Xiaochuan Ni, Manish Malik, and Qifa Ke. 2018. Conversational Query Understanding Using Sequence to Sequence Modeling. In WWW '18. International World Wide Web Conferences Steering Committee, 1715--1724.
[43]
Rodrygo LT Santos, Craig Macdonald, and Iadh Ounis. 2013. Learning to rank query suggestions for adhoc and diversity search. Information Retrieval, Vol. 16, 4 (2013), 429--451.
[44]
Milad Shokouhi. 2013. Learning to personalize query auto-completion. In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval. ACM, 103--112.
[45]
Yang Song, Dengyong Zhou, and Li-wei He. 2012. Query suggestion by constructing term-transition graphs. In Proceedings of the fifth ACM international conference on Web search and data mining. ACM, 353--362.
[46]
Alessandro Sordoni, Yoshua Bengio, Hossein Vahabi, Christina Lioma, Jakob Grue Simonsen, and Jian-Yun Nie. 2015. A hierarchical recurrent encoder-decoder for generative context-aware query suggestion. In CIKM '15. ACM, 553--562.
[47]
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to sequence learning with neural networks. In NIPS '14 . 3104--3112.
[48]
Christoph Tillmann, Stephan Vogel, Hermann Ney, Arkaitz Zubiaga, and Hassan Sawaf. 1997. Accelerated DP based search for statistical translation. In EUROSPEECH '97 .
[49]
Charles F Van Loan. 1996. Matrix computations (Johns Hopkins studies in mathematical sciences). (1996).
[50]
Bin Wu, Chenyan Xiong, Maosong Sun, and Zhiyuan Liu. 2018. Query Suggestion with Feedback Memory Network. In WWW '18. International World Wide Web Conferences Steering Committee, 1563--1571.
[51]
Wei Wu, Hang Li, and Jun Xu. 2013. Learning query and document similarities from click-through bipartite graph with metadata. In Proceedings of the sixth ACM international conference on Web search and data mining. ACM, 687--696.
[52]
Hamed Zamani and W Bruce Croft. 2016. Estimating embedding vectors for queries. In ICTIR '16. ACM, 123--132.
[53]
Hamed Zamani and W Bruce Croft. 2017. Relevance-based Word Embedding. In SIGIR '17. ACM, 505--514.
[54]
Guoqing Zheng and Jamie Callan. 2015. Learning to reweight terms with distributed representations. In SIGIR '15. ACM, 575--584.

Cited By

View all
  • (2024)Balancing Act: Boosting Strategies for Informed Search on Controversial TopicsProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638329(254-265)Online publication date: 10-Mar-2024
  • (2024)Mining Exploratory Queries for Conversational SearchProceedings of the ACM Web Conference 202410.1145/3589334.3645424(1386-1394)Online publication date: 13-May-2024
  • (2024)Knowledge-Augmented Large Language Models for Personalized Contextual Query SuggestionProceedings of the ACM Web Conference 202410.1145/3589334.3645404(3355-3366)Online publication date: 13-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management
October 2018
2362 pages
ISBN:9781450360142
DOI:10.1145/3269206
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. query embedding
  2. query reformulation
  3. query session modeling
  4. query suggestion
  5. recurrent neural network

Qualifiers

  • Research-article

Conference

CIKM '18
Sponsor:

Acceptance Rates

CIKM '18 Paper Acceptance Rate 147 of 826 submissions, 18%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)22
  • Downloads (Last 6 weeks)2
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Balancing Act: Boosting Strategies for Informed Search on Controversial TopicsProceedings of the 2024 Conference on Human Information Interaction and Retrieval10.1145/3627508.3638329(254-265)Online publication date: 10-Mar-2024
  • (2024)Mining Exploratory Queries for Conversational SearchProceedings of the ACM Web Conference 202410.1145/3589334.3645424(1386-1394)Online publication date: 13-May-2024
  • (2024)Knowledge-Augmented Large Language Models for Personalized Contextual Query SuggestionProceedings of the ACM Web Conference 202410.1145/3589334.3645404(3355-3366)Online publication date: 13-May-2024
  • (2024)End-to-end pseudo relevance feedback based vertical web search queries recommendationMultimedia Tools and Applications10.1007/s11042-024-18559-4Online publication date: 21-Feb-2024
  • (2024)Sequential query prediction based on multi-armed bandits with ensemble of transformer experts and immediate feedbackData Mining and Knowledge Discovery10.1007/s10618-024-01057-438:6(3758-3782)Online publication date: 2-Aug-2024
  • (2024)Probabilistic graph model and neural network perspective of click models for web searchKnowledge and Information Systems10.1007/s10115-024-02145-z66:10(5829-5873)Online publication date: 6-Jun-2024
  • (2023)Graph Learning for Exploratory Query Suggestions in an Instant Search SystemProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615481(4780-4786)Online publication date: 21-Oct-2023
  • (2023)Exploiting Intent Evolution in E-commercial Query RecommendationProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599821(5162-5173)Online publication date: 6-Aug-2023
  • (2023)FE-TCM: Filter-Enhanced Transformer Click Model for Web SearchIEEE Access10.1109/ACCESS.2023.325946211(28680-28687)Online publication date: 2023
  • (2023)Deep Learning Methods for Query Auto CompletionAdvances in Information Retrieval10.1007/978-3-031-28241-6_35(341-348)Online publication date: 16-Mar-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media