research-article

Aggregating E-commerce Search Results from Heterogeneous Sources via Hierarchical Reinforcement Learning

Authors:

Ryuichi Takanobu,

Bo ZhengAuthors Info & Claims

WWW '19: The World Wide Web Conference

Pages 1771 - 1781

https://doi.org/10.1145/3308558.3313455

Published: 13 May 2019 Publication History

Abstract

In this paper, we investigate the task of aggregating search results from heterogeneous sources in an E-commerce environment. First, unlike traditional aggregated web search that merely presents multi-sourced results in the first page, this new task may present aggregated results in all pages and has to dynamically decide which source should be presented in the current page. Second, as pointed out by many existing studies, it is not trivial to rank items from heterogeneous sources because the relevance scores from different source systems are not directly comparable. To address these two issues, we decompose the task into two subtasks in a hierarchical structure: a high-level task for source selection where we model the sequential patterns of user behaviors onto aggregated results in different pages so as to understand user intents and select the relevant sources properly; and a low-level task for item presentation where we formulate a slot filling process to sequentially present the items instead of giving each item a relevance score when deciding the presentation order of heterogeneous items. Since both subtasks can be naturally formulated as sequential decision problems and learn from the future user feedback on search results, we build our model with hierarchical reinforcement learning. Extensive experiments demonstrate that our model obtains remarkable improvements in search performance metrics, and achieves a higher user satisfaction.

References

[1]

Jaime Arguello. 2017. Aggregated search. Foundations and Trends in Information Retrieval10, 5 (2017), 365-502.

Digital Library

[2]

Jaime Arguello, Fernando Diaz, and Jamie Callan. 2011. Learning to aggregate vertical results into web search results. In Proc. 20th ACM Int. Conf. Information and Knowledge Management. 201-210.

Digital Library

[3]

Jaime Arguello, Fernando Diaz, Jamie Callan, and Ben Carterette. 2011. A methodology for evaluating aggregated search results. In Proc. 33rd European Conf. Information Retrieval. 141-152.

Digital Library

[4]

Jaime Arguello, Fernando Diaz, Jamie Callan, and Jean-Francois Crespo. 2009. Sources of evidence for vertical selection. In Proc. 32nd Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 315-322.

Digital Library

[5]

Andrew G Barto and Sridhar Mahadevan. 2003. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems13 (2003), 341-379.

Digital Library

[6]

Horatiu Bota, Ke Zhou, Joemon M Jose, and Mounia Lalmas. 2014. Composite retrieval of heterogeneous web search. In Proc. 23rd Int. Conf. World Wide Web. 119-130.

Digital Library

[7]

Marc Bron, Jasmijn Van Gorp, Frank Nack, Lotte Belice Baltussen, and Maarten de Rijke. 2013. Aggregated search interface preferences in multi-session search tasks. In Proc. 36th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 123-132.

Digital Library

[8]

Danqi Chen, Weizhu Chen, Haixun Wang, Zheng Chen, and Qiang Yang. 2012. Beyond ten blue links: Enabling user click modeling in federated web search. In Proc. 5th ACM Int. Conf. Web Search and Data Mining. 463-472.

Digital Library

[9]

Ye Chen, Yiqun Liu, Ke Zhou, Meng Wang, Min Zhang, and Shaoping Ma. 2015. Does vertical bring more satisfaction?: Predicting search satisfaction in a heterogeneous environment. In Proc. 24th ACM Int. Conf. Information and Knowledge Management. 1581-1590.

Digital Library

[10]

Aleksandr Chuklin, Anne Schuth, Katja Hofmann, Pavel Serdyukov, and Maarten De Rijke. 2013. Evaluating aggregated search using interleaving. In Proc. 22nd ACM Int. Conf. Information and Knowledge Management. 669-678.

Digital Library

[11]

Fernando Diaz. 2009. Integration of news content into web results. In Proc. 2nd ACM Int. Conf. Web Search and Data Mining. 182-191.

Digital Library

[12]

Jun Feng, Heng Li, Minlie Huang, Shichen Liu, Wenwu Ou, Zhirong Wang, and Xiaoyan Zhu. 2018. Learning to collaborate: Multi-scenario ranking via multi-agent reinforcement learning. In Proc. 27th Int. Conf. World Wide Web. 1939-1948.

Digital Library

[13]

Hado van Hasselt, Arthur Guez, and David Silver. 2016. Deep reinforcement learning with double Q-Learning. In Proc. 30th AAAI Conf. Artificial Intelligence. 2094-2100.

Digital Library

[14]

Matthew Hausknecht and Peter Stone. 2015. Deep recurrent Q-Learning for partially observable MDPs. In Proc. 29th AAAI Conf. Artificial Intelligence, Fall Symp. Series, Sequential Decision Making for Intelligent Agents. 29-37.

[15]

Dzung Hong, Luo Si, Paul Bracke, Michael Witt, and Tim Juchcinski. 2010. A joint probabilistic classification model for resource selection. In Proc. 33rd Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 98-105.

Digital Library

[16]

Luo Jie, Sudarshan Lamkhede, Rochit Sapra, Evans Hsu, Helen Song, and Yi Chang. 2013. A unified search federation system based on online user feedback. In Proc. 19th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining. 1195-1203.

Digital Library

[17]

Arlind Kopliku, Karen Pinel-Sauvagnat, and Mohand Boughanem. 2014. Aggregated search: A new information retrieval paradigm. Comput. Surveys46, 3 (2014), 41.

Digital Library

[18]

Tejas D Kulkarni, Karthik Narasimhan, Ardavan Saeedi, and Josh Tenenbaum. 2016. Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation. In Proc. 30th Annu. Conf. Neural Information Processing Systems. 3675-3683.

Digital Library

[19]

Or Levi, Ido Guy, Fiana Raiber, and Oren Kurland. 2018. Selective cluster presentation on the search results page. ACM Transactions on Information Systems36, 3 (2018), 28.

Digital Library

[20]

Zeyang Liu, Yiqun Liu, Ke Zhou, Min Zhang, and Shaoping Ma. 2015. Influence of vertical result in web search examination. In Proc. 38th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 193-202.

Digital Library

[21]

Bo Long and Yi Chang. 2014. Relevance ranking for vertical search engines. Morgan Kaufmann Publishers Inc.

Digital Library

[22]

Ilya Markov, Eugene Kharitonov, Vadim Nikulin, Pavel Serdyukov, Maarten De Rijke, and Fabio Crestani. 2014. Vertical-aware click model-based effectiveness metrics. In Proc. 23rd ACM Int. Conf. Information and Knowledge Management. 1867-1870.

Digital Library

[23]

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. In Proc. 27th Annu. Conf. Neural Information Processing Systems, Deep Learning Workshop.

[24]

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. 2015. Human-level control through deep reinforcement learning. Nature518, 7540 (2015), 529-533.

[25]

Harrie Oosterhuis and Maarten de Rijke. 2018. Ranking for relevance and display preferences in complex presentation layouts. In Proc. 41st Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 845-854.

Digital Library

[26]

Baolin Peng, Xiujun Li, Lihong Li, Jianfeng Gao, Asli Celikyilmaz, Sungjin Lee, and Kam-Fai Wong. 2017. Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning. In Proc. 22nd Conf. Empirical Methods in Natural Language Processing. 2231-2240.

[27]

Ashok Kumar Ponnuswami, Kumaresh Pattabiraman, Desmond Brand, and Tapas Kanungo. 2011. Model characterization curves for federated search using click-logs: predicting user engagement metrics for the span of feasible operating points. In Proc. 20th Int. Conf. World Wide Web. 67-76.

Digital Library

[28]

Ashok Kumar Ponnuswami, Kumaresh Pattabiraman, Qiang Wu, Ran Gilad-Bachrach, and Tapas Kanungo. 2011. On composition of a federated web search result page: Using online users to provide pairwise preference for heterogeneous verticals. In Proc. 4th ACM Int. Conf. Web Search and Data Mining. 715-724.

Digital Library

[29]

Tom Schaul, Daniel Horgan, Karol Gregor, and David Silver. 2015. Universal value function approximators. In Proc. 32nd Int. Conf. Machine Learning. 1312-1320.

Digital Library

[30]

Luo Si and Jamie Callan. 2003. Relevant document distribution estimation method for resource selection. In Proc. 26th Int. ACM SIGIR Conf. Research and Development in Informaion Retrieval. 298-305.

Digital Library

[31]

Shanu Sushmita, Hideo Joho, Mounia Lalmas, and Robert Villa. 2010. Factors affecting click-through behavior in aggregated search interfaces. In Proc. 19th ACM Int. Conf. Information and Knowledge Management. 519-528.

Digital Library

[32]

Richard S Sutton and Andrew G Barto. 1998. Reinforcement learning: An introduction. MIT press.

Digital Library

[33]

Richard S Sutton, Doina Precup, and Satinder Singh. 1999. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence112 (1999), 181-211.

Digital Library

[34]

Ryuichi Takanobu, Tianyang Zhang, Jiexi Liu, and Minlie Huang. 2019. A Hierarchical Framework for Relation Extraction with Reinforcement Learning. In Proc. 33rd AAAI Conf. Artificial Intelligence.

Digital Library

[35]

Chen Tessler, Shahar Givony, Tom Zahavy, Daniel J Mankowitz, and Shie Mannor. 2017. A deep hierarchical approach to lifelong learning in Minecraft. In Proc. 31st AAAI Conf. Artificial Intelligence. 1553-1561.

Digital Library

[36]

Gilad Tsur, Yuval Pinter, Idan Szpektor, and David Carmel. 2016. Identifying web queries with question intent. In Proc. 25th Int. Conf. World Wide Web. 783-793.

Digital Library

[37]

Lauren Turpin, Diane Kelly, and Jaime Arguello. 2016. To blend or not to blend?: Perceptual speed, visual memory and aggregated search. In Proc. 39th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 1021-1024.

Digital Library

[38]

Chao Wang, Yiqun Liu, Min Zhang, Shaoping Ma, Meihong Zheng, Jing Qian, and Kuo Zhang. 2013. Incorporating vertical results into search click models. In Proc. 36th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 503-512.

Digital Library

[39]

Yue Wang, Dawei Yin, Luo Jie, Pengyuan Wang, Makoto Yamada, Yi Chang, and Qiaozhu Mei. 2016. Beyond ranking: Optimizing whole-page presentation. In Proc. 9th ACM Int. Conf. Web Search and Data Mining. 103-112.

Digital Library

[40]

Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, and Nando de Freitas. 2016. Dueling Network Architectures for Deep Reinforcement Learning. In Proc. 33rd Int. Conf. Machine Learning. 1995-2003.

Digital Library

[41]

Long Xia, Jun Xu, Yanyan Lan, Jiafeng Guo, Wei Zeng, and Xueqi Cheng. 2017. Adapting Markov decision process for search result diversification. In Proc. 40th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 535-544.

Digital Library

[42]

Junqi Zhang, Yiqun Liu, Shaoping Ma, and Qi Tian. 2018. Relevance Estimation with Multiple Information Sources on Search Engine Result Pages. In Proc. 27th ACM Int. Conf. Information and Knowledge Management. 627-636.

Digital Library

[43]

Ke Zhou, Ronan Cummins, Mounia Lalmas, and Joemon M Jose. 2012. Evaluating aggregated search pages. In Proc. 35th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 115-124.

Digital Library

[44]

Ke Zhou, Ronan Cummins, Mounia Lalmas, and Joemon M Jose. 2013. Which vertical search engines are relevant?. In Proc. 22nd Int. Conf. World Wide Web. 1557-1568.

Digital Library

[45]

Tao Zhuang, Wenwu Ou, and Zhirong Wang. 2018. Globally Optimized Mutual Influence Aware Ranking in E-Commerce Search. In Proc. 27th Int. Joint Conf. Artificial Intelligence. 3725-3731.

Digital Library

Cited By

Garba AKhalid SAleryni AUllah ITairan NShah HMumin D(2024)Utilizing Ant Colony Optimization for Result Merging in Federated SearchEngineering, Technology & Applied Science Research10.48084/etasr.730214:4(14832-14839)Online publication date: 2-Aug-2024
https://doi.org/10.48084/etasr.7302
Zhang YShao WChen XDu YXu XZheng DPei CZhang SJiang PGai K(2023)A Multi-Agent Framework for Recommendation with Heterogeneous Sources2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191154(1-8)Online publication date: 18-Jun-2023
https://doi.org/10.1109/IJCNN54540.2023.10191154
Zaman MKhan ARashid U(2023)Towards Summarization of Aggregated Multimedia Verticals Web Search Results2023 18th International Conference on Emerging Technologies (ICET)10.1109/ICET59753.2023.10374811(263-268)Online publication date: 6-Nov-2023
https://doi.org/10.1109/ICET59753.2023.10374811
Show More Cited By

Recommendations

Composite retrieval of heterogeneous web search
WWW '14: Proceedings of the 23rd international conference on World wide web

Traditional search systems generally present a ranked list of documents as answers to user queries. In aggregated search systems, results from different and increasingly diverse verticals (image, video, news, etc.) are returned to users. For instance, ...
Hierarchical Reinforcement Learning: A Comprehensive Survey

Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of challenging long-horizon decision-making tasks into simpler subtasks. During the past years, the landscape of HRL research has grown profoundly, resulting in copious ...
The Effects of Aggregated Search Coherence on Search Behavior

Aggregated search is the task of combining results from multiple independent search systems in a single Search Engine Results Page (SERP). Aggregated search coherence refers to the extent to which different sources on the SERP focus on similar senses of ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '19: The World Wide Web Conference

May 2019

3620 pages

ISBN:9781450366748

DOI:10.1145/3308558

Editors:
Ling Liu
Georgia Tech, USA
,
Ryen White
Microsoft Research, USA

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

IW3C2: International World Wide Web Conference Committee

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

WWW '19

WWW '19: The Web Conference

May 13 - 17, 2019

CA, San Francisco, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

9
Total Citations
View Citations
444
Total Downloads

Downloads (Last 12 months)35
Downloads (Last 6 weeks)3

Reflects downloads up to 30 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Garba AKhalid SAleryni AUllah ITairan NShah HMumin D(2024)Utilizing Ant Colony Optimization for Result Merging in Federated SearchEngineering, Technology & Applied Science Research10.48084/etasr.730214:4(14832-14839)Online publication date: 2-Aug-2024
https://doi.org/10.48084/etasr.7302
Zhang YShao WChen XDu YXu XZheng DPei CZhang SJiang PGai K(2023)A Multi-Agent Framework for Recommendation with Heterogeneous Sources2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191154(1-8)Online publication date: 18-Jun-2023
https://doi.org/10.1109/IJCNN54540.2023.10191154
Zaman MKhan ARashid U(2023)Towards Summarization of Aggregated Multimedia Verticals Web Search Results2023 18th International Conference on Emerging Technologies (ICET)10.1109/ICET59753.2023.10374811(263-268)Online publication date: 6-Nov-2023
https://doi.org/10.1109/ICET59753.2023.10374811
Chen XYao LMcAuley JZhou GWang X(2023)Deep reinforcement learning in recommender systemsKnowledge-Based Systems10.1016/j.knosys.2023.110335264:COnline publication date: 9-Mar-2023
https://dl.acm.org/doi/10.1016/j.knosys.2023.110335
Zhang HZhao PXian XSheng VHao YCui Z(2023)Click is not equal to purchase: multi-task reinforcement learning for multi-behavior recommendationWorld Wide Web10.1007/s11280-023-01215-626:6(4153-4172)Online publication date: 20-Dec-2023
https://doi.org/10.1007/s11280-023-01215-6
Cai QCui CXiong YWang WXie ZZhang M(2022)A Survey on Deep Reinforcement Learning for Data Processing and AnalyticsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3155196(1-1)Online publication date: 2022
https://doi.org/10.1109/TKDE.2022.3155196
Achsas SNfaoui E(2022)Academic Aggregated Search Approach Based on BERT Language Model2022 2nd International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)10.1109/IRASET52964.2022.9737888(1-9)Online publication date: 3-Mar-2022
https://doi.org/10.1109/IRASET52964.2022.9737888
HE XAn BLi YChen HWang RWang XYu RLi XWang Z(2020)Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without CommunicationProceedings of the 14th ACM Conference on Recommender Systems10.1145/3383313.3412233(210-219)Online publication date: 22-Sep-2020
https://dl.acm.org/doi/10.1145/3383313.3412233
Li LSun LWeng CHuo CRen Wd'Aquin MDietze SHauff CCurry ECudre Mauroux P(2020)Spending Money WiselyProceedings of the 29th ACM International Conference on Information & Knowledge Management10.1145/3340531.3412745(2597-2604)Online publication date: 19-Oct-2020
https://dl.acm.org/doi/10.1145/3340531.3412745
Song JLi ZHu ZWu YLi ZLi JGao J(2020)PoisonRec: An Adaptive Data Poisoning Framework for Attacking Black-box Recommender Systems2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00021(157-168)Online publication date: Apr-2020
https://doi.org/10.1109/ICDE48307.2020.00021

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents