Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3308558.3313455acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Aggregating E-commerce Search Results from Heterogeneous Sources via Hierarchical Reinforcement Learning

Published: 13 May 2019 Publication History

Abstract

In this paper, we investigate the task of aggregating search results from heterogeneous sources in an E-commerce environment. First, unlike traditional aggregated web search that merely presents multi-sourced results in the first page, this new task may present aggregated results in all pages and has to dynamically decide which source should be presented in the current page. Second, as pointed out by many existing studies, it is not trivial to rank items from heterogeneous sources because the relevance scores from different source systems are not directly comparable. To address these two issues, we decompose the task into two subtasks in a hierarchical structure: a high-level task for source selection where we model the sequential patterns of user behaviors onto aggregated results in different pages so as to understand user intents and select the relevant sources properly; and a low-level task for item presentation where we formulate a slot filling process to sequentially present the items instead of giving each item a relevance score when deciding the presentation order of heterogeneous items. Since both subtasks can be naturally formulated as sequential decision problems and learn from the future user feedback on search results, we build our model with hierarchical reinforcement learning. Extensive experiments demonstrate that our model obtains remarkable improvements in search performance metrics, and achieves a higher user satisfaction.

References

[1]
Jaime Arguello. 2017. Aggregated search. Foundations and Trends in Information Retrieval10, 5 (2017), 365-502.
[2]
Jaime Arguello, Fernando Diaz, and Jamie Callan. 2011. Learning to aggregate vertical results into web search results. In Proc. 20th ACM Int. Conf. Information and Knowledge Management. 201-210.
[3]
Jaime Arguello, Fernando Diaz, Jamie Callan, and Ben Carterette. 2011. A methodology for evaluating aggregated search results. In Proc. 33rd European Conf. Information Retrieval. 141-152.
[4]
Jaime Arguello, Fernando Diaz, Jamie Callan, and Jean-Francois Crespo. 2009. Sources of evidence for vertical selection. In Proc. 32nd Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 315-322.
[5]
Andrew G Barto and Sridhar Mahadevan. 2003. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems13 (2003), 341-379.
[6]
Horatiu Bota, Ke Zhou, Joemon M Jose, and Mounia Lalmas. 2014. Composite retrieval of heterogeneous web search. In Proc. 23rd Int. Conf. World Wide Web. 119-130.
[7]
Marc Bron, Jasmijn Van Gorp, Frank Nack, Lotte Belice Baltussen, and Maarten de Rijke. 2013. Aggregated search interface preferences in multi-session search tasks. In Proc. 36th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 123-132.
[8]
Danqi Chen, Weizhu Chen, Haixun Wang, Zheng Chen, and Qiang Yang. 2012. Beyond ten blue links: Enabling user click modeling in federated web search. In Proc. 5th ACM Int. Conf. Web Search and Data Mining. 463-472.
[9]
Ye Chen, Yiqun Liu, Ke Zhou, Meng Wang, Min Zhang, and Shaoping Ma. 2015. Does vertical bring more satisfaction?: Predicting search satisfaction in a heterogeneous environment. In Proc. 24th ACM Int. Conf. Information and Knowledge Management. 1581-1590.
[10]
Aleksandr Chuklin, Anne Schuth, Katja Hofmann, Pavel Serdyukov, and Maarten De Rijke. 2013. Evaluating aggregated search using interleaving. In Proc. 22nd ACM Int. Conf. Information and Knowledge Management. 669-678.
[11]
Fernando Diaz. 2009. Integration of news content into web results. In Proc. 2nd ACM Int. Conf. Web Search and Data Mining. 182-191.
[12]
Jun Feng, Heng Li, Minlie Huang, Shichen Liu, Wenwu Ou, Zhirong Wang, and Xiaoyan Zhu. 2018. Learning to collaborate: Multi-scenario ranking via multi-agent reinforcement learning. In Proc. 27th Int. Conf. World Wide Web. 1939-1948.
[13]
Hado van Hasselt, Arthur Guez, and David Silver. 2016. Deep reinforcement learning with double Q-Learning. In Proc. 30th AAAI Conf. Artificial Intelligence. 2094-2100.
[14]
Matthew Hausknecht and Peter Stone. 2015. Deep recurrent Q-Learning for partially observable MDPs. In Proc. 29th AAAI Conf. Artificial Intelligence, Fall Symp. Series, Sequential Decision Making for Intelligent Agents. 29-37.
[15]
Dzung Hong, Luo Si, Paul Bracke, Michael Witt, and Tim Juchcinski. 2010. A joint probabilistic classification model for resource selection. In Proc. 33rd Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 98-105.
[16]
Luo Jie, Sudarshan Lamkhede, Rochit Sapra, Evans Hsu, Helen Song, and Yi Chang. 2013. A unified search federation system based on online user feedback. In Proc. 19th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining. 1195-1203.
[17]
Arlind Kopliku, Karen Pinel-Sauvagnat, and Mohand Boughanem. 2014. Aggregated search: A new information retrieval paradigm. Comput. Surveys46, 3 (2014), 41.
[18]
Tejas D Kulkarni, Karthik Narasimhan, Ardavan Saeedi, and Josh Tenenbaum. 2016. Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation. In Proc. 30th Annu. Conf. Neural Information Processing Systems. 3675-3683.
[19]
Or Levi, Ido Guy, Fiana Raiber, and Oren Kurland. 2018. Selective cluster presentation on the search results page. ACM Transactions on Information Systems36, 3 (2018), 28.
[20]
Zeyang Liu, Yiqun Liu, Ke Zhou, Min Zhang, and Shaoping Ma. 2015. Influence of vertical result in web search examination. In Proc. 38th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 193-202.
[21]
Bo Long and Yi Chang. 2014. Relevance ranking for vertical search engines. Morgan Kaufmann Publishers Inc.
[22]
Ilya Markov, Eugene Kharitonov, Vadim Nikulin, Pavel Serdyukov, Maarten De Rijke, and Fabio Crestani. 2014. Vertical-aware click model-based effectiveness metrics. In Proc. 23rd ACM Int. Conf. Information and Knowledge Management. 1867-1870.
[23]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. In Proc. 27th Annu. Conf. Neural Information Processing Systems, Deep Learning Workshop.
[24]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. 2015. Human-level control through deep reinforcement learning. Nature518, 7540 (2015), 529-533.
[25]
Harrie Oosterhuis and Maarten de Rijke. 2018. Ranking for relevance and display preferences in complex presentation layouts. In Proc. 41st Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 845-854.
[26]
Baolin Peng, Xiujun Li, Lihong Li, Jianfeng Gao, Asli Celikyilmaz, Sungjin Lee, and Kam-Fai Wong. 2017. Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning. In Proc. 22nd Conf. Empirical Methods in Natural Language Processing. 2231-2240.
[27]
Ashok Kumar Ponnuswami, Kumaresh Pattabiraman, Desmond Brand, and Tapas Kanungo. 2011. Model characterization curves for federated search using click-logs: predicting user engagement metrics for the span of feasible operating points. In Proc. 20th Int. Conf. World Wide Web. 67-76.
[28]
Ashok Kumar Ponnuswami, Kumaresh Pattabiraman, Qiang Wu, Ran Gilad-Bachrach, and Tapas Kanungo. 2011. On composition of a federated web search result page: Using online users to provide pairwise preference for heterogeneous verticals. In Proc. 4th ACM Int. Conf. Web Search and Data Mining. 715-724.
[29]
Tom Schaul, Daniel Horgan, Karol Gregor, and David Silver. 2015. Universal value function approximators. In Proc. 32nd Int. Conf. Machine Learning. 1312-1320.
[30]
Luo Si and Jamie Callan. 2003. Relevant document distribution estimation method for resource selection. In Proc. 26th Int. ACM SIGIR Conf. Research and Development in Informaion Retrieval. 298-305.
[31]
Shanu Sushmita, Hideo Joho, Mounia Lalmas, and Robert Villa. 2010. Factors affecting click-through behavior in aggregated search interfaces. In Proc. 19th ACM Int. Conf. Information and Knowledge Management. 519-528.
[32]
Richard S Sutton and Andrew G Barto. 1998. Reinforcement learning: An introduction. MIT press.
[33]
Richard S Sutton, Doina Precup, and Satinder Singh. 1999. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence112 (1999), 181-211.
[34]
Ryuichi Takanobu, Tianyang Zhang, Jiexi Liu, and Minlie Huang. 2019. A Hierarchical Framework for Relation Extraction with Reinforcement Learning. In Proc. 33rd AAAI Conf. Artificial Intelligence.
[35]
Chen Tessler, Shahar Givony, Tom Zahavy, Daniel J Mankowitz, and Shie Mannor. 2017. A deep hierarchical approach to lifelong learning in Minecraft. In Proc. 31st AAAI Conf. Artificial Intelligence. 1553-1561.
[36]
Gilad Tsur, Yuval Pinter, Idan Szpektor, and David Carmel. 2016. Identifying web queries with question intent. In Proc. 25th Int. Conf. World Wide Web. 783-793.
[37]
Lauren Turpin, Diane Kelly, and Jaime Arguello. 2016. To blend or not to blend?: Perceptual speed, visual memory and aggregated search. In Proc. 39th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 1021-1024.
[38]
Chao Wang, Yiqun Liu, Min Zhang, Shaoping Ma, Meihong Zheng, Jing Qian, and Kuo Zhang. 2013. Incorporating vertical results into search click models. In Proc. 36th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 503-512.
[39]
Yue Wang, Dawei Yin, Luo Jie, Pengyuan Wang, Makoto Yamada, Yi Chang, and Qiaozhu Mei. 2016. Beyond ranking: Optimizing whole-page presentation. In Proc. 9th ACM Int. Conf. Web Search and Data Mining. 103-112.
[40]
Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, and Nando de Freitas. 2016. Dueling Network Architectures for Deep Reinforcement Learning. In Proc. 33rd Int. Conf. Machine Learning. 1995-2003.
[41]
Long Xia, Jun Xu, Yanyan Lan, Jiafeng Guo, Wei Zeng, and Xueqi Cheng. 2017. Adapting Markov decision process for search result diversification. In Proc. 40th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 535-544.
[42]
Junqi Zhang, Yiqun Liu, Shaoping Ma, and Qi Tian. 2018. Relevance Estimation with Multiple Information Sources on Search Engine Result Pages. In Proc. 27th ACM Int. Conf. Information and Knowledge Management. 627-636.
[43]
Ke Zhou, Ronan Cummins, Mounia Lalmas, and Joemon M Jose. 2012. Evaluating aggregated search pages. In Proc. 35th Int. ACM SIGIR Conf. Research and Development in Information Retrieval. 115-124.
[44]
Ke Zhou, Ronan Cummins, Mounia Lalmas, and Joemon M Jose. 2013. Which vertical search engines are relevant?. In Proc. 22nd Int. Conf. World Wide Web. 1557-1568.
[45]
Tao Zhuang, Wenwu Ou, and Zhirong Wang. 2018. Globally Optimized Mutual Influence Aware Ranking in E-Commerce Search. In Proc. 27th Int. Joint Conf. Artificial Intelligence. 3725-3731.

Cited By

View all
  • (2024)Utilizing Ant Colony Optimization for Result Merging in Federated SearchEngineering, Technology & Applied Science Research10.48084/etasr.730214:4(14832-14839)Online publication date: 2-Aug-2024
  • (2023)A Multi-Agent Framework for Recommendation with Heterogeneous Sources2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191154(1-8)Online publication date: 18-Jun-2023
  • (2023)Towards Summarization of Aggregated Multimedia Verticals Web Search Results2023 18th International Conference on Emerging Technologies (ICET)10.1109/ICET59753.2023.10374811(263-268)Online publication date: 6-Nov-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '19: The World Wide Web Conference
May 2019
3620 pages
ISBN:9781450366748
DOI:10.1145/3308558
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • IW3C2: International World Wide Web Conference Committee

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. aggregated search
  2. hierarchical reinforcement learning
  3. user feedback
  4. vertical

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

WWW '19
WWW '19: The Web Conference
May 13 - 17, 2019
CA, San Francisco, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)35
  • Downloads (Last 6 weeks)3
Reflects downloads up to 30 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Utilizing Ant Colony Optimization for Result Merging in Federated SearchEngineering, Technology & Applied Science Research10.48084/etasr.730214:4(14832-14839)Online publication date: 2-Aug-2024
  • (2023)A Multi-Agent Framework for Recommendation with Heterogeneous Sources2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191154(1-8)Online publication date: 18-Jun-2023
  • (2023)Towards Summarization of Aggregated Multimedia Verticals Web Search Results2023 18th International Conference on Emerging Technologies (ICET)10.1109/ICET59753.2023.10374811(263-268)Online publication date: 6-Nov-2023
  • (2023)Deep reinforcement learning in recommender systemsKnowledge-Based Systems10.1016/j.knosys.2023.110335264:COnline publication date: 9-Mar-2023
  • (2023)Click is not equal to purchase: multi-task reinforcement learning for multi-behavior recommendationWorld Wide Web10.1007/s11280-023-01215-626:6(4153-4172)Online publication date: 20-Dec-2023
  • (2022)A Survey on Deep Reinforcement Learning for Data Processing and AnalyticsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3155196(1-1)Online publication date: 2022
  • (2022)Academic Aggregated Search Approach Based on BERT Language Model2022 2nd International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)10.1109/IRASET52964.2022.9737888(1-9)Online publication date: 3-Mar-2022
  • (2020)Learning to Collaborate in Multi-Module Recommendation via Multi-Agent Reinforcement Learning without CommunicationProceedings of the 14th ACM Conference on Recommender Systems10.1145/3383313.3412233(210-219)Online publication date: 22-Sep-2020
  • (2020)Spending Money WiselyProceedings of the 29th ACM International Conference on Information & Knowledge Management10.1145/3340531.3412745(2597-2604)Online publication date: 19-Oct-2020
  • (2020)PoisonRec: An Adaptive Data Poisoning Framework for Attacking Black-box Recommender Systems2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00021(157-168)Online publication date: Apr-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media