Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3360901.3364417acmconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
research-article

Cooperative Web Agents by Combining Semantic Technologies with Reinforcement Learning

Published: 23 September 2019 Publication History

Abstract

An increasing number of Web pages are enriched with semantic annotations using, for instance, RDFa and Schema.org in order to ease data interpretation for Web browsers and search engines. Lifted data representations might enable a new class of semantic agents for complex decision-making on the Web, reaching from filling out Web forms and booking flights to mastering any Web-based task. However, exclusively using semantic models has several drawbacks, such as extensive manual efforts for enabling semantic agents to solve Web tasks or limited capabilities to personalize task solutions for end users. While using semantics is a crucial component for guiding prospective semantic agents, we have to find a balance between modelling rich background knowledge and learning optimal agent behaviour through advanced Machine Learning (ML) methods. In this work, we propose a semantic agent framework for (i) modelling agent-related semantic annotations for Web tasks, (ii) using the latter to train statistical agents using Reinforcement Learning (RL) in an offline simulation as well as in real online use and (iii) feeding back the learned agent behaviour and its provenance information in terms of a semantic model that can be directly used by purely semantics-based agents. We evaluate our approach based on the MiniWob++ benchmark for automatically solving Web tasks. We show that our proposed semantic agent framework enables to (i) warm start agents with semantic background knowledge to faster learn task-dependent optimal behaviour in terms of the expected cumulative reward and (ii) directly reuse the learned behaviour by semantic-based agents that act by automatically derived Notation3 (N3) implication rules.

References

[1]
J. Hendler. 2001. Agents and the Semantic Web. IEEE Intelligent Systems 16, 2 (March 2001), 30--37. https://doi.org/10.1109/5254.920597
[2]
Todd Hester, Matej Vecerík, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Ian Osband, Gabriel Dulac-Arnold, John Agapiou, Joel Z. Leibo, and Audrunas Gruslys. 2018. Deep Q-learning From Demonstrations. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI-18), the 30th innovative Applications of Artificial Intelligence (IAAI-18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI-18), New Orleans, Louisiana, USA, February 2--7, 2018. 3223--3230. https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16976
[3]
Pascal Hitzler, Markus Krötzsch, and Sebastian Rudolph. 2010. Foundations of Semantic Web Technologies. Chapman and Hall/CRC Press.
[4]
Jonathan Ho and Stefano Ermon. 2016. Generative Adversarial Imitation Learning. In Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5--10, 2016, Barcelona, Spain. 4565--4573. http://papers.nips.cc/paper/6391-generative-adversarial-imitationlearning
[5]
Evan Zheran Liu, Kelvin Guu, Panupong Pasupat, Tianlin Shi, and Percy Liang. 2018. Reinforcement Learning on Web Interfaces using Workflow-Guided Exploration. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. https://openreview.net/forum?id=ryTp3f-0-
[6]
Nicole Merkle and Stefan Zander. 2018. Using a Semantic Simulation Framework for Teaching Machine Learning Agents. Procedia Computer Science 137 (2018), 78 -- 89. https://doi.org/10.1016/j.procs.2018.09.008 Proceedings of the 14th International Conference on Semantic Systems 10th -- 13th of September 2018 Vienna, Austria.
[7]
Nicole Merkle, Stefan Zander, and Viliam Simko. 2018. A Semantic Use Case Simulation Framework for Training Machine Learning Algorithms. In Knowledge Engineering and Knowledge Management, Catherine Faron Zucker, Chiara Ghidini, Amedeo Napoli, and Yannick Toussaint (Eds.). Springer International Publishing, Cham, 243--257.
[8]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin A. Riedmiller. 2013. Playing Atari with Deep Reinforcement Learning. CoRR arXiv:cs/0204012 (2013). arXiv:1312.5602
[9]
Patrick Philipp, Maria Maleshkova, Achim Rettinger, and Darko Katic. 2015. A Semantic Framework for Sequential Decision Making. In Engineering the Web in the Big Data Era - 15th International Conference, ICWE 2015, Rotterdam, The Netherlands, June 23--26, 2015, Proceedings. 392--409. https://doi.org/10.1007/978- 3--319--19890--3_25
[10]
Martin L. Puterman. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley. https://doi.org/10.1002/9780470316887
[11]
Stéphane Ross, Geoffrey J. Gordon, and Drew Bagnell. 2011. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2011, Fort Lauderdale, USA, April 11--13, 2011. 627--635. http://proceedings.mlr.press/v15/ross11a/ross11a.pdf
[12]
Tianlin Shi, Andrej Karpathy, Linxi Fan, Jonathan Hernandez, and Percy Liang. 2017. World of Bits: An Open-Domain Platform for Web-Based Agents. In Proceedings of the 34th International Conference on Machine Learning (Proceedings of Machine Learning Research), Doina Precup and Yee Whye Teh (Eds.), Vol. 70. PMLR, International Convention Centre, Sydney, Australia, 3135--3144. http: //proceedings.mlr.press/v70/shi17a.html
[13]
David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Vedavyas Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy P. Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, and Demis Hassabis. 2016. Mastering the game of Go with deep neural networks and tree search. Nature 529, 7587 (2016), 484--489. https://doi.org/10.1038/nature16961
[14]
David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, et al. 2017. Mastering the game of go without human knowledge. Nature 550, 7676 (2017), 354.
[15]
Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction (second ed.). The MIT Press. http://incompleteideas.net/book/the-book- 2nd.html
[16]
Ruben Verborgh, Dörthe Arndt, Sofie Van Hoecke, Jos De Roo, Giovanni Mels, Thomas Steiner, and Joaquim Gabarró. 2017. The pragmatic proof: Hypermedia API composition and execution. TPLP 17, 1 (2017), 1--48. https://doi.org/10.1017/ S1471068416000016
[17]
Ruben Verborgh, Thomas Steiner, Davy Van Deursen, Rik Van de Walle, and Joaquim Gabarró Vallés. 2011. Efficient runtime service discovery and consumption with hyperlinked RESTdesc. In 2011 7th International Conference on Next Generation Web Services Practices. IEEE, 373--379.

Cited By

View all
  • (2023)Learning at the Edge: Mobile Edge Computing and Reinforcement Learning for Enhanced Web Application Performance2023 9th International Conference on Web Research (ICWR)10.1109/ICWR57742.2023.10138952(300-304)Online publication date: 3-May-2023
  • (2021)Intelligent software web agents: A gap analysisJournal of Web Semantics10.1016/j.websem.2021.100659(100659)Online publication date: Sep-2021
  • (undefined)Intelligent Software Web Agents: A Gap AnalysisSSRN Electronic Journal10.2139/ssrn.3945443

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
K-CAP '19: Proceedings of the 10th International Conference on Knowledge Capture
September 2019
281 pages
ISBN:9781450370080
DOI:10.1145/3360901
  • General Chairs:
  • Mayank Kejriwal,
  • Pedro Szekely,
  • Program Chair:
  • Raphaël Troncy
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 September 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. reinforcement learning
  2. semantic agents
  3. semantic web
  4. simulation
  5. web automation

Qualifiers

  • Research-article

Conference

K-CAP '19
Sponsor:
K-CAP '19: Knowledge Capture Conference
November 19 - 21, 2019
CA, Marina Del Rey, USA

Acceptance Rates

Overall Acceptance Rate 55 of 198 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Learning at the Edge: Mobile Edge Computing and Reinforcement Learning for Enhanced Web Application Performance2023 9th International Conference on Web Research (ICWR)10.1109/ICWR57742.2023.10138952(300-304)Online publication date: 3-May-2023
  • (2021)Intelligent software web agents: A gap analysisJournal of Web Semantics10.1016/j.websem.2021.100659(100659)Online publication date: Sep-2021
  • (undefined)Intelligent Software Web Agents: A Gap AnalysisSSRN Electronic Journal10.2139/ssrn.3945443

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media