Evaluating a policy by deploying it in the real world can be risky and costly. Off-policy policy evaluation (OPE) algorithms instead use historical data collected under a different behavior policy, typically reweighting it with importance sampling; over long horizons, however, the variance of importance sampling estimators grows quickly. The paper proposes using policies over temporally extended actions, called options, to address this long-horizon problem, and shows theoretically and experimentally that combining options-based policies with importance sampling can significantly improve OPE performance.
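For reference, a sketch of the standard per-trajectory importance sampling estimator, written in standard notation rather than copied from the paper (pi_e is the evaluation policy, pi_b the behavior policy, H the horizon, and G^(i) the return of trajectory i):

\hat{V}_{\mathrm{IS}} \;=\; \frac{1}{n}\sum_{i=1}^{n}\left(\prod_{t=0}^{H-1}\frac{\pi_e\big(a_t^{(i)}\mid s_t^{(i)}\big)}{\pi_b\big(a_t^{(i)}\mid s_t^{(i)}\big)}\right) G^{(i)}

The product contains one likelihood ratio per time step, so its variance can grow exponentially with H. When both policies choose among options, the product only ranges over option-selection steps, of which there are far fewer on long-horizon tasks; this is the intuition suggested by the abstract for why option-level importance sampling helps.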
In particular, the authors investigate how options influence the variance of importance sampling estimators, extending the trajectory lengths that off-policy evaluation can handle.
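A minimal toy sketch of that variance effect follows; it is not the paper's algorithm (the covariance-testing component is not shown), and the horizon, option length, and action probabilities are illustrative assumptions. It compares the spread of per-trajectory importance weights when one likelihood ratio is accumulated per primitive action versus one per option choice.

```python
# Toy illustration: per-trajectory importance sampling weights computed at the
# primitive-action level vs. at the option level. Fewer likelihood ratios in
# the product generally means lower variance of the resulting weights.
import numpy as np

rng = np.random.default_rng(0)

H = 100      # primitive-action horizon
K = 10       # option choices per trajectory (each option lasts H // K steps)
n = 10_000   # number of behavior-policy trajectories

# Two actions (or two options) with these selection probabilities.
p_b, p_e = 0.5, 0.6  # behavior vs. evaluation probability of choosing action 0

def is_weights(num_decisions):
    """Product of likelihood ratios over `num_decisions` independent choices."""
    choices = rng.random((n, num_decisions)) < p_b  # sampled from the behavior policy
    ratios = np.where(choices, p_e / p_b, (1 - p_e) / (1 - p_b))
    return ratios.prod(axis=1)

w_primitive = is_weights(H)  # one ratio per primitive action
w_options = is_weights(K)    # one ratio per option choice

print("std of weights, primitive-level:", w_primitive.std())
print("std of weights, option-level:   ", w_options.std())
```

Running this shows the option-level weights concentrating much more tightly around 1 than the primitive-level weights, which is the effect the paper exploits for long-horizon OPE.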
The paper appears as: Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation, in Advances in Neural Information Processing Systems 30 (NIPS 2017), pp. 2489 ff.