Investigating Practical Linear Temporal Difference Learning.

AllBooks Videos Images Maps News Shopping

[1602.08771] Investigating practical linear temporal difference learning

Feb 28, 2016 · This paper contains two main contributions. First, we derive two new hybrid TD policy-evaluation algorithms, which fill a gap in this collection ...

Scholarly articles for Investigating Practical Linear Temporal Difference Learning.

scholar.google.com › citations

Investigating practical linear temporal difference …
White · Cited by 47

[PDF] Investigating Practical Linear Temporal Difference Learning

aamas.csc.liv.ac.uk › pdfs

ABSTRACT. Off-policy reinforcement learning has many applications in- cluding: learning from demonstration, learning multiple goal.

[PDF] Investigating Practical Linear Temporal Difference Learning ...

www.semanticscholar.org › paper

This paper derives two new hybrid TD policy-evaluation algorithms, which fill a gap in this collection of algorithms and performs an empirical comparison to ...

(PDF) Investigating practical linear temporal difference learning

www.researchgate.net › download

This paper contains two main contributions. First, we derive two new hybrid TD policy-evaluation algorithms, which fill a gap in this collection of algorithms.

Investigating Practical Linear Temporal Difference Learning

dl.acm.org › doi

First, we derive two new hybrid TD policy-evaluation algorithms, which fill a gap in this collection of algorithms. Second, we perform an empirical comparison ...

Chapter 9 Temporal-Difference Learning

web.stanford.edu › group › handbookch10

TD learning is an unsupervised technique in which the learning agent learns to predict the expected value of a variable occurring at the end of a sequence of ...

Practical issues in temporal difference learning

link.springer.com › content › pdf

The evaluation function learned in Samuel's study was a linear function of the input variables, whereas multilayer networks learn more complex nonlinear.

Investigating practical, linear temporal difference learning. - dblp

dblp.uni-trier.de › rec › corr › WhiteW16

Bibliographic details on Investigating practical, linear temporal difference learning.

[PDF] Practical Issues in Temporal Difference Learning

proceedings.neurips.cc › paper › file

This paper examines whether temporal difference methods for training connectionist networks, such as Suttons's TO('\) algorithm, can be suc-.

Missing: Investigating | Show results with:Investigating

[PDF] Linear Least-Squares algorithms for temporal difference learning

scholarworks.umass.edu › download

They allow a system to learn to predict the total amount of reward expected over time, and they can be used for other prediction problems as well (Anderson, ...