Nothing Special   »   [go: up one dir, main page]

Skip to content
#

actor-critic

Here are 346 public repositories matching this topic...

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • Updated May 29, 2022
  • Python

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

  • Updated Jan 22, 2019
  • Jupyter Notebook

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

  • Updated Jan 16, 2021
  • Python

Improve this page

Add a description, image, and links to the actor-critic topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the actor-critic topic, visit your repo's landing page and select "manage topics."

Learn more