on-policy

Here are 8 public repositories matching this topic...

MarcoMeter / episodic-transformer-memory-ppo

Clean baseline implementation of PPO using an episodic TransformerXL memory

deep-reinforcement-learning pytorch transformer policy-gradient pomdp actor-critic proximal-policy-optimization ppo on-policy episodic-memory transformer-xl gtrxl trxl gated-transformer-xl memory-gym

Updated Jun 18, 2024
Python

wisnunugroho21 / reinforcement_learning_truly_ppo

Deep reinforcement learning using Truly Proximal Policy Optimization in TensorFlow 2 and PyTorch

reinforcement-learning deep-learning deep-reinforcement-learning pytorch ppo on-policy

Updated Dec 31, 2020
Python

wisnunugroho21 / reinforcement_learning_v_mpo

Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)

reinforcement-learning deep-reinforcement-learning pytorch on-policy v-mpo

Updated Oct 23, 2021
Python

kristogj / on-policy-mcts

Monte Carlo Tree Search for training a shared actor-critic network on the game Hex 🏋️

hex reinforcement-learning pytorch mcts on-policy

Updated May 5, 2020
Python

BY571 / pytorch-vmpo

PyTorch implementation of V-MPO

reinforcement-learning on-policy pytorch-implementation v-mpo vmpo

Updated Sep 29, 2022
Python

nima-siboni / simplest-world-Actor-Critic

Reinforcement learning, Policy Gradient, Actor-Critic, AC, Agent-based Simulation, Simple-world

reinforcement-learning monte-carlo-simulation actor-critic-algorithm on-policy reinforcement-learning-environments

Updated Jul 31, 2020
Python

mabirck / CS294-DeepRL

My content for the CS294 Deep Reinforcement Learning course, conducted by Sergey Levine at UC Berkeley.

deep-neural-networks reinforcement-learning deep-learning deep-reinforcement-learning pytorch neural-networks policy-gradient reinforcement pytorch-tutorials cs294 on-policy off-policy

Updated Jan 15, 2018
Python

srefsland / deep-rl-mcts

On-policy MCTS combined with deep learning to train an actor-critic neural network that plays Hex (Con-tac-tix).

python hex reinforcement-learning tensorflow deep-reinforcement-learning mcts monte-carlo-tree-search actor-critic on-policy

Updated Nov 25, 2024
Python
