Simple Reinforcement Learning tutorials, 莫烦Python Chinese-language AI tutorials
Updated Mar 31, 2024 - Python
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
A PyTorch library for building deep reinforcement learning agents.
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single-agent and multi-agent settings.
PyTorch C++ Reinforcement Learning
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]
Trading Environment (OpenAI Gym) + PPO (TensorForce)
Recurrent and multi-process PyTorch implementation of the deep reinforcement learning Actor-Critic algorithms A2C and PPO
Deep Reinforcement Learning in C#
Clean baseline implementation of PPO using an episodic TransformerXL memory
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Proximal Policy Optimization (PPO) algorithm for Contra
Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)