ReinholdM
:shipit:
crazy at ASR & RL

Highlights

  • Pro

Pinned

  1. Papers-of-Offline-RL Public

    Related papers on offline reinforcement learning (we mainly focus on representation, sequence modeling, and conventional offline RL)

    17 3

  2. Offline-Pre-trained-Multi-Agent-Decision-Transformer Public

    Python 106 16

  3. AlphaZero_Gomoku Public

    Forked from junxiaosong/AlphaZero_Gomoku

    An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

    Python 1

  4. jymh/d4marl Public

    A Dataset and Benchmark for Offline Multi-Agent Reinforcement Learning

    Python 6

  5. An-implementation-of-Language-Model-with-LSTM-Att Public

    Windows | PyTorch

    Python 3

  6. CS294-158-Deep-Unsupervised-Learning Public

    Course from Spring 2019

    Jupyter Notebook 1