Releases · IntelLabs/coach
Release 1.0.0
Release 0.12.1
- Fixes for breaking API changes (OpenAI Gym, SciPy)
- Off-policy evaluation (OPE): Weighted Importance Sampling (a generic sketch follows this list)
- Creating a dataset using an agent
- Printing the input size as part of the network summary
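
For intuition, here is a minimal, self-contained sketch of the weighted importance sampling estimator in plain NumPy. It shows the textbook estimator only; the function name and trajectory format are illustrative and are not Coach's internal API.

```python
import numpy as np

def weighted_importance_sampling(trajectories):
    """Estimate the value of an evaluation policy from logged trajectories.

    Each trajectory is a list of (pi_e_prob, pi_b_prob, reward) tuples,
    where pi_e_prob / pi_b_prob are the probabilities of the logged action
    under the evaluation and behavior policies.
    """
    weights, returns = [], []
    for traj in trajectories:
        ratio, ret = 1.0, 0.0
        for pi_e, pi_b, r in traj:
            ratio *= pi_e / pi_b   # cumulative importance weight
            ret += r               # undiscounted return, for simplicity
        weights.append(ratio)
        returns.append(ret)
    weights = np.asarray(weights)
    returns = np.asarray(returns)
    # WIS normalizes by the sum of weights rather than the number of
    # trajectories, trading a small bias for much lower variance than
    # ordinary importance sampling.
    return float(np.sum(weights * returns) / np.sum(weights))
```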
Release 0.12.0
- ACER (Actor-Critic with Experience Replay)
- Soft Actor-Critic
- BCQ (Batch-Constrained deep Q-learning)
- Batch RL
- Off-policy evaluation (estimators: Direct Method, Doubly Robust, Sequential Doubly Robust, Inverse Propensity Scoring; see the sketch after this list)
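
As a companion to the estimator list above, here is a generic one-step (bandit-style) doubly robust estimate in NumPy. The function names and data layout are illustrative, not Coach's API; Coach's sequential variants extend the same idea over multi-step trajectories.

```python
import numpy as np

def doubly_robust(logged, q_model, pi_e):
    """One-step doubly robust off-policy evaluation estimate.

    logged: list of (state, action, reward, pi_b_prob) tuples
    q_model(state, action): learned reward model (the Direct Method part)
    pi_e(state): dict mapping actions to evaluation-policy probabilities
    """
    estimates = []
    for s, a, r, pi_b_prob in logged:
        probs = pi_e(s)
        # Direct Method term: expected reward under the evaluation policy
        # according to the learned model.
        dm = sum(p * q_model(s, act) for act, p in probs.items())
        # Importance-weighted correction of the model's error on the
        # action that was actually logged.
        rho = probs.get(a, 0.0) / pi_b_prob
        estimates.append(dm + rho * (r - q_model(s, a)))
    return float(np.mean(estimates))
```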
Release 0.11.2
- Fix for Intel-optimized TensorFlow
Release 0.11.1
- Fixed a memory leak in the rollout worker
- Removed the wxPython dependency
Release 0.11.0
- Horizontal scaling
- MXNet support
- ONNX export (see the consumption sketch after this list)
- New documentation
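
An exported ONNX model can be served without Coach or its training frameworks at all. The sketch below runs a forward pass with onnxruntime; the model path and observation shape are placeholders for whatever your experiment actually produced.

```python
import numpy as np
import onnxruntime as ort

# The path below is hypothetical; point it at the ONNX file your
# Coach experiment exported.
session = ort.InferenceSession(
    "experiments/cartpole_dqn/model.onnx",
    providers=["CPUExecutionProvider"])

input_name = session.get_inputs()[0].name
obs = np.zeros((1, 4), dtype=np.float32)  # a single CartPole observation

# Run inference entirely outside of Coach / TensorFlow / MXNet.
q_values = session.run(None, {input_name: obs})[0]
print(q_values)
```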
Release 0.10.0
A complete, non-backward-compatible redesign that enables multi-agent support.
New features -
- PIP package
- Benchmarks
- Hierarchical Reinforcement Learning (demonstrated by Hierarchical Actor-Critic)
- Tutorials
- Shared memory (e.g. Replay Buffer) between workers
- Tests (unit-tests, reward-based tests, trace-based tests)
- Using Coach as a library (see the sketch at the end of these notes)
New environments -
- Toy environments (Exploration Chain, BitFlip)
- DeepMind PySC2 support (StarCraft II)
- DeepMind Control Suite
New algorithms -
- Hindsight Experience Replay
- Prioritized Experience Replay
- Hierarchical Actor-Critic
- UCB with Q-Ensembles
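
The library-usage sketch referenced above follows the preset-style GraphManager API from Coach's documentation; exact class names may differ slightly between releases.

```python
from rl_coach.agents.clipped_ppo_agent import ClippedPPOAgentParameters
from rl_coach.environments.gym_environment import GymVectorEnvironment
from rl_coach.graph_managers.basic_rl_graph_manager import BasicRLGraphManager
from rl_coach.graph_managers.graph_manager import SimpleSchedule

# A graph manager wires an agent and an environment together, the same
# structure the bundled presets build internally.
graph_manager = BasicRLGraphManager(
    agent_params=ClippedPPOAgentParameters(),
    env_params=GymVectorEnvironment(level='CartPole-v0'),
    schedule_params=SimpleSchedule())

graph_manager.improve()  # run the training loop defined by the schedule
```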
Release 0.9.0
New features -
- CARLA 0.7 simulator integration
- Human control of gameplay
- Recording human gameplay and storing/loading the replay buffer
- Behavioral cloning agent and presets (see the sketch at the end of these notes)
- Golden tests for several presets
- Selecting between deep and shallow image embedders
- Rendering through pygame (with a performance boost)
API changes -
- Improved environment wrapper API
- Added an evaluate flag to allow convenient evaluation of existing checkpoints
- Improved the frameskip definition in Gym
Bug fixes -
- Fixed loading of checkpoints for agents with more than one network
- Fixed Python 3 compatibility for the N-Step Q-learning agent
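
To make the behavioral cloning feature concrete, here is a from-scratch sketch of the underlying idea: supervised learning on recorded demonstrations, with no reward signal involved. It uses a plain softmax linear model rather than Coach's agent and network heads, and all names are illustrative.

```python
import numpy as np

def behavioral_cloning(states, actions, n_actions, lr=0.1, epochs=200):
    """Fit a softmax linear policy to (state, action) demonstrations.

    states: array of shape (n, d); actions: integer array of shape (n,).
    Returns a weight matrix W; act greedily via np.argmax(state @ W).
    """
    n, d = states.shape
    W = np.zeros((d, n_actions))
    onehot = np.eye(n_actions)[actions]          # targets from the demos
    for _ in range(epochs):
        logits = states @ W
        logits -= logits.max(axis=1, keepdims=True)   # numerical stability
        probs = np.exp(logits)
        probs /= probs.sum(axis=1, keepdims=True)
        W -= lr * states.T @ (probs - onehot) / n     # cross-entropy gradient
    return W
```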