This repository has been archived by the owner on Dec 11, 2022. It is now read-only.
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* SAC algorithm
* SAC - updates to agent (learn_from_batch), sac_head and sac_q_head to fix a problem in gradient calculation. The SAC agent is now able to train. gym_environment - fix an error in access to gym.spaces
* Soft Actor Critic - code cleanup
* code cleanup
* V-head initialization fix
* SAC benchmarks
* SAC Documentation
* typo fix
* documentation fixes
* documentation and version update
* README typo
1 parent 33dc29e, commit 74db141
Showing 92 changed files with 2,813 additions and 403 deletions.
@@ -0,0 +1,48 @@
# Soft Actor Critic

Each experiment uses 3 seeds and is trained for 3M environment steps.
The SAC hyperparameters are the same as those described in the [original paper](https://arxiv.org/abs/1801.01290).
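
The text does not say how the three seeds are combined into a single curve. One common approach, shown here purely as an illustrative sketch, is to report the mean and sample standard deviation of the return at each checkpoint. The function name `aggregate_seeds` and the numbers are hypothetical placeholders, not part of Coach and not benchmark results.

```python
from statistics import mean, stdev
from typing import Dict, List, Sequence


def aggregate_seeds(returns_per_seed: Sequence[Sequence[float]]) -> List[Dict[str, float]]:
    """Combine per-seed return curves into a per-checkpoint mean and sample std."""
    if len(returns_per_seed) < 2:
        raise ValueError("need at least two seeds to estimate spread")
    if len({len(curve) for curve in returns_per_seed}) != 1:
        raise ValueError("all seeds must report the same number of checkpoints")
    summary = []
    # zip(*...) groups the values that all seeds reported at the same checkpoint.
    for values in zip(*returns_per_seed):
        summary.append({"mean": mean(values), "std": stdev(values)})
    return summary


# Placeholder numbers for illustration only; they are NOT benchmark results.
example_curves = [
    [10.0, 20.0, 30.0],  # seed 0
    [12.0, 22.0, 36.0],  # seed 1
    [14.0, 24.0, 33.0],  # seed 2
]

for checkpoint, stats in enumerate(aggregate_seeds(example_curves)):
    print(f"checkpoint {checkpoint}: mean={stats['mean']:.1f} std={stats['std']:.1f}")
```

With only three seeds the standard deviation is a rough estimate, so it is best read as an indication of run-to-run spread rather than a confidence interval.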

### Inverted Pendulum SAC - single worker

```bash
coach -p Mujoco_SAC -lvl inverted_pendulum
```

<img src="inverted_pendulum_sac.png" alt="Inverted Pendulum SAC" width="800"/>


### Hopper SAC - single worker

```bash
coach -p Mujoco_SAC -lvl hopper
```

<img src="hopper_sac.png" alt="Hopper SAC" width="800"/>


### Half Cheetah SAC - single worker

```bash
coach -p Mujoco_SAC -lvl half_cheetah
```

<img src="half_cheetah_sac.png" alt="Half Cheetah SAC" width="800"/>


### Walker 2D SAC - single worker

```bash
coach -p Mujoco_SAC -lvl walker2d
```

<img src="walker2d_sac.png" alt="Walker 2D SAC" width="800"/>


### Humanoid SAC - single worker

```bash
coach -p Mujoco_SAC -lvl humanoid
```

<img src="humanoid_sac.png" alt="Humanoid SAC" width="800"/>
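
The five commands above differ only in the level name, so they can be generated from a list. The following is a minimal sketch that uses only the `-p` and `-lvl` options shown in this README; the helper names `benchmark_commands` and `run_all` are hypothetical, and it assumes the `coach` executable is installed and on `PATH`. It launches each level once and does not handle seeds.

```python
import subprocess
from typing import List, Sequence

PRESET = "Mujoco_SAC"
LEVELS = ("inverted_pendulum", "hopper", "half_cheetah", "walker2d", "humanoid")


def benchmark_commands(preset: str = PRESET, levels: Sequence[str] = LEVELS) -> List[List[str]]:
    """Build one `coach` argument list per level, mirroring the commands above."""
    return [["coach", "-p", preset, "-lvl", level] for level in levels]


def run_all(dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for argv in benchmark_commands():
        print(" ".join(argv))
        if not dry_run:
            # Assumption: `coach` is installed and reachable on PATH.
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    # Dry run by default, so nothing is launched accidentally.
    run_all(dry_run=True)
```

Keeping the dry run as the default makes it possible to review the exact command lines before starting runs that each train for 3M environment steps.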