PPO_tf

Implementação da proximal policy optimization (PPO) usando tensorflow com comentários em português

Ambiente

CartPole-v0 do open ai gym
espaço de estado: contínuo espaço de ação: discreto

Dependencias

python3.6
tensorflow v1.4
open ai gym

Treinamento

python main.py

Testar politica treinada

python test_policy.py

Tensorboard

tensorboard --logdir=log

LICENÇA

MIT LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.vscode		.vscode
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.ipynb		main.ipynb
main.py		main.py
policy_net.py		policy_net.py
ppo.py		ppo.py
test_policy.ipynb		test_policy.ipynb
test_policy.py		test_policy.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PPO_tf

Ambiente

Dependencias

Treinamento

Testar politica treinada

Tensorboard

LICENÇA

About

Releases

Packages

Contributors 3

Languages

License

samuel-caldas/ppotf

Folders and files

Latest commit

History

Repository files navigation

PPO_tf

Ambiente

Dependencias

Treinamento

Testar politica treinada

Tensorboard

LICENÇA

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages