Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Accuracy improvement possible? #1

Open
ruze00 opened this issue May 23, 2022 · 1 comment
Open

Accuracy improvement possible? #1

ruze00 opened this issue May 23, 2022 · 1 comment

Comments

@ruze00
Copy link

ruze00 commented May 23, 2022

I'm running the code verbatim but not finding the results which might be expected. For example, running ping_pong_a2c results in barely any improvement after more than 8,000 runs, while I would expect a good level of accuracy (at least > 0 score) by 5,000 iterations or so based on other people reporting results based on using RL with Atari/Pong.

image

Is there something I'm missing? Do the hyperparameters need to be tuned rather than run as is?

Thank you for creating the code base.

@allohvk
Copy link

allohvk commented Nov 8, 2022

No, it does not converge. I spent days on this code to debug why but couldn't drill down to the exact issue. Use the openAi gym wrappers to manipulate the frames

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants