A3C-FF seems not work well? #28

pengsun · 2017-02-15T12:36:03Z

Hi @miyosuda, thanks for providing the code! When I experimented it with other games than pong (only the ROM name and ACTION_SIZE are modified), I found A3C-FF seems not work very well. For example, after iteration 50M, the training score for breakout is ~30, while that for space_invaders is ~600, which are lower than what is reported in the A3C paper.

Also, I found videos for breakout and space_invaders in @Itsukara 's fork, could you @Itsukara show your training details on these two games with this code?

miyosuda · 2017-02-15T13:52:33Z

@pengsun
Recently I've changed LOCAL_T_MAX value from 5 to 20.

d67e7dc

I've checked with Pong environment with LSTM, and confirmed that the score becomes so much better, but I didn't check FF version when I changed this parameter.

Could you try old LOCAL_T_MAX=5 setting?

pengsun · 2017-02-15T15:12:14Z

Hi @miyosuda, sorry I forget to say that, I did try A3C-FF with LOCAL_T_MAX = 5, because it's the A3C paper's settings. I also tried 20, the same low scores.

Previously I had a private implementation of A3C in Torch 7, and I found that A3C-FF works as good as in the paper on breakout and space_invaders, no matter LOCAL_T_MAX =5 or 20

pengsun · 2017-02-15T15:14:47Z

But I can confirm that your A3C-LSTM can achieve reasonable high scores on breakout and space_invaders, that's why I feel strange...

babaktr · 2017-02-15T15:19:18Z

If you're running on Gym, try running Deterministic-v0. The regular Atari games have a random frame skip/action repeat between 1-5 in Gym that could impact the result. Deterministic fixes the FS to 4 (3 for SpaceInvaders)

pengsun · 2017-02-15T15:28:47Z

Hi @babaktr, no I didn't run it on Gym, I just followed @miyosuda and installed ale python wrapper. The skip frame = 4, repeat action probability = 0, as defaulted in game_state.py.

duyunshu · 2017-04-20T21:01:59Z

@pengsun would you mind open-source your torch implementation? I'm also using torch but can't find a good replication.

pengsun · 2017-04-21T05:43:32Z

Hi @duyunshu , thanks for your interest. Yes, it can be found here: https://github.com/pengsun/torch-rl-async-v2

But I don'd know if I have the time to add a README.md for how to run the code...

duyunshu · 2017-04-21T15:30:33Z

@pengsun thank you very much! I'll try if can make it work in my machine.

zhoudoudou · 2017-12-19T02:37:46Z

hi,@miyosuda, thank you thanks for providing the code! but i run the A3C FF,i use all your setting ，but the score is very bad ,it always fluctuates between -19 and -20（30.00M）. but i run the A3CLSTM,is normal。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A3C-FF seems not work well? #28

A3C-FF seems not work well? #28

pengsun commented Feb 15, 2017

miyosuda commented Feb 15, 2017 •

edited

Loading

pengsun commented Feb 15, 2017

pengsun commented Feb 15, 2017

babaktr commented Feb 15, 2017

pengsun commented Feb 15, 2017

duyunshu commented Apr 20, 2017

pengsun commented Apr 21, 2017

duyunshu commented Apr 21, 2017

zhoudoudou commented Dec 19, 2017

A3C-FF seems not work well? #28

A3C-FF seems not work well? #28

Comments

pengsun commented Feb 15, 2017

miyosuda commented Feb 15, 2017 • edited Loading

pengsun commented Feb 15, 2017

pengsun commented Feb 15, 2017

babaktr commented Feb 15, 2017

pengsun commented Feb 15, 2017

duyunshu commented Apr 20, 2017

pengsun commented Apr 21, 2017

duyunshu commented Apr 21, 2017

zhoudoudou commented Dec 19, 2017

miyosuda commented Feb 15, 2017 •

edited

Loading