You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
So, I delete the --gamma parameter and directly modify the file config.py. Set the default into 0.96:
parser.add_argument("--gamma", type=float, default=0.96,
help='discount factor for rewards (default: 0.99)')
However, I get the result:
Task ShadowHandCatchOver2Underarm Algo mat_dec Exp single updates 8250/8333 episodes, total num timesteps 49506000/50000000, FPS 1021.
average_step_rewards is 0.330600768327713.
some episodes done, average rewards: 19.574572331772863
Task ShadowHandCatchOver2Underarm Algo mat_dec Exp single updates 8275/8333 episodes, total num timesteps 49656000/50000000, FPS 1022.
average_step_rewards is 0.3444286584854126.
some episodes done, average rewards: 20.018084016291084
Task ShadowHandCatchOver2Underarm Algo mat_dec Exp single updates 8300/8333 episodes, total num timesteps 49806000/50000000, FPS 1023. average_step_rewards is 0.3596132695674896.
some episodes done, average rewards: 20.760233263901018
Task ShadowHandCatchOver2Underarm Algo mat_dec Exp single updates 8325/8333 episodes, total num timesteps 49956000/50000000, FPS 1024.
average_step_rewards is 0.3465554118156433.
some episodes done, average rewards: 20.917307748507582
It is far away from your results (about 25) in the paper. I guess there might be some config set wrongly. Can I get a latest script or any instructions about what I might do wrong?
The text was updated successfully, but these errors were encountered:
hiya,thank you so much for your attention, I noticed that your error message contains a lot of hyper parameters that are not in this repo, e.g. num_envs, cfg_train, steps_num... It seems that your config conflicts with other things in your local Python environment/workspace.
Thus, I recommend first to find out the cause of this strange error before modifying the config file directly~~ hoping it might help you~~
Those hyper parameters are not introduced into this code by me as I just use the original code. It seems that the Bi-Dexhands benchmark introduces these config. I haven't modified this source code. I just clone the current version into local and run the script: ./mat/scripts/train_hands.sh.
I have checked this just now. The original code can reproduce the bug. And it still gets the same error information.
Problem: Reproduced result is lower than the one in paper a lot
Details:
I want to reproduce the results in Bi-DexHands domain. I use the scripts which you provide directly.
However, it shows there are some bugs :
So, I delete the --gamma parameter and directly modify the file
config.py
. Set the default into 0.96:However, I get the result:
It is far away from your results (about 25) in the paper. I guess there might be some config set wrongly. Can I get a latest script or any instructions about what I might do wrong?
The text was updated successfully, but these errors were encountered: