You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There environment seems to be getting created twice. First one just to convert the environment observation and action space into gym format and then pass into the RLLIBAgent class where the environment is built again through rllib library. If I understand correctly this takes more memory for two environments and rewriting to just make one will help with memory.
The environment for the l2rpn_neurips_2020_track1_small take a very long time to do 100 iterations with train_batch_size of 20,000 added to env_config_ppo. These two parameters may even need to be higher to get good results. If something can be done to speed up the training that would be helpful for scaling to bigger networks.
System information
1.9.5
0.8.0
osx
PPO_RLLIB
Bug description
The PPO_RLLIB code has been updated but there are couple of issues
Missing the following line
self.env_glop.chronics_handler.reset()
after https://github.com/rte-france/l2rpn-baselines/blob/ba346d347c85ac70cef6e4c73e3e60edc839490c/l2rpn_baselines/PPO_RLLIB/env_rllib.py#L103 and need to add it to make the train and eval script work.There environment seems to be getting created twice. First one just to convert the environment observation and action space into gym format and then pass into the RLLIBAgent class where the environment is built again through rllib library. If I understand correctly this takes more memory for two environments and rewriting to just make one will help with memory.
The environment for the
l2rpn_neurips_2020_track1_small
take a very long time to do 100 iterations withtrain_batch_size
of20,000
added to env_config_ppo. These two parameters may even need to be higher to get good results. If something can be done to speed up the training that would be helpful for scaling to bigger networks.How to reproduce
Execute the train and eval script here
Expected output
Train script should run without any issues and memory requirement is lower and faster training
The text was updated successfully, but these errors were encountered: