While writing my reward function, I noticed that the last state and the current state passed into calc_reward always reported the same current_hp, even though the HP had changed between the two steps. Changing the shallow copy in openai_api.py from
battle = copy.copy(self.current_battle)
to a deep copy
battle = copy.deepcopy(self.current_battle)
fixed the issue.
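For anyone hitting the same thing, here is a minimal sketch of why I think the shallow copy behaves this way. The `Battle` and `Pokemon` classes below are hypothetical stand-ins I made up for illustration, not the library's real classes. The only assumption is that the battle object holds its HP on a nested mutable object that gets updated in place:

```python
import copy
from dataclasses import dataclass


@dataclass
class Pokemon:  # hypothetical stand-in, not the library's class
    current_hp: int


@dataclass
class Battle:  # hypothetical stand-in, not the library's class
    active_pokemon: Pokemon
    turn: int = 0


def snapshot_then_damage(copier):
    """Take a 'last state' snapshot with `copier`, then mutate the live battle."""
    battle = Battle(active_pokemon=Pokemon(current_hp=100))
    last_state = copier(battle)
    battle.active_pokemon.current_hp -= 30  # in-place change on the nested object
    battle.turn += 1                        # rebinds a top-level attribute
    return last_state, battle


shallow_last, shallow_now = snapshot_then_damage(copy.copy)
deep_last, deep_now = snapshot_then_damage(copy.deepcopy)

# Shallow copy: the snapshot shares the nested Pokemon, so both show the new HP.
print(shallow_last.active_pokemon.current_hp, shallow_now.active_pokemon.current_hp)  # 70 70
# Deep copy: the snapshot owns its own Pokemon, so the old HP is preserved.
print(deep_last.active_pokemon.current_hp, deep_now.active_pokemon.current_hp)  # 100 70
# Top-level attributes diverge even with a shallow copy.
print(shallow_last.turn, shallow_now.turn)  # 0 1
```

Note that `copy.deepcopy` walks the whole object graph, so it may be noticeably slower on a large battle object. If that becomes a problem, storing only the values the reward needs (for example the HP numbers) might be a cheaper alternative.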
I'm not sure whether I'm the only one seeing this, but I've attached a screenshot of the edit in case anyone else runs into it.