Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add ParabolaEnv. #29

Merged
merged 5 commits into from
Jun 30, 2020
Merged

Add ParabolaEnv. #29

merged 5 commits into from
Jun 30, 2020

Conversation

pedrofreire
Copy link
Contributor

No description provided.

@pedrofreire pedrofreire requested review from qxcv and AdamGleave June 27, 2020 19:04
Copy link
Member

@AdamGleave AdamGleave left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bounds need to be modified

self._bounds = bounds

super().__init__(
state_space=spaces.Box(low=-bounds, high=bounds, shape=(5,)),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bounds seems wrong:

  • The last three components (a, b, and c) are always in [-1,1] whatever bounds is set to.
  • The first two components (x, y) we say are $\mathbb{R}^2$ in the DERAIL paper. We should either change the paper or the code.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The motivation for bounding is more of a technicality than a environment design point: I just wanted to make sure that returns were bounded, (without the bounds, arbitrarily bad policies can get arbitrarily negative returns); I'd rather add that to the paper than change the code.

@codecov
Copy link

codecov bot commented Jun 30, 2020

Codecov Report

Merging #29 into master will increase coverage by 11.12%.
The diff coverage is 100.00%.

Impacted file tree graph

@@             Coverage Diff              @@
##           master       #29       +/-   ##
============================================
+ Coverage   88.87%   100.00%   +11.12%     
============================================
  Files          20        21        +1     
  Lines         593       619       +26     
============================================
+ Hits          527       619       +92     
+ Misses         66         0       -66     
Impacted Files Coverage Δ
src/seals/diagnostics/__init__.py 100.00% <100.00%> (ø)
src/seals/diagnostics/parabola.py 100.00% <100.00%> (ø)
src/seals/base_envs.py 100.00% <0.00%> (+6.86%) ⬆️
src/seals/util.py 100.00% <0.00%> (+16.66%) ⬆️
tests/test_util.py 100.00% <0.00%> (+17.64%) ⬆️
tests/test_envs.py 100.00% <0.00%> (+32.07%) ⬆️
tests/test_wrappers.py 100.00% <0.00%> (+60.00%) ⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 149710e...8f340f9. Read the comment docs.

@pedrofreire pedrofreire requested a review from AdamGleave June 30, 2020 14:47
Copy link
Member

@AdamGleave AdamGleave left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM (but please update the paper to document this behaviour)

@pedrofreire pedrofreire merged commit eb5ae9b into master Jun 30, 2020
@pedrofreire pedrofreire deleted the parabola-1 branch June 30, 2020 22:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants