Migrate imitation envs to seals #58

Rocamonde · 2022-08-27T11:52:08Z

This PR migrates imitation environments to seals, in order to solve HumanCompatibleAI/imitation#501.

…ature

…ng base POMDP to tabular env

codecov · 2022-08-27T12:18:52Z

Codecov Report

Merging #58 (94fbf49) into master (7def17c) will not change coverage.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##            master       #58    +/-   ##
==========================================
  Coverage   100.00%   100.00%            
==========================================
  Files           24        26     +2     
  Lines          752       982   +230     
==========================================
+ Hits           752       982   +230

Impacted Files	Coverage Δ
src/seals/base_envs.py	`100.00% <100.00%> (ø)`
src/seals/diagnostics/__init__.py	`100.00% <100.00%> (ø)`
src/seals/diagnostics/cliff_world.py	`100.00% <100.00%> (ø)`
src/seals/diagnostics/noisy_obs.py	`100.00% <100.00%> (ø)`
src/seals/diagnostics/random_trans.py	`100.00% <100.00%> (ø)`
tests/test_base_env.py	`100.00% <100.00%> (ø)`
tests/test_diagnostics.py	`100.00% <100.00%> (ø)`

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

AdamGleave

Reviewed everything except imitation_examples.py which I only skimmed. At a high-level design seems good and I definitely agree these are more at home in seals than imitation.

imitation_examples.py probably shouldn't be called that -- a user doesn't care that it used to be in imitation, do they? You might want to put the random matrix one and CliffWorld in different files, in fact, diagnostics/ has stuck to one file per environment and the other environments in seals are just lightweight wrappers around existing environments.

It definitely needs more tests. We were lax in imitation because it was just example code. But in seals environments are key deliverable. We've been maintaining 100% test coverage so far -- although comprehensiveness of tests matters more than raw line coverage.

One bug (?) which is hurting test coverage a lot is that nothing in imitation_examples.py is actually being registered. This should be pretty obvious from the file having 0% code coverage. If the environment isn't registered, our tests won't pick up on it. If it is registered and under seals/, it gets run automatically. I expect that'll get you to 80-90% coverage on that file for free, and if you write a few manual tests as well you'll be in good shape.

From a quick glance at CodeCov (worth taking a more detailed look yourself) in base_envs.py it seems like TabularModelPOMDP is totally untested (nothing using obs_from_space) so that's one area to improve, though again you might get some coverage there from fixing the above, but it's not a bad idea to have some tests targeted at base_envs directly.

When you've addressed these issues please request another review, and I can go over imitation_examples.py at that point, but it's probably best for me to hold off until you relocate it/split it up/test it as that might introduce a lot of changes anyway.

Makefile

setup.cfg

setup.py

src/seals/base_envs.py

AdamGleave · 2022-09-10T04:05:07Z

src/seals/base_envs.py

+            reward_matrix=reward_matrix,
+            horizon=horizon,
+            initial_state_dist=initial_state_dist,
+            observation_matrix=np.eye(transition_matrix.shape[0]),


Something seems off here. We're basically giving a dummy observation matrix to the code, that never gets used because of the obs_from_state override, and doesn't even produce the same values (one-hot coded vs integer index).

If we make change I suggested above to remove observation_matrix from BaseTabularModelPOMDP, you could just switch this to inherit directly from BaseTabularModelPOMDP and get rid of the observation_matrix here entirely. You'd probably need to make BaseTabularModelPOMDP take observation_space as an argument (specifying obs_dim and obs_dtype won't cut it if you want it to be discrete...), but that seems like a reasonable choice, then just move the current construct_obs_space logic into TabularModelPOMDP.

I'm not 100% satisfied with that, as it does seem like we'd probably want TabularModelMDP to be-a TabularModelPOMDP, but it doesn't seem like a major problem if they're both concrete classes and so specialized in different ways.

Alternatively if you wanted to keep the current hierarchy, you could just make observation_matrix not bogus. Either keep it as-is and delete obs_from_state (you get one hot vectors, which is OK) or change np.eye to np.arange (the observation space would be a bit weird there though).

src/seals/diagnostics/noisy_obs.py

…I/seals into imitation-envs-to-seals

AdamGleave · 2022-09-12T05:55:13Z

On test coverage: you're just missing a single line in base_envs.py, line 313 where you unpack the return value from np.iinfo. I guess np.iinfo is always failing. Probably we do actually want to test the case where it's an integer type? Can't remember quite why we added this, there was some environment that had integer types where using -inf/+inf was problematic, right?

Should probably test RandomTransitionEnv with random_obs set to False (currently always defaults to true)

I think we can make rand_state a mandatory argument in make_random_trans_mat, make_random_state_dist and make_obs_mat -- we always pass it in anyway. This would let us delete the first two lines of code from those functions, simplifying things and getting us some test coverage.

I think with those fixed we'll basically be at 100% coverage again :)

Do let me know once everything addressed and I'll re-review.

AdamGleave

LGTM apart from one comment to add a helper function in diagnostics/__init__.py to avoid polluting module namespace.

I assume that you moved imitation_examples.py to cliff_world.py and random_trans.py without any modifications -- I didn't re-review those, let me know if there was any changes I should take a closer look at.

src/seals/diagnostics/__init__.py

src/seals/diagnostics/cliff_world.py

Rocamonde added 15 commits August 26, 2022 22:01

Initial version of imitation+seals merge of POMDP/MDP environments.

291f514

Bug fixes to make tests pass

4996487

Linting and typing

3ad11c2

Ran black

e6cbb7f

Trailing comma to make linter happy

579ae4d

Fix absurd fight between black and flake8

cfe59f5

Fix array access in bash script (code_checks.sh)

38fa563

Added Makefile to simplify local CI checking

ab9a46e

Removed pytype restriction to only python 3.7

7046feb

Added "type: ignore" on call to numpy method with incorrect type sign…

6a8e21a

…ature

Fixed error on incompatible type signature due to inheritance by addi…

e157bab

…ng base POMDP to tabular env

Added imitation examples (to be moved to a better file)

b2f0ae2

Increased max line length in linting

40286da

Linting and docstrings

42fbdc5

Small fixes

3a5d50b

Bug fixes

bba45eb

Rocamonde mentioned this pull request Aug 27, 2022

Replace imitation environments with seals HumanCompatibleAI/imitation#541

Merged

Rocamonde and others added 4 commits August 28, 2022 11:26

Attempt to fix box boundary overflow

49dd9f0

Attempt to fix inf to int overflow

b7dc25f

Fix bug in ResettablePOMDP

cf97099

Merge branch 'master' into imitation-envs-to-seals

0ac051f

AdamGleave reviewed Sep 10, 2022

View reviewed changes

src/seals/diagnostics/noisy_obs.py Outdated Show resolved Hide resolved

Rocamonde added 6 commits September 11, 2022 13:19

Remove makefile for now

0beb871

Roll back line length for now

11160c0

Fix matplotlib issue

bc46368

Remove type ignore

6b85775

Miscellaneous improvements from review feedback

fccb19b

Merge branch 'imitation-envs-to-seals' of github.com:HumanCompatibleA…

f73ca2e

…I/seals into imitation-envs-to-seals

Rocamonde added 6 commits September 11, 2022 16:01

Restructure imitation examples into adequate

61b7056

Improve coverage

8fbea84

Fix typo

580f77a

Add docstring to test

c758d4e

Additional test coverage improvements

4addc5b

Add docstrings to test

5464af7

Rocamonde added 9 commits September 30, 2022 16:35

Rearrange observation matrix in POMDP

a822a46

Fix space constructors

0391192

Switch to python 3.8 minimum

9f84893

Add docstring

6d9879f

Merge remote-tracking branch 'origin' into imitation-envs-to-seals

91b55d2

Improve test coverage

c714e59

Reorder imports

54cb4b5

Added docstring to tests and exceptions

427f67d

Final coverage fixes

b03a680

Rocamonde requested a review from AdamGleave October 4, 2022 01:16

AdamGleave approved these changes Oct 4, 2022

View reviewed changes

src/seals/diagnostics/__init__.py Outdated Show resolved Hide resolved

src/seals/diagnostics/cliff_world.py Outdated Show resolved Hide resolved

Rocamonde added 3 commits October 4, 2022 12:08

Move cliff world registration to function

f533812

Remove comment

0c66584

Add docstring

94fbf49

Rocamonde merged commit 3d2cd41 into master Oct 4, 2022

Rocamonde deleted the imitation-envs-to-seals branch October 4, 2022 13:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate imitation envs to seals #58

Migrate imitation envs to seals #58

Rocamonde commented Aug 27, 2022

codecov bot commented Aug 27, 2022 •

edited

Loading

AdamGleave left a comment •

edited

Loading

AdamGleave Sep 10, 2022

AdamGleave commented Sep 12, 2022 •

edited

Loading

AdamGleave left a comment

Migrate imitation envs to seals #58

Migrate imitation envs to seals #58

Conversation

Rocamonde commented Aug 27, 2022

codecov bot commented Aug 27, 2022 • edited Loading

Codecov Report

AdamGleave left a comment • edited Loading

Choose a reason for hiding this comment

AdamGleave Sep 10, 2022

Choose a reason for hiding this comment

AdamGleave commented Sep 12, 2022 • edited Loading

AdamGleave left a comment

Choose a reason for hiding this comment

codecov bot commented Aug 27, 2022 •

edited

Loading

AdamGleave left a comment •

edited

Loading

AdamGleave commented Sep 12, 2022 •

edited

Loading