Feature skip data iteration when caching scoring #557
Conversation
* indentation level
* disable some pylint messages
* unused fixtures
Before, `net.infer` was cached when using a scoring callback with `use_caching=True`. This way, the time to make an inference step was saved. However, there was still an iteration step over the data for each scoring callback. If iteration is slow, this could incur a significant overhead. Now `net.forward_iter` is cached instead. This way, the iteration over the data is skipped and the iteration overhead should be gone.
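A minimal sketch of what this caching could look like, written as a context manager that temporarily shadows `net.forward_iter` with the cached predictions (the name `_cache_net_forward_iter` comes up later in this thread; the body here is illustrative, not the exact implementation):

```python
from contextlib import contextmanager

@contextmanager
def _cache_net_forward_iter(net, use_caching, y_preds):
    """Sketch: make net.forward_iter yield cached predictions
    instead of iterating over the data again."""
    if not use_caching:
        yield net
        return

    def cached_forward_iter(*args, y_preds=y_preds, **kwargs):
        # ignore the passed-in data and yield the cached batches
        for yp in y_preds:
            yield yp

    net.forward_iter = cached_forward_iter
    try:
        yield net
    finally:
        # remove the instance attribute to restore the class method
        del net.__dict__['forward_iter']
```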
Can I test this from some nightly build, or do I have to build the package on my own?
@marrrcin You would need to install from source, but it's not difficult: https://github.com/skorch-dev/skorch#from-source
In general I think this approach works. We could debate whether this is a problem we need to solve or one where PyTorch lacks infrastructure (i.e., caching datasets), but ultimately I think it doesn't hurt to fix this.
skorch/net.py
```python
def _forward_output(self, yp, device):
    if isinstance(yp, tuple):
        return tuple(n.to(device) for n in yp)
    return yp.to(device)
```
This is structurally very similar to `skorch.utils.to_tensor`. Maybe we should introduce a `skorch.utils.to_device` instead? This might also come in handy if we support multiple GPUs in the future.
Good point, I moved this to `skorch.utils.to_device`.
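A sketch of what such a helper could look like, mirroring the tuple handling of `_forward_output` above (illustrative; the exact signature in skorch may differ):

```python
def to_device(X, device):
    """Move a tensor, or a tuple of tensors, to the given device.

    Sketch of the skorch.utils.to_device helper discussed above.
    """
    if device is None:
        # no device given, leave X unchanged
        return X
    if isinstance(X, tuple):
        return tuple(to_device(x, device) for x in X)
    return X.to(device)
```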
Similar to the comment in `cache_net_infer`.
... instead of having it as a method on NeuralNet. Add tests.
@ottonemo I addressed your comment, please review again. Before merging this, should we consider making a new release? I wouldn't mind this new feature being only on master for some time in case it creates some trouble down the line.
This is the proposed fix for #552. As discussed there, my proposal is to cache `net.forward_iter`, so that we can skip the iteration over the data. It would be very helpful if someone could think of any unintended side effects this change could have. E.g., in theory, some code could rely on iterating over the data even in case of caching, but I can hardly imagine this happening in reality.
Also, I cannot test on GPU at the moment. I don't see how this would affect the outcome, but it would still be nice if someone verified that it works.
I deprecated `cache_net_infer` because it was not "private". However, I made the new `_cache_net_forward_iter` private.
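For context, a setup like the following is the kind that benefits from this change; with `use_caching=True`, the scoring callback should no longer trigger a second pass over the validation data (the module and data below are placeholders):

```python
import numpy as np
import torch
from skorch import NeuralNetClassifier
from skorch.callbacks import EpochScoring

class MyModule(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.dense = torch.nn.Linear(20, 2)

    def forward(self, X):
        return torch.softmax(self.dense(X), dim=-1)

net = NeuralNetClassifier(
    MyModule,
    max_epochs=5,
    # with use_caching=True, net.forward_iter is now cached, so the
    # validation data is not iterated again for this callback
    callbacks=[EpochScoring('f1', use_caching=True)],
)

X = np.random.randn(100, 20).astype(np.float32)
y = np.random.randint(0, 2, size=100)
net.fit(X, y)
```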