Add TorchScriptWrapper_v1 #802

danieldk · 2022-11-16T09:50:35Z

PyTorch `Module`s can be converted to TorchScript. TorchScript has the advantage that the model is serialized with the model parameters in a portable manner. Deserialization (in contrast to pickling) does not require certain Python types to be available. In fact, a TorchScript module can be loaded in C++ without a Python interpreter.

For Thinc and spaCy, supporting TorchScript has two main benefits:

1. Since the model is serialized together with its parameters, we avoid an issue that we have with the PyTorch wrapper: there, we have to construct the PyTorch model before deserializing its parameters, but we can only know the model shapes through deserialization.
2. When a model is rewritten, e.g. for quantization, it is unwieldy to construct the rewritten model by hand. So, with the PyTorch wrapper we would first have to construct the original model and then reapply the graph transformation. This is particularly costly for transformations that reduce the model size, since we end up temporarily allocating all parameters of the original model. This is not an issue with TorchScript, since we serialize the rewritten model.
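As a rough illustration of the serialization property described above, here is a minimal sketch using plain PyTorch only (not the Thinc wrapper added in this PR). `TinyClassifier` and its sizes are hypothetical names chosen for the example; the point is that loading does not require reconstructing the module or knowing its shapes up front.

```python
import io

import torch
from torch import nn


class TinyClassifier(nn.Module):
    """Hypothetical toy module, used only for illustration."""

    def __init__(self, n_in: int, n_out: int):
        super().__init__()
        self.linear = nn.Linear(n_in, n_out)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.relu(self.linear(x))


# Convert the eager module to TorchScript.
scripted = torch.jit.script(TinyClassifier(4, 2))

# Serialize the code and the parameters together into an in-memory buffer.
buffer = io.BytesIO()
torch.jit.save(scripted, buffer)
buffer.seek(0)

# Loading does not call TinyClassifier(...) and does not need the
# shapes (4, 2): both are recovered from the serialized module.
restored = torch.jit.load(buffer)

# The restored module should produce the same output as the original.
x = torch.ones(3, 4)
same_output = torch.allclose(scripted(x), restored(x))
```

With a regular `state_dict`, by contrast, the module would have to be constructed with matching shapes before its parameters could be loaded, which is the ordering problem described in the first point.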


svlandeg · 2022-11-17T15:28:46Z

@danieldk : can you have a look at the conflicts?


danieldk · 2022-11-17T15:49:16Z

> @danieldk : can you have a look at the conflicts?

Fixed.

svlandeg

This will be great to have! I'm particularly enthusiastic about the prospect of resolving the initialization/deserialization shape issues 🎉

thinc/layers/torchscriptwrapper.py

thinc/shims/__init__.py

thinc/shims/torchscript.py

website/docs/api-layers.md

website/docs/usage-frameworks.md

danieldk · 2022-11-23T11:26:42Z

I put the PR in draft for the moment, since I am still wondering if it's best to pass None as defaults to the conversion functions.

svlandeg · 2022-11-23T12:41:20Z

> I put the PR in draft for the moment, since I am still wondering if it's best to pass None as defaults to the conversion functions.

As discussed, I'd keep it as is.

shadeMe approved these changes Nov 16, 2022


Merge remote-tracking branch 'upstream/master' into feature/torchscri…

19561e3

…ptwrapper

svlandeg added labels Nov 22, 2022: enhancement (Feature requests and improvements), interop / pytorch (PyTorch interoperability), serialization (Saving and loading models)

svlandeg reviewed Nov 22, 2022


Add fixes suggested by @svlandeg

467566e

danieldk marked this pull request as draft November 23, 2022 11:23

danieldk marked this pull request as ready for review November 23, 2022 12:50

svlandeg merged commit 9daaae5 into master Nov 24, 2022

svlandeg deleted the feature/torchscriptwrapper branch November 24, 2022 07:25
