
Eager mode #635

Merged (1 commit) on Mar 22, 2022

Conversation


@makslevental makslevental commented Mar 2, 2022

This PR implements an eager mode backend for PyTorch through the torch-mlir framework. This is accomplished by overriding the `__torch_dispatch__` class method on the wrapper subclass `TorchMLIRTensor(torch.Tensor)`.

Effectively, this mode works by compiling op by op as the NN is eagerly executed by PyTorch. That compilation entails building a representation of the op that can be `torch.jit.script`ed, importing it using `ModuleBuilder`, and then executing it (e.g., with `RefBackendLinalgOnTensorsBackend`). This mode includes a fallback to conventional PyTorch if anything in the torch-mlir compilation process fails (e.g., an unsupported op).

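For readers unfamiliar with the mechanism, a minimal sketch of this wrapper-subclass pattern is shown below. It is illustrative only, not the PR's actual code; `compile_and_run_with_torch_mlir` is a hypothetical stand-in for the script/import/compile pipeline described above.

```python
# Illustrative sketch of a __torch_dispatch__ wrapper subclass with a PyTorch
# fallback. Not the PR's implementation; compile_and_run_with_torch_mlir is a
# hypothetical placeholder for the torch-mlir compile-and-execute path.
import torch
from torch.utils._pytree import tree_map


class WrapperTensorSketch(torch.Tensor):
    @staticmethod
    def __new__(cls, elem):
        # Create a wrapper that mirrors elem's metadata but delegates every op.
        r = torch.Tensor._make_wrapper_subclass(
            cls, elem.size(), dtype=elem.dtype, device=elem.device,
            requires_grad=elem.requires_grad)
        r.elem = elem
        return r

    @classmethod
    def __torch_dispatch__(cls, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}
        unwrap = lambda x: x.elem if isinstance(x, WrapperTensorSketch) else x
        wrap = lambda x: WrapperTensorSketch(x) if isinstance(x, torch.Tensor) else x
        uargs, ukwargs = tree_map(unwrap, args), tree_map(unwrap, kwargs)
        try:
            # Compile this single op through torch-mlir and execute it (hypothetical).
            out = compile_and_run_with_torch_mlir(func, uargs, ukwargs)
        except Exception:
            # Fallback: execute the op with ordinary eager PyTorch.
            out = func(*uargs, **ukwargs)
        return tree_map(wrap, out)
```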
Currently, all e2e tests pass except for two that involve an upstream PyTorch bug (pytorch/pytorch#74400).

High priority next steps:

  1. A compile cache in order to speed up reruns of the same NN (a rough sketch follows after this list).
  2. Integration with IREE (though not in this repo).
  3. Integration with torch.distributed.

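A rough illustration of what the compile cache in item 1 might look like (all names below are hypothetical and not part of this PR): compiled artifacts are memoized on the op overload plus the shapes and dtypes of its tensor arguments, so repeated iterations of the same NN reuse earlier compilations instead of re-invoking the whole pipeline.

```python
# Hypothetical per-op compile cache keyed on the op and the metadata of its
# tensor arguments; compile_op stands in for the script/import/compile pipeline.
import torch

_COMPILE_CACHE = {}

def _meta(x):
    # Reduce each argument to something hashable that determines the compilation.
    if isinstance(x, torch.Tensor):
        return ("Tensor", tuple(x.shape), str(x.dtype))
    if isinstance(x, (list, tuple)):
        return tuple(_meta(v) for v in x)
    return x

def compile_with_cache(func, args, kwargs):
    key = (str(func), _meta(args), _meta(sorted(kwargs.items())))
    if key not in _COMPILE_CACHE:
        _COMPILE_CACHE[key] = compile_op(func, args, kwargs)  # hypothetical compile step
    return _COMPILE_CACHE[key]
```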
@makslevental makslevental marked this pull request as ready for review March 2, 2022 23:03
@makslevental makslevental mentioned this pull request Mar 2, 2022

silvasean commented Mar 2, 2022

Can you add a test config for the e2e test framework?

https://github.com/llvm/torch-mlir/tree/main/python/torch_mlir_e2e_test/torchscript/configs

It should be possible to pass all the tests if you implement the fallback correctly.

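For context, a config in that framework decides how each test program is prepared and how its recorded trace is replayed. Below is a very rough, hypothetical sketch of an eager-mode config; the `compile`/`run` split and the trace item fields are assumptions about the framework's interface, not the code this PR actually adds, and `WrapperTensorSketch` refers to the illustrative subclass sketched earlier.

```python
# Hypothetical sketch of an eager-mode e2e test config. The compile/run hooks
# and trace item fields are assumptions about the test framework's interface.
import torch

class EagerModeTestConfigSketch:
    def compile(self, program: torch.nn.Module):
        # Nothing to do ahead of time: compilation happens op by op at runtime.
        return program

    def run(self, artifact: torch.nn.Module, trace):
        results = []
        for item in trace:
            # Wrap inputs so every op dispatches through the eager backend.
            wrapped = [WrapperTensorSketch(t) for t in item.inputs]
            out = getattr(artifact, item.symbol)(*wrapped)
            # Unwrap so results can be compared against the native PyTorch run.
            results.append(out.elem if hasattr(out, "elem") else out)
        return results
```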

@silvasean silvasean left a comment

drive-by comment. will need a few passes here on the review.

@makslevental makslevental force-pushed the eager_clean branch 11 times, most recently from 433330b to 47a6a3d on March 14, 2022 16:42
@makslevental makslevental force-pushed the eager_clean branch 4 times, most recently from 5aecdb6 to 8a0be06 on March 14, 2022 19:02
@makslevental makslevental force-pushed the eager_clean branch 5 times, most recently from 8bb719a to dadb46f on March 22, 2022 16:41

@silvasean silvasean left a comment

mostly nits.

@makslevental makslevental force-pushed the eager_clean branch 3 times, most recently from bbc8e27 to cf86e01 on March 22, 2022 19:05
@makslevental makslevental changed the title from "[TORCH][MLIR] Eager mode" to "Eager mode" on Mar 22, 2022
@makslevental makslevental requested a review from silvasean March 22, 2022 19:07
from torch_mlir_e2e_test.linalg_on_tensors_backends import refbackend


class TorchMLIRTensor(torch.Tensor):
A reviewer (Contributor) commented:

Is there any upstream documentation describing the extension point you are using here (`_make_wrapper_subclass`, `__torch_dispatch__`, etc.)? It would be good to link it in if it exists.

@makslevental (Collaborator, Author) replied:

I spoke with Brian Hirsch and he said the best documentation for how to use wrapper subclasses is https://github.com/albanD/subclass_zoo. I can add it as a comment.

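As a point of reference, using such a wrapper subclass looks roughly like the following (illustrative only, reusing the hypothetical `WrapperTensorSketch` sketched earlier in this thread):

```python
# Hypothetical usage: wrap the inputs and run an ordinary nn.Module eagerly;
# every aten op the model executes is routed through __torch_dispatch__.
import torch

with torch.no_grad():
    model = torch.nn.Linear(4, 2)
    x = WrapperTensorSketch(torch.randn(3, 4))
    y = model(x)          # each op tries torch-mlir first, falls back to PyTorch
print(type(y).__name__)   # WrapperTensorSketch
```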
@silvasean silvasean merged commit fe8ac57 into llvm:main Mar 22, 2022