
[Roadmap] PyTorch SparseTensor Support 🚀 #5867

Open
51 of 68 tasks
EdisonLeeeee opened this issue Nov 1, 2022 · 16 comments


EdisonLeeeee commented Nov 1, 2022

🚀 The feature, motivation and pitch

PyG's MessagePassing currently accepts either a torch.LongTensor edge_index or a torch_sparse.SparseTensor adj, which limits flexibility for users who work with native PyTorch SparseTensor. As such, an additional conversion step is required to move between the two formats.

The goal of this roadmap is to track the integration of native PyTorch SparseTensor support into PyG. After integration, the edge_index argument of MessagePassing will also accept a PyTorch SparseTensor while still allowing backpropagation, just like torch_sparse.SparseTensor.
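
For illustration, the conversion step that is currently required looks roughly like the following (a minimal sketch with made-up shapes; the exact conversion and helper names depend on the layer and the torch_sparse version):

import torch
from torch_sparse import SparseTensor

# A native PyTorch sparse COO adjacency matrix ...
edge_index = torch.tensor([[0, 1, 1, 2],
                           [1, 0, 2, 1]])
adj = torch.sparse_coo_tensor(edge_index, torch.ones(4), size=(3, 3))

# ... currently has to be converted before being passed to MessagePassing,
# either back to (edge_index, edge_weight) ...
adj = adj.coalesce()
edge_index, edge_weight = adj.indices(), adj.values()

# ... or to a torch_sparse.SparseTensor.
adj_t = SparseTensor.from_torch_sparse_coo_tensor(adj)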

General Roadmap

  • Add PyTorch SparseTensor support to torch_geometric.nn.MessagePassing and the layers that inherit from it
  • Add PyTorch SparseTensor support to torch_geometric.transforms
  • Add PyTorch SparseTensor support to torch_geometric.loader
  • Replace torch_sparse.SparseTensor-based implementations with native PyTorch functionality
  • ...

Implementations

Utility functions

torch_geometric.nn.*

torch_geometric.transforms.*

@EdisonLeeeee
Copy link
Contributor Author

EdisonLeeeee commented Nov 2, 2022

So far, the least-effort way I can think of to support PyTorch SparseTensor in MessagePassing is to register a propagate forward pre-hook:

def hook(module, inputs):
    edge_index, size, kwargs = inputs
    # Transparently unpack a native PyTorch sparse COO tensor into
    # (edge_index, edge_weight) before message passing runs.
    if is_torch_coo_tensor(edge_index):  # new utility function to be added
        adj = edge_index
        edge_index = adj._indices()
        kwargs['edge_weight'] = adj._values()
    return edge_index, size, kwargs

register_propagate_forward_pre_hook(hook)

We can register such a hook in __init__ of MessagePassing. WDYT? @rusty1s

(Update) This is not a perfect solution, as we cannot enjoy the benefits of PyTorch SparseTensor this way.


rusty1s commented Nov 3, 2022

I think this would only be a temporary solution. Ultimately, we want direct support for it, similar to SparseTensor. Wondering what the main challenges are in supporting this?

@EdisonLeeeee

IMO, there are some challenges to supporting this:

  1. Only sum aggregation is supported when multiplying with a PyTorch SparseTensor. For other cases, including advanced aggregations, we still need to convert it back to the (edge_index, edge_weight) form.
  2. PyTorch SparseTensor does not support spspmm or mul with broadcasting, which makes it challenging to implement operations such as gcn_norm (e.g., $D^{-\frac{1}{2}} A D^{-\frac{1}{2}}$) efficiently.
  3. Since PyTorch SparseTensor is also an instance of torch.Tensor, we would need to change conditions like isinstance(edge_index, Tensor) if we want to support it. There should be some auxiliary functions such as is_edge_index or is_torch_coo_tensor (a minimal layout-based check is sketched right after this list).
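
As a rough illustration of point 3, a layout-based check could look like the following (a minimal sketch; the final helper name and its location in torch_geometric.utils are open for discussion):

import torch

def is_torch_coo_tensor(x) -> bool:
    # A sparse COO tensor is still an instance of torch.Tensor, so
    # isinstance(x, torch.Tensor) alone cannot tell it apart from a dense
    # edge_index; the layout has to be inspected instead.
    return isinstance(x, torch.Tensor) and x.layout == torch.sparse_coo

edge_index = torch.tensor([[0, 1, 1, 2],
                           [1, 0, 2, 1]])
adj = torch.sparse_coo_tensor(edge_index, torch.ones(4), size=(3, 3))

assert not is_torch_coo_tensor(edge_index)
assert is_torch_coo_tensor(adj)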

Please correct me if something is missing.


rusty1s commented Nov 4, 2022

These are good thoughts. Appreciate it. Let me think.

  1. The only way I can think of to support this is to provide an spmm function in PyG that supports both torch_sparse and PyTorch sparse tensors. For the PyTorch path, we can error out in case aggr is neither "sum" nor "mean" (a rough dispatch sketch follows this list).
  2. Yes, this is a real problem. We would need to provide our own implementation for this by working on the indices and values directly. Happy to move this to follow-up PRs and work on the general MessagePassing integration first.
  3. Yes, this is expected, but IMO not a blocker, right?
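
For illustration, a rough sketch of that dispatch idea (hypothetical code, not the final PyG implementation; it starts out sum-only on the PyTorch path and errors out otherwise):

import torch
from torch import Tensor
from torch_sparse import SparseTensor, matmul

def spmm(src, other: Tensor, reduce: str = "sum") -> Tensor:
    # torch_sparse.SparseTensor path: all reductions are supported.
    if isinstance(src, SparseTensor):
        return matmul(src, other, reduce=reduce)
    # Native PyTorch sparse COO path: torch.sparse.mm only implements a sum
    # reduction, so anything else raises for now.
    if isinstance(src, Tensor) and src.layout == torch.sparse_coo:
        if reduce != "sum":
            raise NotImplementedError(
                f"`reduce='{reduce}'` is not yet supported for PyTorch "
                f"sparse tensors")
        return torch.sparse.mm(src, other)
    raise ValueError(f"Unsupported sparse matrix type: {type(src)}")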

@EdisonLeeeee

You are right. Thanks for making it clear. Will update the roadmap correspondingly and make some PRs soon :)

@EdisonLeeeee

Just got back from vacation :) I've updated the roadmap and the corresponding TODO list. I plan to add support over the coming weeks.


rusty1s commented Jan 26, 2023

Nice to have you back :)


sandeep-189 commented Mar 6, 2023

Hi, I was trying to use a PyTorch sparse tensor for an RGCN-like model I am working on. I noticed that mean aggregation for the PyTorch sparse tensor implementation in the spmm function raises a NotImplementedError. Should the implementation for this be torch.sparse.mm(src, other) / src.shape[0]? Or am I misunderstanding mean aggregation?


rusty1s commented Mar 7, 2023

Mean aggregation refers to a row-wise mean that only normalizes across non-zero values. I don't think this is super hard to integrate though, @EdisonLeeeee.
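
For illustration, a minimal sketch of such a row-wise mean (assuming a sparse COO input; not the exact implementation that later landed):

import torch
from torch import Tensor

def sparse_mean_mm(src: Tensor, other: Tensor) -> Tensor:
    # Sum-aggregate first ...
    out = torch.sparse.mm(src, other)
    # ... then normalize each row by its number of non-zero entries,
    # rather than by src.shape[0].
    row = src.coalesce().indices()[0]
    deg = torch.bincount(row, minlength=src.size(0)).clamp_(min=1)
    return out / deg.view(-1, 1).to(out.dtype)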

@EdisonLeeeee

Yeah. Will take a look and make a PR for it.

@sandeep-189

Thank you very much for the clarification. I think with that PR, RGCNConv and FastRGCNConv would also work.

@EdisonLeeeee

I've made a PR for it: #6868


romankouz commented Sep 26, 2023

[Screenshots: the message method of GMMConv and an out-of-memory error]

Apologies if I should have opened a new issue for this, but I opened #6889 right around the time the influx of SparseTensor support started and have some follow-up questions.

Is the support for SparseTensor simply so that these layers can accept a SparseTensor, or so that they use something like spmm for the matrix multiplication? As far as I can see, GMMConv still does not support sparse multiplication even if you pass in a sparse tensor. The commit you have for GMMConv SparseTensor support adds tests that allow you to pass a SparseTensor, but I don't think it does anything different between sparse and dense tensors. Is this a correct assessment?

I ask not because I'm judging, but because I don't know if there is something I'm missing and just wanted to check :).

For further clarity, I have included a screenshot of the message method of GMMConv, along with evidence that it still gives me out-of-memory issues and is not using sparse multiplication. Happy to provide anything else that could be useful :)


rusty1s commented Sep 28, 2023

Hi @romankouz,

Note that this issue is more about bringing support for torch.sparse tensors to PyG, not the SparseTensor class introduced by torch-sparse a while ago.

Overall, we aim for as many layers as possible to accept sparse tensors as input, while only a fraction of them can actually make use of the spmm code path. As a general rule of thumb, we can only leverage spmm if messages do not depend on destination node features and no edge features are involved (which is not the case for GMMConv). The other main benefit you get from using sparse tensors is deterministic behavior, since the aggregation kernels no longer rely on atomic operations.
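
As a toy illustration of that rule of thumb (made-up shapes, not PyG code): when the message is just the source node feature, gathering messages and sum-aggregating is exactly one sparse-dense matrix multiplication.

import torch

num_nodes, num_edges, channels = 4, 6, 8
edge_index = torch.randint(num_nodes, (2, num_edges))
x = torch.randn(num_nodes, channels)

# Explicit "gather messages, then sum-aggregate at the destination" ...
x_j = x[edge_index[0]]  # one message per edge
out_scatter = torch.zeros(num_nodes, channels).index_add_(0, edge_index[1], x_j)

# ... versus a single spmm with the transposed adjacency matrix.
adj_t = torch.sparse_coo_tensor(edge_index.flip(0), torch.ones(num_edges),
                                size=(num_nodes, num_nodes)).coalesce()
out_spmm = torch.sparse.mm(adj_t, x)

assert torch.allclose(out_scatter, out_spmm, atol=1e-6)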

@romankouz

Hi @rusty1s,

Thank you for the follow-up! I'm not entirely sure how aggregation kernels rely on atomic operations for dense matrices but not for sparse tensors. However, I do realize my misunderstanding regarding SparseTensor integration. Thank you for the response!


rusty1s commented Sep 29, 2023

The main difference between the two representations is that for SparseTensor it is guaranteed that neighborhoods are grouped contiguously in memory, which lets us utilize segment_reduce rather than scatter_reduce. As such, this lets us leverage an alternative aggregation in which we no longer need to rely on atomic operations.
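
For illustration, the two reduction styles side by side (a toy example using torch_scatter; assumes the edges are already sorted by destination node):

import torch
from torch_scatter import scatter, segment_csr

x = torch.randn(4, 8)             # messages for 4 edges
row = torch.tensor([0, 0, 1, 2])  # destination of each edge, sorted

# Scatter-style reduction: rows may arrive in any order, so a parallel
# backend typically resolves collisions with atomic adds, making the
# floating-point summation order non-deterministic.
out_scatter = scatter(x, row, dim=0, dim_size=3, reduce="sum")

# Segment-style (CSR) reduction: since each neighborhood is contiguous,
# output row i is reduced over the slice [rowptr[i], rowptr[i + 1])
# without atomics, which gives deterministic results.
rowptr = torch.tensor([0, 2, 3, 4])
out_segment = segment_csr(x, rowptr, reduce="sum")

assert torch.allclose(out_scatter, out_segment)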
