Add topk features #260

aurorarossi · 2023-03-09T21:41:18Z

With this PR I add the topk_nodes and topk_edges features (see issue #41).

src/utils.jl

CarloLucibello · 2023-03-20T17:45:41Z

I think it is more useful and general to have a function with signature

topk_nodes(g::GNNGraph, x::AbstractArray, k::Int; rev::Bool = true, sortby::Union{Nothing, Int} = nothing)

instead of passing the tensor name.

Also, the docstring needs much more explanation of the purpose of this function, the various arguments, and the returned objects.

Following DGL, I think it is useful to return both the sorted array and permutation.
So if we call (y, partialperm) the output:

y should be an array of size (num_feat, k, num_graphs)
partialperm should be an array of integers of size (k, num_graphs) if sortby === nothing and of size (num_feat, k, num_graphs) if sortby is an integer.

…uralNetworks.jl into add_topk_GPU

CarloLucibello · 2024-03-11T11:03:05Z

Does it work on gpu as well?

aurorarossi · 2024-03-11T12:51:11Z

Yes, it works on the GPU.

CarloLucibello · 2024-03-11T13:19:38Z

I think it would be nice to contribute topk_matrix as a topk function in MLUtils.jl and then base this PR on top of that.

Then I would have here two functions, topk_nodes and topk_edges as in Deep Graph Library (https://docs.dgl.ai/generated/dgl.topk_nodes.html#dgl.topk_nodes)

CarloLucibello · 2024-03-11T13:09:21Z

src/utils.jl

+- `feat`: a feature array of size `(number_features, g.num_nodes)` or `(number_features, g.num_edges)` of the graph `g`.
+- `k`: the number of top features to return.
+- `rev`: if `true`, sort in descending order otherwise returns the `k` smallest elements.
+- `sortby`: the index of the feature to sort by. If `nothing`, every row independently.


this sentence is not clear

CarloLucibello · 2024-03-11T13:12:13Z

src/utils.jl

+function _topk_matrix(matrix::AbstractArray, k::Int; rev::Bool = true, sortby::Union{Nothing, Int} = nothing)
+    if sortby === nothing
+        sorted_matrix = sort(matrix, dims = 2; rev)[:, 1:k]
+        vector_indices = map(x -> sortperm(x; rev), eachrow(matrix))


instead of sorting the whole matrix, it would be more efficient to use partialsortperm. I'm not sure is supported by CUDA.jl though

CarloLucibello · 2024-03-11T13:21:23Z

src/utils.jl

+    if g.num_graphs == 1
+        return _topk_matrix(feat, k; rev, sortby)
+    else
+        matrices = [feat[:, g.graph_indicator .== i] for i in 1:(g.num_graphs)]


the masking would be different for edge feature

aurorarossi added 9 commits March 9, 2023 13:10

Add functions

f72cacb

Add test

92e7314

Fix functions

cc0a015

Export functions

6c1abb0

Fix

4d788f2

Simplify test

9de994f

Add docstrings

e69402e

Remove comments

6d2579f

Add topk_edges tests

e10c4a9

CarloLucibello reviewed Mar 10, 2023

View reviewed changes

src/utils.jl Outdated Show resolved Hide resolved

aurorarossi added 2 commits March 10, 2023 13:08

Fix batch case and reorder

eec3a46

Modify test arbitrary node number case

b02dcaa

aurorarossi closed this Mar 28, 2023

aurorarossi force-pushed the add_topk_GPU branch from b02dcaa to 05fca7c Compare March 28, 2023 11:32

Merge branch 'add_topk_GPU' of https://github.com/aurorarossi/GraphNe…

ae88974

…uralNetworks.jl into add_topk_GPU

aurorarossi reopened this Mar 28, 2023

aurorarossi marked this pull request as draft March 28, 2023 12:09

aurorarossi added 5 commits March 28, 2023 17:00

Add tests like to DGL

75b0b8f

Fix to return permutations

226b07d

Change name

24faa9a

Improve docs

87f0430

Add example

da04648

aurorarossi marked this pull request as ready for review March 28, 2023 18:55

Merge branch 'master' into add_topk_GPU

9c29257

aurorarossi marked this pull request as draft March 9, 2024 20:34

Fix function signature

caa3b6e

aurorarossi marked this pull request as ready for review March 10, 2024 10:41

aurorarossi requested a review from CarloLucibello March 11, 2024 09:40

CarloLucibello reviewed Mar 11, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add topk features #260

Add topk features #260

aurorarossi commented Mar 9, 2023

CarloLucibello commented Mar 20, 2023

CarloLucibello commented Mar 11, 2024

aurorarossi commented Mar 11, 2024

CarloLucibello commented Mar 11, 2024

CarloLucibello Mar 11, 2024

CarloLucibello Mar 11, 2024

CarloLucibello Mar 11, 2024

Add topk features #260

Are you sure you want to change the base?

Add topk features #260

Conversation

aurorarossi commented Mar 9, 2023

CarloLucibello commented Mar 20, 2023

CarloLucibello commented Mar 11, 2024

aurorarossi commented Mar 11, 2024

CarloLucibello commented Mar 11, 2024

CarloLucibello Mar 11, 2024

Choose a reason for hiding this comment

CarloLucibello Mar 11, 2024

Choose a reason for hiding this comment

CarloLucibello Mar 11, 2024

Choose a reason for hiding this comment