Speedup and fix of multiplication by OneHotMatrix #1756

racinmat · 2021-10-25T22:00:53Z

PR Checklist

Tests are added
Entry in NEWS.md
Documentation, if applicable
API changes require approval from a committer (different from the author, if applicable)

Fixes #1355 .
Also fixes bug mentioned in #1355 (comment).
Adds tests for both gpu and cpu.
Adds multiplication by sparse matrix to benchmarks.

Also fixed gradient calculation on GPU.

perf/sparse_input.jl

perf/runbenchmarks.jl

test/cuda/cuda.jl

src/onehot.jl

test/onehot.jl

Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>

test/cuda/cuda.jl

test/onehot.jl

Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

src/onehot.jl

Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

test/cuda/cuda.jl

CarloLucibello · 2021-10-26T08:54:45Z

@racinmat do you want to include #1355 (comment) as well?

Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

racinmat · 2021-10-26T09:13:45Z

yes. I want, pushing right now.

CarloLucibello · 2021-10-26T12:10:03Z

@racinmat gpu tests (buildkite) are still failing, not sure why

racinmat · 2021-10-26T16:10:35Z

I see, I messed up dimension validation before multiplicating by the adjoint, fixing it and adding it also to cpu tests.

Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

src/onehot.jl

Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>

…o break it.

racinmat · 2021-10-26T16:53:13Z

I'm a bit unsure about adding code which would work for ReshapedArrays without actually adding testcase which would exercise that code and called the method with reshaped array. Is it ok, or should I add tests for it?

ToucheSir · 2021-10-26T17:10:05Z

That's a good question, are we clear on what the semantics would be with reshaped arrays?

darsnack · 2021-10-26T17:17:57Z

Reshaped arrays in this context means N-d OneHotArrays that are reshaped to look like OneHotMatrix with the first dimension untouched.

So using

reshape(OneHotArray(rand(1:10, 5, 5), 10), 10, :)

should work for adding tests to exercise it. I do think we should add tests.

racinmat · 2021-10-26T21:07:18Z

During testing reshaped arrays I realized there is no _onehot implemented for adjoint of reshaped onehot matrix and I realized I got a bit lost there.
So I guess I'll need a bit of help.
There should be sth. like _isonehot(B) || return invoke(*, Tuple{AbstractMatrix, AbstractMatrix}, A, B), right? But how should _isonehot look like to distinguish e.g. these?

julia>   b4 = reshape(Flux.OneHotMatrix([1 2 3; 2 2 1], 3), 3, :)
3×6 reshape(OneHotArray(::Matrix{Int64}), 3, 6) with eltype Bool:
 1  ⋅  ⋅  ⋅  ⋅  1
 ⋅  1  1  1  ⋅  ⋅
 ⋅  ⋅  ⋅  ⋅  1  ⋅

julia>   b5 = reshape(b4, 6, :)
6×3 reshape(OneHotArray(::Matrix{Int64}), 6, 3) with eltype Bool:
 1  0  0
 0  1  0
 0  0  1
 0  0  1
 1  1  0
 0  0  0

julia>   b5'
3×6 adjoint(reshape(OneHotArray(::Matrix{Int64}), 6, 3)) with eltype Bool:
 1  0  0  0  1  0
 0  1  0  0  1  0
 0  0  1  1  0  0

I thought I could check indices, but they are same for both of them, and one is onehot and the other is not.

I'm currently not sure which way to go:

Should I add these cases to tests and try to make it work in this PR? I don't know how to make efficient check for _isonehot for adjoint reshaped arrays, so I'm a bit lost there.
Should I make the new method for adjoint only for OneHotMatrix and support for reshaped arrays would be left for separate PR and add these test cases so we know we need to cover them?

racinmat · 2021-10-27T11:53:54Z

In the end I decided I will keep it as function Base.:(*)(A::AbstractMatrix, B::Adjoint{Bool, <:OneHotMatrix}), but I added tests for multiplication by reshaped onehot matrix so in the future people would have test data to see if some additional otimized version would behave correctly.

CarloLucibello · 2021-10-28T09:07:44Z

bors r+

bors · 2021-10-28T09:31:35Z

Build succeeded:

buildkite/flux-dot-jl

racinmat and others added 3 commits October 25, 2021 18:18

adding commented out sparse optimizations

da9fbba

Speedup of multiplication by OneHotMatrix.

9427363

Also fixed gradient calculation on GPU.

return benchs

a1f35a2

DhairyaLGandhi reviewed Oct 25, 2021

View reviewed changes

perf/sparse_input.jl Outdated Show resolved Hide resolved

removing benchmark which should be in different repo

6d0468e

racinmat requested a review from DhairyaLGandhi October 25, 2021 22:12

DhairyaLGandhi reviewed Oct 25, 2021

View reviewed changes

perf/runbenchmarks.jl Outdated Show resolved Hide resolved

DhairyaLGandhi reviewed Oct 25, 2021

View reviewed changes

test/cuda/cuda.jl Outdated Show resolved Hide resolved

CarloLucibello reviewed Oct 25, 2021

View reviewed changes

src/onehot.jl Outdated Show resolved Hide resolved

racinmat added 2 commits October 26, 2021 00:22

removed unrelated code, fixed typo, fixed NEWS entry.

5c31069

defining special method in order to keep reshaped arrays untouched

60d2516

racinmat requested review from CarloLucibello and DhairyaLGandhi October 25, 2021 22:27

ToucheSir reviewed Oct 25, 2021

View reviewed changes

test/onehot.jl Outdated Show resolved Hide resolved

Update test/onehot.jl

6e79d64

Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>

CarloLucibello reviewed Oct 25, 2021

View reviewed changes

test/cuda/cuda.jl Outdated Show resolved Hide resolved

ToucheSir reviewed Oct 25, 2021

View reviewed changes

test/onehot.jl Outdated Show resolved Hide resolved

Update test/cuda/cuda.jl

7268c89

Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

CarloLucibello reviewed Oct 26, 2021

View reviewed changes

src/onehot.jl Show resolved Hide resolved

Update src/onehot.jl

5f7ce6b

Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

racinmat requested review from ToucheSir and CarloLucibello October 26, 2021 06:44

CarloLucibello reviewed Oct 26, 2021

View reviewed changes

test/cuda/cuda.jl Outdated Show resolved Hide resolved

Update test/cuda/cuda.jl

931d409

Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

racinmat added 2 commits October 26, 2021 11:14

adding optimization of multiplication by adjoint

2507126

Merge branch 'master' of https://github.com/racinmat/Flux.jl

644bd1a

racinmat requested a review from CarloLucibello October 26, 2021 09:15

racinmat and others added 3 commits October 26, 2021 18:17

fixed dimension check, added tests to check different dimensionality

5ac7a3d

Update src/onehot.jl

75f0e9c

Co-authored-by: Carlo Lucibello <carlo.lucibello@gmail.com>

using gather for OneHotLike

7ce132f

ToucheSir reviewed Oct 26, 2021

View reviewed changes

src/onehot.jl Outdated Show resolved Hide resolved

ToucheSir reviewed Oct 26, 2021

View reviewed changes

src/onehot.jl Outdated Show resolved Hide resolved

Update src/onehot.jl

f221ee9

Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>

racinmat requested review from ToucheSir and darsnack October 26, 2021 16:29

racinmat added 3 commits October 26, 2021 18:32

using different method for onehot vector and onehot matrix

0a67472

Merge branch 'master' of https://github.com/racinmat/Flux.jl

fcda965

returning to default implementation for onehot because I don't want t…

6e2da25

…o break it.

dispatching on OneHotLike of dimension 2

775efee

racinmat and others added 2 commits October 26, 2021 23:16

added many tests for reshaped matrices

51b7c00

fixed tests, using only onehot for adjoint

35ab120

CarloLucibello approved these changes Oct 28, 2021

View reviewed changes

bors bot merged commit 69afb67 into FluxML:master Oct 28, 2021

racinmat mentioned this pull request Nov 1, 2021

Remove onehot matrix multiplication if the speedup is not large enough CTUAvastLab/Mill.jl#88

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speedup and fix of multiplication by OneHotMatrix #1756

Speedup and fix of multiplication by OneHotMatrix #1756

racinmat commented Oct 25, 2021

CarloLucibello commented Oct 26, 2021

racinmat commented Oct 26, 2021

CarloLucibello commented Oct 26, 2021

racinmat commented Oct 26, 2021

racinmat commented Oct 26, 2021

ToucheSir commented Oct 26, 2021

darsnack commented Oct 26, 2021

racinmat commented Oct 26, 2021 •

edited

Loading

racinmat commented Oct 27, 2021

CarloLucibello commented Oct 28, 2021

bors bot commented Oct 28, 2021

Speedup and fix of multiplication by OneHotMatrix #1756

Speedup and fix of multiplication by OneHotMatrix #1756

Conversation

racinmat commented Oct 25, 2021

PR Checklist

CarloLucibello commented Oct 26, 2021

racinmat commented Oct 26, 2021

CarloLucibello commented Oct 26, 2021

racinmat commented Oct 26, 2021

racinmat commented Oct 26, 2021

ToucheSir commented Oct 26, 2021

darsnack commented Oct 26, 2021

racinmat commented Oct 26, 2021 • edited Loading

racinmat commented Oct 27, 2021

CarloLucibello commented Oct 28, 2021

bors bot commented Oct 28, 2021

racinmat commented Oct 26, 2021 •

edited

Loading