
using OneHotArrays #2025

Merged
merged 6 commits into from
Aug 10, 2022

Conversation

@mcabbott (Member) commented Jul 23, 2022

This removes onehot.jl, since the package is now registered (JuliaRegistries/General#64647).

Tests could be removed too.

Maybe the docs still need it, though?

Closes #1544

@mcabbott (Member, Author) commented Jul 24, 2022

The GPU test failure seems to be this:

julia> using CUDA, OneHotArrays, NNlibCUDA

julia> CUDA.allowscalar(false)

julia> x = [1, 3, 2];

julia> y = onehotbatch(x, 0:3)
4×3 OneHotMatrix(::Vector{UInt32}) with eltype Bool:
 ⋅  ⋅  ⋅
 1  ⋅  ⋅
 ⋅  ⋅  1
 ⋅  1  ⋅

julia> y2 = onehotbatch(x |> cu, 0:3)
ERROR: Scalar indexing is disallowed.
Invocation of getindex resulted in scalar indexing of a GPU array.
This is typically caused by calling an iterating implementation of a method.
Such implementations *do not* execute on the GPU, but very slowly on the CPU,
and therefore are only permitted from the REPL for prototyping purposes.
If you did intend to index this array, annotate the caller with @allowscalar.
Stacktrace:
 [1] error(s::String)
   @ Base ./error.jl:33
 [2] assertscalar(op::String)
   @ GPUArraysCore ~/.julia/packages/GPUArraysCore/rSIl2/src/GPUArraysCore.jl:78
 [3] getindex
   @ ~/.julia/packages/GPUArrays/gok9K/src/host/indexing.jl:9 [inlined]
 [4] iterate
   @ ./abstractarray.jl:1144 [inlined]
 [5] iterate
   @ ./abstractarray.jl:1142 [inlined]
 [6] _onehotbatch(data::CuArray{Int64, 1, CUDA.Mem.DeviceBuffer}, labels::NTuple{4, Int64})
   @ OneHotArrays ~/.julia/packages/OneHotArrays/Moo4n/src/onehot.jl:87
 [7] onehotbatch(::CuArray{Int64, 1, CUDA.Mem.DeviceBuffer}, ::UnitRange{Int64})
   @ OneHotArrays ~/.julia/packages/OneHotArrays/Moo4n/src/onehot.jl:84
 [8] top-level scope

This is because #1959 doesn't exist in OneHotArrays.
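The stacktrace shows the problem is `iterate` being called on the device array inside `_onehotbatch`. A minimal sketch of a GPU-friendly alternative (hypothetical, not the actual #1959 patch; it assumes OneHotArrays' `OneHotArray(indices, nlabels)` constructor) would build the index vector by broadcasting per label, so all work stays on the device:

```julia
using OneHotArrays

# Hypothetical sketch, not the real fix: broadcast one comparison per label
# and reduce, instead of iterating the (possibly GPU-resident) data.
function _onehotbatch_sketch(data::AbstractArray, labels)
    # indices[j] == i when data[j] == labels[i], else 0
    indices = sum(UInt32(i) .* (data .== l) for (i, l) in enumerate(labels))
    any(iszero, indices) && error("some data items are not in labels")
    return OneHotArray(indices, length(labels))
end

_onehotbatch_sketch([1, 3, 2], 0:3) == onehotbatch([1, 3, 2], 0:3)  # true on CPU
```

This does `length(labels)` passes over the data, which is fine for small label sets but not how a tuned implementation would do it.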

@mcabbott (Member, Author) commented Jul 24, 2022

The downstream failure for Transformers seems to be related:

Precompiling project...
  ✗ Transformers
  0 dependencies successfully precompiled in 13 seconds (102 already precompiled)
1 dependency errored. To see a full report either run `import Pkg; Pkg.precompile()` or load the package
     Testing Running tests...
WARNING: both PrimitiveOneHot and Flux export "OneHotArray"; uses of it in module Basic must be qualified
ERROR: LoadError: UndefVarError: OneHotArray not defined

Maybe it shouldn't export OneHotArray? Cc @chengchingwen

@chengchingwen (Member) commented Jul 25, 2022

Personally, I would prefer not having OneHotArray exported from Flux. And usually you won't need it either, because the most-used API for one-hot encoding is onehot and onehotbatch.

By the way, exporting onehot would also conflict with Enzyme.onehot.
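For reference, the usual encode/decode round-trip with that API (as exported by OneHotArrays) looks like:

```julia
using OneHotArrays

v = onehot(:b, (:a, :b, :c))     # 3-element OneHotVector, hot at index 2
m = onehotbatch([1, 3, 2], 0:3)  # 4×3 OneHotMatrix over the labels 0:3
onecold(m, 0:3)                  # decodes back to [1, 3, 2]
```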

@chengchingwen (Member) commented:

Just out of curiosity, do we really need a direct dependency on OneHotArrays.jl? It seems that none of the code in Flux explicitly needs OneHotArray. It could be a completely separate package, and people could just call `using OneHotArrays` directly, though that would be a breaking change.

@mcabbott (Member, Author) commented:

OK. I think nothing was exported before, so for now this PR shouldn't call `@reexport`; not sure why I put that in initially.

And longer term, indeed, there's no strong reason for Flux to depend on this. Maybe Flux@0.14 can simply drop it?

@chengchingwen (Member) commented:

> And longer term, indeed, there's no strong reason for Flux to depend on this. Maybe Flux@0.14 can simply drop it?

Then do we need a deprecation warning for accessing those functions from Flux in the next patch release?

@mcabbott (Member, Author) commented:

CI tells me that the Embedding layer has special methods for OneHotArrays. So it can't trivially be removed.

Possibly those methods could be changed to dispatch on `AbstractMatrix{<:Bool}` and call generic `*` etc., but not in this PR.
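A sketch of what that generic dispatch might look like (hypothetical, not Flux's actual method; it assumes the `weight` field of Flux's Embedding layer):

```julia
using Flux

# Hypothetical sketch: dispatch on any boolean matrix rather than the
# concrete OneHotMatrix type, relying on generic matrix multiplication.
# For a one-hot column this matmul reduces to selecting a column of `weight`.
(m::Flux.Embedding)(x::AbstractMatrix{Bool}) = m.weight * x
```

The trade-off is that a generic boolean matmul may be slower than the gather-based methods specialized on one-hot types.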

@chengchingwen (Member) commented:

Also these:

function (m::RNNCell{F,A,V,<:AbstractMatrix{T}})(h, x::Union{AbstractVecOrMat{T},OneHotArray}) where {F,A,V,T}

I think OneHotArray should overload * and NNlib.gather so we can just call those functions.
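Roughly, those overloads exploit the fact that multiplying by a one-hot matrix is a column gather (OneHotArrays.jl does define `*` methods along these lines; the helper below is illustrative only):

```julia
using OneHotArrays, NNlib

# Illustrative only: A * B for a one-hot B just picks columns of A,
# so no dense matmul ever needs to be materialized.
onehot_mul(A::AbstractMatrix, B::OneHotMatrix) = NNlib.gather(A, onecold(B))

A = [1.0 2.0 3.0 4.0;
     5.0 6.0 7.0 8.0]
B = onehotbatch([1, 3, 2], 0:3)
onehot_mul(A, B) == A * B  # true: both select columns 2, 4, 3
```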

@Saransh-cpp (Member) commented Jul 25, 2022

OneHotArrays should also be added as a docs dependency, or the API reference won't show up in Flux's docs (a missing docstring is only a warning, so the doc test won't error out). We could also remove the page completely, given that we are not re-exporting the package, but there should be some reference to the package in Flux's docs. Maybe add it to ecosystem.md if we decide to remove the page?

(I can take this up in another PR or the changes can be made to this PR itself!)

@mcabbott (Member, Author) commented:

Sorting out docs in another PR sounds great.

Not so clear whether it wants to be included like NNlib / MLUtils or pushed out to ecosystem.md; maybe that depends on whether Flux@0.14 is going to load it at all.

The goal of this PR is then only to remove the code, so that we don't have two current versions -- e.g. #1959 happened after the package was created, which is confusing.

@darsnack (Member) left a review comment:
I think with a rebase this should be good to go. The longer we wait, the more things will depend on the Flux-internal version (e.g. #2031).

The downstream errors appear unrelated (FastAI and Metalhead for sure). AtomicGraphNets.jl has not run CI for 2 months, but the error appears to be related to the SciML stack.

@darsnack (Member) commented:

I just ran the AtomicGraphNets.jl tests locally against the current release; they throw the same errors, so we can safely ignore them. @rkurchin you may want to look into those.

@mcabbott mcabbott merged commit 1914f38 into FluxML:master Aug 10, 2022
@mcabbott mcabbott deleted the rm_onehot branch August 10, 2022 16:20
Saransh-cpp pushed a commit to Saransh-cpp/Flux.jl that referenced this pull request Aug 11, 2022
* using OneHotArrays

* rm tests

* skip a test

* don't export, add depwarns

* back to using
Successfully merging this pull request may close this issue: OneHotArrays.jl?

4 participants