
Improve inferability of shape::Dims for cat #39294

Merged: 2 commits merged into master from teh/cat_shape on Jan 19, 2021
Conversation

@timholy (Sponsor, Member) commented Jan 17, 2021

`cat` is often called with Varargs or heterogeneous inputs,
and in such cases inference almost always fails. Even when all the arrays
are of the same type, if the number of varargs isn't known,
inference typically fails. The culprit is probably #36454.

This reduces the number of failures considerably, by avoiding
the creation of vararg-length tuples in the shape-inference pipeline.
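
As a hedged illustration of the situation described above (not code from this PR or its tests; the helper name is hypothetical), one way to check whether a vararg `cat` call infers is with `Test.@inferred` or `Base.return_types`:

using Test

# Hypothetical helper: the element type is concrete, but the number of
# arrays being concatenated is only known at run time.
concat_rows(arrays::Vector{Matrix{Float64}}) = cat(arrays...; dims=1)

A = [rand(2, 3) for _ in 1:4]

# With improved inferability this returns a Matrix{Float64}; on builds
# where inference of the vararg call fails, @inferred throws instead.
@inferred concat_rows(A)

# Or inspect the inferred return type directly:
Base.return_types(concat_rows, (Vector{Matrix{Float64}},))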
Review comment on base/abstractarray.jl (outdated, resolved)
@jebej (Contributor) commented Jan 18, 2021

Would this help with #21673?

@timholy (Sponsor, Member, Author) commented Jan 18, 2021

Seems likely that at least one of this and #39292 should help, but I haven't tested.

@timholy (Sponsor, Member, Author) commented Jan 19, 2021

For me, on the benchmark in #21673, master yields

julia> @btime test1(20)
  2.448 μs (34 allocations: 1.56 KiB)

whereas this branch yields

julia> @btime test1(20)
  1.052 μs (15 allocations: 864 bytes)

and the branch in #39292 yields

julia> @btime test1(20)
  1.746 μs (26 allocations: 1.14 KiB)

So they both help.
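
(The definition of `test1` is in #21673 and is not reproduced in this thread. Purely as a hypothetical sketch, a cat-heavy micro-benchmark of this general shape could be timed the same way with BenchmarkTools; the function below is not the one measured above.)

using BenchmarkTools

# Hypothetical stand-in, not the test1 from #21673: build a 2×n matrix by
# splatting n small vectors into hcat, the vararg pattern this PR targets.
test1(n) = hcat((Float64[i, i + 1] for i in 1:n)...)

@btime test1(20)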

Review comment on base/abstractarray.jl (outdated, resolved)
timholy added a commit that referenced this pull request Jan 19, 2021
The `cat` pipeline has long had poor inferrability.
Together with #39292 and #39294, this should basically put an
end to that problem.

Together, at least in simple cases these make the performance
of `cat` essentially equivalent to the manual version.
In other words, the `test1` and `test2` of #21673 benchmark
very similarly.
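
To make "the manual version" concrete, here is a hedged sketch (not the code from #21673) of a hand-rolled alternative to a splatted `vcat`; the point of the inference fixes is that the generic call should now perform comparably to code written in this style:

# Hypothetical manual concatenation: preallocate the result and copy each
# block into place, avoiding `vcat` entirely.
function manual_vcat(xs::Vector{Vector{Float64}})
    out = Vector{Float64}(undef, sum(length, xs))
    i = 1
    for x in xs
        copyto!(out, i, x, 1, length(x))
        i += length(x)
    end
    return out
end

xs = [rand(3) for _ in 1:20]
manual_vcat(xs) == vcat(xs...)   # same result via different code paths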
@timholy (Sponsor, Member, Author) commented Jan 19, 2021

The final fix is in #39314.

@timholy merged commit 815076b into master Jan 19, 2021
@timholy deleted the teh/cat_shape branch January 19, 2021 19:19
@timholy added the backport 1.6 label (Change should be backported to release-1.6) Jan 19, 2021
timholy added a commit that referenced this pull request Jan 20, 2021
The `cat` pipeline has long had poor inferrability.
Together with #39292 and #39294, this should basically put an
end to that problem.

Together, at least in simple cases these make the performance
of `cat` essentially equivalent to the manual version.
In other words, the `test1` and `test2` of #21673 benchmark
very similarly.
KristofferC pushed a commit that referenced this pull request Jan 20, 2021
`cat` is often called with Varargs or heterogenous inputs,
and inference almost always fails. Even when all the arrays
are of the same type, if the number of varargs isn't known
inference typically fails. The culprit is probably #36454.

This reduces the number of failures considerably, by avoiding
creation of vararg length tuples in the shape-inference pipeline.

(cherry picked from commit 815076b)
KristofferC pushed a commit that referenced this pull request Jan 20, 2021
The `cat` pipeline has long had poor inferrability.
Together with #39292 and #39294, this should basically put an
end to that problem.

Together, at least in simple cases these make the performance
of `cat` essentially equivalent to the manual version.
In other words, the `test1` and `test2` of #21673 benchmark
very similarly.

(cherry picked from commit 78d55e2)
@KristofferC mentioned this pull request Jan 20, 2021 (60 tasks)
@KristofferC removed the backport 1.6 label (Change should be backported to release-1.6) Feb 1, 2021
KristofferC pushed a commit that referenced this pull request Feb 1, 2021
`cat` is often called with Varargs or heterogenous inputs,
and inference almost always fails. Even when all the arrays
are of the same type, if the number of varargs isn't known
inference typically fails. The culprit is probably #36454.

This reduces the number of failures considerably, by avoiding
creation of vararg length tuples in the shape-inference pipeline.

(cherry picked from commit 815076b)
KristofferC pushed a commit that referenced this pull request Feb 1, 2021
The `cat` pipeline has long had poor inferrability.
Together with #39292 and #39294, this should basically put an
end to that problem.

Together, at least in simple cases these make the performance
of `cat` essentially equivalent to the manual version.
In other words, the `test1` and `test2` of #21673 benchmark
very similarly.

(cherry picked from commit 78d55e2)
ElOceanografo pushed a commit to ElOceanografo/julia that referenced this pull request May 4, 2021
`cat` is often called with Varargs or heterogenous inputs,
and inference almost always fails. Even when all the arrays
are of the same type, if the number of varargs isn't known
inference typically fails. The culprit is probably JuliaLang#36454.

This reduces the number of failures considerably, by avoiding
creation of vararg length tuples in the shape-inference pipeline.
ElOceanografo pushed a commit to ElOceanografo/julia that referenced this pull request May 4, 2021
The `cat` pipeline has long had poor inferrability.
Together with JuliaLang#39292 and JuliaLang#39294, this should basically put an
end to that problem.

Together, at least in simple cases these make the performance
of `cat` essentially equivalent to the manual version.
In other words, the `test1` and `test2` of JuliaLang#21673 benchmark
very similarly.
antoine-levitt pushed a commit to antoine-levitt/julia that referenced this pull request May 9, 2021
`cat` is often called with Varargs or heterogenous inputs,
and inference almost always fails. Even when all the arrays
are of the same type, if the number of varargs isn't known
inference typically fails. The culprit is probably JuliaLang#36454.

This reduces the number of failures considerably, by avoiding
creation of vararg length tuples in the shape-inference pipeline.
antoine-levitt pushed a commit to antoine-levitt/julia that referenced this pull request May 9, 2021
The `cat` pipeline has long had poor inferrability.
Together with JuliaLang#39292 and JuliaLang#39294, this should basically put an
end to that problem.

Together, at least in simple cases these make the performance
of `cat` essentially equivalent to the manual version.
In other words, the `test1` and `test2` of JuliaLang#21673 benchmark
very similarly.
staticfloat pushed a commit that referenced this pull request Dec 23, 2022
`cat` is often called with Varargs or heterogenous inputs,
and inference almost always fails. Even when all the arrays
are of the same type, if the number of varargs isn't known
inference typically fails. The culprit is probably #36454.

This reduces the number of failures considerably, by avoiding
creation of vararg length tuples in the shape-inference pipeline.

(cherry picked from commit 815076b)
staticfloat pushed a commit that referenced this pull request Dec 23, 2022
The `cat` pipeline has long had poor inferrability.
Together with #39292 and #39294, this should basically put an
end to that problem.

Together, at least in simple cases these make the performance
of `cat` essentially equivalent to the manual version.
In other words, the `test1` and `test2` of #21673 benchmark
very similarly.

(cherry picked from commit 78d55e2)