Use lispy tuples in cat (fixes #21673) #39314

timholy · 2021-01-19T08:48:49Z

The cat pipeline has long had poor inferrability.
Together with #39292 and #39294, this should basically put an
end to that problem.

This (which is on top of #39292) gives me the following for the benchmarks in #21673:

julia> @btime test1(20);
  808.795 ns (18 allocations: 720 bytes)

julia> @btime test2(20);
  28.607 ns (1 allocation: 256 bytes)

which is already a considerable improvement for test1 (I got 2.448 μs before any of these changes, and 1.746 μs with just #39292).
But with this and #39294, I get

julia> @btime test1(20)
  33.313 ns (1 allocation: 256 bytes)

That's close to a 100x improvement over where we started, and within 20% of test2. I think that's good enough to declare #21673 fixed.

The `cat` pipeline has long had poor inferrability. Together with #39292 and #39294, this should basically put an end to that problem. Together, at least in simple cases these make the performance of `cat` essentially equivalent to the manual version. In other words, the `test1` and `test2` of #21673 benchmark very similarly.

jebej · 2021-01-19T16:07:05Z

Thanks for working on this! It appears that 1.6 has regressed substantially compared to 1.5, so hopefully these PRs can be backported.

Sacha0 · 2021-01-19T19:22:45Z

base/abstractarray.jl

+__cat(A, shape, catdims, X...) = __cat_offset!(A, shape, catdims, ntuple(zero, length(shape)), X...)
+
+function __cat_offset!(A, shape, catdims, offsets, x, X...)
+    # splitting the "work" on x from X... may reduce latency (fewer costly specializations)


The `cat` pipeline has long had poor inferrability. Together with #39292 and #39294, this should basically put an end to that problem. Together, at least in simple cases these make the performance of `cat` essentially equivalent to the manual version. In other words, the `test1` and `test2` of #21673 benchmark very similarly. (cherry picked from commit 78d55e2)

The `cat` pipeline has long had poor inferrability. Together with JuliaLang#39292 and JuliaLang#39294, this should basically put an end to that problem. Together, at least in simple cases these make the performance of `cat` essentially equivalent to the manual version. In other words, the `test1` and `test2` of JuliaLang#21673 benchmark very similarly.

The `cat` pipeline has long had poor inferrability. Together with #39292 and #39294, this should basically put an end to that problem. Together, at least in simple cases these make the performance of `cat` essentially equivalent to the manual version. In other words, the `test1` and `test2` of #21673 benchmark very similarly. (cherry picked from commit 78d55e2)

timholy added 2 commits January 19, 2021 03:13

reduce specialization cost (?)

4be707f

timholy force-pushed the teh/more_cat_inferrability branch from 6eb51e3 to 4be707f Compare January 19, 2021 08:57

timholy mentioned this pull request Jan 19, 2021

Improve inferability of shape::Dims for cat #39294

Merged

timholy mentioned this pull request Jan 19, 2021

[question] Multiple calls to @snoopi_deep timholy/SnoopCompile.jl#222

Closed

Sacha0 reviewed Jan 19, 2021

View reviewed changes

JeffBezanson approved these changes Jan 19, 2021

View reviewed changes

JeffBezanson added performance Must go faster backport 1.6 Change should be backported to release-1.6 labels Jan 19, 2021

timholy merged commit 78d55e2 into master Jan 20, 2021

timholy deleted the teh/more_cat_inferrability branch January 20, 2021 06:46

KristofferC removed the backport 1.6 Change should be backported to release-1.6 label Feb 1, 2021

timholy mentioned this pull request Feb 19, 2021

More efficient hvcat of scalars and arrays of numbers #39729

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use lispy tuples in cat (fixes #21673) #39314

Use lispy tuples in cat (fixes #21673) #39314

timholy commented Jan 19, 2021 •

edited

Loading

jebej commented Jan 19, 2021 •

edited

Loading

Sacha0 Jan 19, 2021

Use lispy tuples in cat (fixes #21673) #39314

Use lispy tuples in cat (fixes #21673) #39314

Conversation

timholy commented Jan 19, 2021 • edited Loading

jebej commented Jan 19, 2021 • edited Loading

Sacha0 Jan 19, 2021

Choose a reason for hiding this comment

timholy commented Jan 19, 2021 •

edited

Loading

jebej commented Jan 19, 2021 •

edited

Loading