Fix the performance of `reinterpretarray` with simultaneous reshaping #37559

This addresses longstanding performance problems with `reinterpret` when `sizeof(eltype(a))` is an integer multiple of `sizeof(T)`. By reshaping the array to have an extra "channel dimension," LLVM can unroll the inner loop thanks to static size information. Conversely, this consumes the initial "channel dimension" if `sizeof(T)` is an integer multiple of `sizeof(eltype(a))`.

In order to make this a subtype of `AbstractCartesianIndex`, formerly this added a useless type parameter `N` to indicate the dimensionality of the array for which this index was constructed. But since none of the code depends on `N`, and it would have forced useless specialization, it seems better to not have it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the performance of `reinterpretarray` with simultaneous reshaping #37559

Fix the performance of `reinterpretarray` with simultaneous reshaping #37559

Commits on Sep 29, 2020

Fix the performance of reinterpretarray with simultaneous reshaping #37559

Fix the performance of reinterpretarray with simultaneous reshaping #37559

Commits on Sep 29, 2020

Fix the performance of `reinterpretarray` with simultaneous reshaping #37559

Fix the performance of `reinterpretarray` with simultaneous reshaping #37559