Add function get_device #269

Merged: 14 commits, merged Mar 2, 2022

Conversation

@oschulz (Collaborator) commented Oct 7, 2021

Closes #268

@vchuravy (Member) commented Oct 7, 2021

bors try

bors bot added a commit that referenced this pull request Oct 7, 2021
bors bot (Contributor) commented Oct 7, 2021

try

Build failed:

Review thread on src/KernelAbstractions.jl (outdated, resolved)
@vchuravy (Member) commented Oct 7, 2021

@jpsamaroo can you unbork AMDGPU on nightly?

@oschulz (Collaborator, Author) commented Oct 7, 2021

In the examples, we could now simplify code like

    if isa(a, Array)
        kernel! = naive_transpose_kernel!(CPU(),4)
    else
        kernel! = naive_transpose_kernel!(CUDADevice(),256)
    end

to

    device = KernelAbstractions.get_device(A)
    n = isa(A, AbstractGPUArray) ? 256 : 4
    kernel! = naive_transpose_kernel!(device,n)

but it would require a dependency (at least for the example) on GPUArrays.jl.

I could also add a function isgpuarray (or similar), then we could do

    device = KernelAbstractions.get_device(A)
    n = KernelAbstractions.isgpuarray(A) ? 256 : 4
    kernel! = naive_transpose_kernel!(device,n)

without additional dependencies. What do you think?

@vchuravy (Member) commented Oct 7, 2021

Hm, we could also add a default static launch size, but it is just an example. What I would do there is not check for GPU-ness, but check for Array.

@oschulz (Collaborator, Author) commented Oct 7, 2021

but check for Array

But what if the user passes a SubArray or similar?

Hm we could also add a default static launch size

My thinking was that the author of naive_transpose!(a, b) can judge best how much parallelism would make sense, given the input size. Now they just need to know what's possible on the hardware side - do we have a cross-vendor API function to query device capabilities?

@oschulz (Collaborator, Author) commented Oct 7, 2021

But as for the example - silly me, we can just do

    device = KernelAbstractions.get_device(A)
    n = device isa GPU ? 256 : 4
    kernel! = naive_transpose_kernel!(device, n)

Very literate, I think. :-)
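
A minimal end-to-end sketch of that pattern, for context (assuming the event-based launch API KernelAbstractions used at the time; the kernel body and sizes are illustrative, not the PR's example code, and it runs on the CPU so no GPU package is needed):

    using KernelAbstractions

    # Naive transpose: each work item copies one element into the transposed slot.
    @kernel function naive_transpose_kernel!(b, @Const(a))
        i, j = @index(Global, NTuple)
        @inbounds b[j, i] = a[i, j]
    end

    a = rand(Float32, 1024, 1024)
    b = similar(a, reverse(size(a)))

    device = KernelAbstractions.get_device(a)        # CPU() for a plain Array
    n = device isa KernelAbstractions.GPU ? 256 : 4  # workgroup size
    kernel! = naive_transpose_kernel!(device, n)
    wait(kernel!(b, a, ndrange = size(a)))           # kernels return an event to wait on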

@oschulz (Collaborator, Author) commented Oct 7, 2021

I changed the examples accordingly.

@oschulz (Collaborator, Author) commented Oct 8, 2021

Looks like bors is still running. :-)

@vchuravy (Member) commented Oct 8, 2021

bors try

bors bot added a commit that referenced this pull request Oct 8, 2021
bors bot (Contributor) commented Oct 8, 2021

@oschulz (Collaborator, Author) commented Oct 14, 2021

@vchuravy , does this design look better to you now?

@vchuravy (Member) left a review comment

I guess so. I am not a fan that this requires us to pull LinearAlgebra and SparseArrays in, even though these are currently stdlibs. I would rather have the question of "what is your storage array" be solved somewhere like Adapt.jl.
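
To illustrate the dependency concern, a hypothetical sketch of the kind of per-wrapper unwrapping methods that pull those stdlibs in - this is not the PR's actual method list, just an illustration:

    using KernelAbstractions: CPU
    using LinearAlgebra: Adjoint, Transpose
    using SparseArrays: SparseMatrixCSC

    # Without an Adapt.jl-style "what is your storage array" query, every wrapper
    # type needs its own method that recurses into the underlying storage.
    get_device(A::Array) = CPU()
    get_device(A::SubArray) = get_device(parent(A))
    get_device(A::Adjoint) = get_device(parent(A))
    get_device(A::Transpose) = get_device(parent(A))
    get_device(A::SparseMatrixCSC) = get_device(A.nzval)  # device of the stored values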

Review thread on src/KernelAbstractions.jl (outdated, resolved)
@vchuravy (Member) commented

bors try

bors bot added a commit that referenced this pull request Oct 17, 2021
bors bot (Contributor) commented Oct 17, 2021

try

Build failed:

@oschulz (Collaborator, Author) commented Oct 17, 2021

I would rather have the question of "what is your storage array" be solved somewhere like Adapt.jl

Yes, that would be nice - I've also been thinking that Adapt.jl could, in the future, get a "read/inspect" functionality in addition to its "write/transform" functionality. In the end, the designer of a data structure knows best what "underlying" data should mean (especially if there are several arrays), so the clean way of doing a recursive "where is this stored?" query would be to implement an "inspect" companion to adapt().
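
A rough sketch of what such an "inspect" companion could look like - all names here are hypothetical, not an existing Adapt.jl API: the author of a composite type declares which field holds the underlying data, and a recursive query walks down to the storage array:

    # Hypothetical read-only counterpart to adapt(); names are illustrative only.
    storage_array(a::Array) = a
    storage_array(a::SubArray) = storage_array(parent(a))

    struct Measurements{T<:AbstractVector}
        samples::T
        label::String
    end

    # The author of the type decides what its "underlying" data is.
    storage_array(m::Measurements) = storage_array(m.samples)

    m = Measurements(view(rand(100), 1:10), "run 1")
    storage_array(m) isa Array  # true; a GPU-backed samples field would recurse to its device array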

@oschulz (Collaborator, Author) commented Oct 18, 2021

Is bors hanging?

@vchuravy (Member) commented

bors try

bors bot added a commit that referenced this pull request Oct 18, 2021
bors bot (Contributor) commented Oct 18, 2021

try

Build failed:

@oschulz (Collaborator, Author) commented Oct 19, 2021

Is bors stalled again?

@vchuravy (Member) commented

You can click on the red x next to the "Try #": https://github.com/JuliaGPU/KernelAbstractions.jl/runs/3927630339

@jpsamaroo (Member) commented

ROCm CI got hung after I updated the local buildkite repo; I've restarted the runners, which are now processing jobs.

@oschulz (Collaborator, Author) commented Oct 19, 2021

Thanks!

@oschulz (Collaborator, Author) commented Feb 14, 2022

I think the remaining test failures are unrelated to this PR (correct?).

@oschulz (Collaborator, Author) commented Feb 15, 2022

I saw JuliaGPU/AMDGPU.jl#187 got merged with a new AMDGPU.jl release. Should we try bors again?

@oschulz (Collaborator, Author) commented Feb 16, 2022

@jpsamaroo , could you kick bors again, with the new AMD fixes?

@jpsamaroo (Member) commented

@oschulz you would need to rebase on #288, although that PR isn't yet CI-green.

@oschulz (Collaborator, Author) commented Feb 16, 2022

rebase on #288, although that PR isn't yet CI-green.

Ah, OK - maybe it's easiest to just wait until #288 is in, then?

Review thread on Project.toml (outdated, resolved)
@oschulz (Collaborator, Author) commented Mar 1, 2022

I saw you made some changes, @vchuravy. Do I need to do anything in addition?

(Am I correct that the current test failures are unrelated to this PR?)

@vchuravy (Member) commented Mar 2, 2022

The ROCm failure looks related; only Julia nightly is currently expected to fail.

@vchuravy merged commit 5c1444f into JuliaGPU:master on Mar 2, 2022
@vchuravy (Member) commented Mar 2, 2022

Thank you Oliver!

@oschulz (Collaborator, Author) commented Mar 3, 2022

Thank you Valentin!

Development

Successfully merging this pull request may close these issues.

Adding a function to get device from array (type)?