
Adding advanced interface #68

Merged: dfm merged 16 commits into main from advanced on Mar 15, 2024

Conversation

@dfm (Collaborator) commented Mar 13, 2024

This is still very much a work in progress, but it should help with #66 and #67. I'll request feedback ASAP!

@dfm dfm marked this pull request as ready for review March 14, 2024 17:58
@dfm dfm requested a review from lgarrison March 14, 2024 17:58
@dfm (Collaborator, Author) commented Mar 14, 2024

@lgarrison — Here's my first pass at all this. Here's how we would provide the configuration relevant for #67:

from jax_finufft import nufft2, options

opts = options.NestedOpts(
  type1=options.Opts(gpu_method=1),
  type2=options.Opts(gpu_method=2),
)

nufft2(..., opts=opts)

or, equivalently in this case (for nufft2 the forward transform is type 2 and the backward pass uses a type 1 transform):

from jax_finufft import nufft2, options

opts = options.NestedOpts(
  forward=options.Opts(gpu_method=2),
  backward=options.Opts(gpu_method=1),
)

nufft2(..., opts=opts)

It's not all that ergonomic, but I think it's a decent start!

@dfm (Collaborator, Author) commented Mar 14, 2024

I've also managed to break the CUDA compilation with something I did to the includes. I think it has something to do with jax_finufft_gpu.h being included twice; we probably need to move the descriptor definition to a separate header.
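If it is the double include, it's presumably the classic multiple-definition failure. Roughly (with stand-in names, not the actual code):

// jax_finufft_gpu.h (sketch): a full function *definition* living in a header.
int default_opts_impl() { return 0; }

// a.cc and b.cc both contain:
#include "jax_finufft_gpu.h"

// Each object file now carries its own copy of default_opts_impl, so the
// link step fails with "multiple definition of default_opts_impl". Include
// guards don't help here: they only prevent double inclusion within a
// single translation unit, not definitions repeated across two of them.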

@lgarrison (Member) commented Mar 15, 2024

The immediate CUDA compilation error is just that there's no declaration of the default_opts<T> function visible to jax_finufft_gpu.cc. That declaration lives in lib/jax_finufft_gpu.h, but that file can't be included as a header in multiple compilation units because it contains function definitions as well as declarations. To fix this, I did the usual thing of splitting the declarations out into a header and putting the definitions in a source file. I called them cufinufft_wrapper.h and cufinufft_wrapper.cc, since most of that file is about giving the cufinufft functions C++ wrappers. But if we don't want to fix it this way for any reason, let me know!
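In sketch form, the split is just the standard declaration/definition separation (simplified stand-in types and signatures, not the real ones):

// cufinufft_wrapper.h (sketch): declarations and type definitions only,
// safe to include from any number of translation units.
#pragma once

struct Opts {
  int gpu_method = 0;  // stand-in for the real cufinufft options struct
};

// Declared here; the single definition lives in cufinufft_wrapper.cc.
template <typename T>
Opts default_opts();

// cufinufft_wrapper.cc (sketch): the one translation unit holding the
// definitions.
#include "cufinufft_wrapper.h"

template <typename T>
Opts default_opts() {
  return Opts{};
}

// Explicit instantiations, so that default_opts<float> and
// default_opts<double> are emitted for other translation units to
// link against.
template Opts default_opts<float>();
template Opts default_opts<double>();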

Some CUDA tests fail locally with a CUDA illegal memory access. Not yet sure whether it's a problem with the opts or with this header refactoring.

@lgarrison (Member) commented:

The problem was indeed with the header refactoring. The y_index and z_index functions have generic templated definitions in the header, as well as template specializations in the source file. But if the specializations aren't declared in the header, then the compiler won't know it needs to look for the specializations and will just use the generic version.

I don't like my solution; it feels fragile to me! It's too easy to write a specialization in the source file that gets silently ignored. Not sure if there's a better pattern we should be using here.
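Distilled down, the pattern now in place looks like this (stand-in names again, not the actual code):

// cufinufft_wrapper.h (sketch)
#pragma once

// Generic fallback: fine to define in the header, since function templates
// can be instantiated in multiple translation units.
template <typename T>
int y_index(int i) { return i; }

// The fragile-but-necessary line: declare the specialization in the header.
// Without this declaration, a translation unit that only sees the generic
// template silently instantiates it, ignoring the specialization below
// (formally that's an ODR violation, with no diagnostic required).
template <>
int y_index<double>(int i);

// cufinufft_wrapper.cc (sketch)
#include "cufinufft_wrapper.h"

template <>
int y_index<double>(int i) { return 2 * i; }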

@dfm (Collaborator, Author) commented Mar 15, 2024

Thanks for tracking this down @lgarrison!! I'll take a look this afternoon.

@lgarrison (Member) commented:

I can confirm that the opts are working for me and that they fix the performance issue from #67.

@dfm (Collaborator, Author) commented Mar 15, 2024

Thanks @lgarrison! I think that the approach you came up with here is totally fine. I agree that it's not very elegant, but I think we should just roll with it and revisit only if we need to later. It's possible that the whole library could benefit from some refactoring, but let's not let that get in the way of merging this. With that in mind, I'm going to merge this now!

This fixes #67, but let's leave #66 open until we add info to the README.

@dfm dfm linked an issue Mar 15, 2024 that may be closed by this pull request
@dfm dfm merged commit ef69daa into main Mar 15, 2024
5 checks passed
@dfm dfm deleted the advanced branch March 15, 2024 19:19