Use memoryviews and allow nogil in gradient search #455
Conversation
Codecov Report
```
@@           Coverage Diff           @@
##             main     #455   +/-   ##
=======================================
  Coverage   94.28%   94.28%
=======================================
  Files          69       69
  Lines       12388    12388
=======================================
  Hits        11680    11680
  Misses        708      708
```
One small comment, but otherwise if it compiles and works then 👍
Might be nice in the future to not have to do `np.full` everywhere, or, if they are intermediate arrays, to use C-level buffers/arrays, but that's in the future.
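For context, the `np.full` pattern mentioned above pre-allocates an output array filled with NaN so that any pixel the resampler never writes stays marked as invalid. A minimal sketch with NumPy (the sizes here are made-up placeholders, not values from the PR):

```python
import numpy as np

# Hypothetical sizes standing in for the gradient-search output shape.
z_size, y_size, x_size = 2, 4, 5

# Pre-allocate the output filled with NaN so untouched pixels stay invalid.
image = np.full((z_size, y_size, x_size), np.nan, dtype=np.float64)

print(image.shape)
print(bool(np.isnan(image).all()))
```

In Cython this array is then wrapped in a typed memoryview (as in the diff below), which gives fast, GIL-free element access while NumPy still owns the memory.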
```cython
image = np.full([z_size, y_size, x_size], np.nan, dtype=DTYPE)
cdef DTYPE_t [:, :, :] image_view = image
```
You may benefit from defining these as `[:, :, ::1]` when you know they are C contiguous. From my experience, this does require that every function this array is passed to also declares its arguments as `[:, :, ::1]`, which seems dumb to me, but anyway...
My understanding is that Cython can make better indexing and looping choices when it knows that everything is contiguous (makes sense).
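To illustrate what the `::1` declaration constrains: a Cython `[:, :, ::1]` memoryview only accepts C-contiguous buffers, and contiguity is easy to lose through views. A quick check with NumPy flags (plain Python, just to show the property Cython relies on):

```python
import numpy as np

a = np.zeros((4, 5, 6))
# Freshly allocated NumPy arrays are C-contiguous.
print(a.flags['C_CONTIGUOUS'])

# A transposed view is no longer C-contiguous, so a [:, :, ::1]
# memoryview would reject it at assignment time.
b = a.transpose(2, 1, 0)
print(b.flags['C_CONTIGUOUS'])

# np.ascontiguousarray makes a C-contiguous copy when needed.
c = np.ascontiguousarray(b)
print(c.flags['C_CONTIGUOUS'])
```

This is why every function in the call chain has to carry the same `[:, :, ::1]` declaration: the contiguity guarantee must hold end to end for Cython to emit the simpler indexing code.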
Do you have a plot of the differences? A Dask diagnostics plot, if you can.
It might be easier to see if you do multiple datasets at the same time, like running a Satpy Scene with multiple bands loaded and all being resampled at once. Actually yeah, if the nogil sections you have now are mostly run serially, then dask may be making it look like high CPU usage. You could also try passing ... or you forgot to recompile between tests.
I did recompile! Checking now with other settings.
Yeah, that CPU line definitely looks better. The amount that it drops down still makes me think the chunk size could be larger (if the files are a good size for it).
Chunk size is 1024 in the previous examples.
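For a sense of scale, a 1024×1024 chunk is quite small in memory, which supports the suggestion above that larger chunks might reduce scheduler overhead. A back-of-the-envelope calculation (float64 is an assumption for illustration; the actual dtype isn't stated here):

```python
# Memory footprint of a single 1024x1024 dask chunk of float64 data.
chunk_size = 1024
bytes_per_elem = 8  # float64
footprint = chunk_size * chunk_size * bytes_per_elem
print(footprint // 2**20, "MiB")  # 8 MiB per chunk
```

At 8 MiB per chunk, doubling the chunk size to 2048 would give 32 MiB chunks, trading per-task overhead for memory.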
I guess the flat 100% at the beginning is the Satpy file handler creation we've already discussed in pytroll/satpy#2186?
Yes, that looks like it. Far too long imo, we should look at that.
@mraspaud can you give more details on what your code and target AreaDefinition looked like for your last comment's plots? I'm wondering if we can come up with a case that shows a little more improvement.
@djhoese that was SEVIRI data full disk resampled onto a full earth mollweide:

```yaml
moll:
  description: moll
  projection:
    ellps: WGS84
    lon_0: 0.0
    proj: moll
    lat_0: 0.0
  shape:
    height: 4500
    width: 9000
  area_extent:
    lower_left_xy: [-18040095.696147293, -9020047.848073646]
    upper_right_xy: [18040095.696147293, 9020047.848073646]
    units: m
```

Then I tried with FCI but got my computer to crash...
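The area definition above implies a pixel size that can be derived from the extent and shape; interpreting it this way is our reading, but the numbers come straight from the YAML:

```python
# Pixel size implied by the mollweide area definition above
# (values copied from the YAML; units are metres).
width, height = 9000, 4500
x_min, y_min = -18040095.696147293, -9020047.848073646
x_max, y_max = 18040095.696147293, 9020047.848073646

x_res = (x_max - x_min) / width   # metres per pixel in x
y_res = (y_max - y_min) / height  # metres per pixel in y
print(x_res, y_res)
```

Both come out to roughly 4 km per pixel, i.e. comparable to the native SEVIRI full-disk resolution, so the target grid isn't dramatically over- or under-sampled.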
Trying it with ABI to a latlong projection (not the full data region), before this PR and after (plots omitted): I see better CPU usage and a run time faster by almost 30 seconds. This was generating a fully corrected true_color. Note: this is using my "pass through" profiling where I just compute the chunks and throw them away (no saving to disk).
Thanks for trying it out @djhoese! Should we merge?
This PR uses memoryviews and allows nogil in gradient search.