Setting for the amount of (V)RAM used for upscaling #2088

stonerl · 2023-08-13T16:53:00Z

CPU upscaling & upscaling on Apple Silicon (CPU & GPU)

A value between 20% and 80% of the freely available memory can be chosen for upscaling. If desired, instead of the freely available memory, these values can be applied to the entire available RAM. If a user chooses to do so, a warning will be presented and when upscaling the settings and amount of used RAM will be logged.

For GPU upscaling, the amount of freely available can be set. This setting is only available on Windows and Linux.

MaxTileSize for NCNN on Apple Silicon has been added.

Should fix #876

stonerl · 2023-08-14T10:07:43Z

I also implemented the logic for PyTorch.

To give you a general overview. This feature currently is Apple Silicon only. That is due to the unique memory architecture on newer Macs. Since there is no separation between RAM and VRAM, it is only necessary to set a general limit for usable RAM for upscaling.

The idea is that the minimum amount of RAM left for the system is 8 GB (which means 8 GB Macs are more or less out of the question). The max amount of RAM that can be reserved for the system is 80% of the total system memory.

I did some testing on my 32 GB Mac, and it looks promising.

RAM usage with 8 GB of reserved system memory:

RAM usage with 25.6 GB (80 %) of reserved system memory:

I'd also like to implement this for NCNN and would reuse some code snippets from #2070. Haven't looked into it yet, but is there something similar for ONNX?

In general, I think this approach could be reused for Windows/Linux. But I assume there needs to be a second input Field for VRAM. Furthermore, the description needs to be reworded there, since the RAM usage, there is coupled to the CPU and the reserved system memory basically is only used for CPU upscaling.

@RunDevelopment @joeyballentine, could I get some feedback from you?

backend/src/packages/chaiNNer_pytorch/pytorch/processing/upscale_image.py

RunDevelopment · 2023-08-14T11:26:14Z

I'd also like to implement this for NCNN and would reuse some code snippets from #2070. Haven't looked into it yet, but is there something similar for ONNX?

We don't support estimated tile sizes for ONNX. We just never implemented it.

As for NCNN: it's complicated. We do estimate for NCNN, but that estimation frequently crashed the backend on macs for some reason, so I turned it off (#2006). It will now always estimate a tile size of 256 on mac. This is a good default, because it only needs 3~4GB of (V)RAM for most models.

src/renderer/components/SettingsModal.tsx

joeyballentine · 2023-08-14T12:49:05Z

That is due to the unique memory architecture on newer Macs. Since there is no separation between RAM and VRAM, it is only necessary to set a general limit for usable RAM for upscaling.

That's not unique to macs, anyone upscaling using integrated graphics via NCNN will have the same problem. And of course, anyone upscaling on CPU will only use RAM.

stonerl · 2023-08-14T12:55:26Z

That's not unique to macs, anyone upscaling using integrated graphics via NCNN will have the same problem.

Are you sure? From my understanding, on x86, the integrated GPUs get a share of the main memory. This memory will then be subtracted from the main memory available for the CPU.

e.g., 32 GB Total Memory, 8 GB for the GPU, means the CPU will receive 24 GB of RAM.

On the Mac, that is different because the GPU can have 100% of RAM.

stonerl · 2023-08-14T12:58:30Z

@joeyballentine

Here is a comprehensive comparison:

https://www.sir-apfelot.de/en/compare-shared-memory-unified-memory-41023/

backend/src/packages/chaiNNer_pytorch/pytorch/processing/upscale_image.py

RunDevelopment · 2023-08-14T13:13:34Z

I think Intel integrated graphics also uses the unified memory model. https://www.intel.com/content/www/us/en/support/articles/000020962/graphics.html

stonerl · 2023-08-14T15:34:30Z

Interesting. We would need someone with such a system to test this.

stonerl · 2023-08-14T21:01:18Z

Added calculation for NCNN MaxTileSize on Apple Silicon Macs. Left it 256 for Intel Macs, though.

It seems vkdev.get_heap_budget was reporting values that were far too high.

On my system with vkdev.get_heap_budget:

[2023-08-14 22:49:01.734] [info]  Backend: [92171] [INFO] Estimating memory required: 170.55 GB, 17.07 GB available. Estimated tile size: 1024

And with the psutil.virtual_memory().available and max memory set to 80%.

[2023-08-14 22:44:33.013] [info]  Backend: [88582] [INFO] Estimating memory required: 170.55 GB, 10.78 GB available. Estimated tile size: 512

stonerl · 2023-08-18T22:49:01Z

I squashed the commits and rebased it on main. This is how it looks now.

Windows and Linux

macOS

Log file

[2023-08-19 00:41:40.080] [info]  Backend: [70296] [INFO] Memory limit set to 80.0% of total system memory. (32.0 GB)
[2023-08-19 00:41:40.080] [info]  Backend: [70296] [INFO] Estimating memory required: 14.67 GB, 25.60 GB available. Estimated tile size: 1024
[2023-08-19 00:41:40.084] [info]  Backend: [70296] [INFO] Auto split image (1024x1024px @ 3) with initial tile size (1024, 1024).

JeremyRand · 2023-08-27T18:29:26Z

What's still needed before we can get this merged?

joeyballentine · 2023-08-28T16:01:40Z

Ah crap, my backend settings change heavily conflicted with this... I'll try to come up with a plan for what to do to get this into the new system. Sorry about that

RunDevelopment

I am so sorry. I forgot to submit my review. My comments just sat there for a week...

src/renderer/components/SettingsModal.tsx

stonerl · 2023-09-05T07:53:57Z

What's still needed before we can get this merged?

I'll need to wait for @joeyballentine to implement a general settings mechanism, since he did some major changes to the settings in an earlier PR.

As soon as this is done, I refactor this PR.

joeyballentine · 2023-09-05T13:04:00Z

Yeah, I'll try to get that done as soon as I can

CPU upscaling & upscaling on Apple Silicon (CPU & GPU) A value between 20% and 80% of the freely available memory can be chosen for upscaling. If desired, instead of the freely available memory, these values can be applied to the entire available RAM. If a user chooses to do so, a warning will be presented and when upscaling the settings and amount of used RAM will be logged. For GPU upscaling the amount of freely available can be set. This setting is only available on Windows and Linux. MaxTileSize for NCNN on Apple Silicon has been added. add description for memory limit to upscaling node

stonerl · 2023-10-07T07:13:58Z

@JeremyRand I'm currently very busy IRL. If you want to you can take this over and incorporate your changes. I'm not sure when I can continue working on this.

JeremyRand · 2023-10-12T21:55:36Z

@stonerl If I take this PR over, what remaining work needs to be done? I admit I haven't kept up with the code review on this PR, so it's hard for me to evaluate whether the remaining work is something I can do efficiently.

JeremyRand · 2023-10-12T22:49:05Z

(Or maybe @joeyballentine would be able to answer that?)

joeyballentine · 2023-10-14T04:24:59Z

@JeremyRand I'm not sure what still needs to be done for this. I can take a look tomorrow and try to figure that out though

JeremyRand · 2023-11-26T14:24:23Z

@joeyballentine Any updates?

joeyballentine · 2023-11-26T15:50:25Z

@JeremyRand sorry, we've been focused on the iterator rewrite and now on extracting out our model detection code into a separate package. After we're done with that we gotta finish the iterator rewrite (it's taken way longer than we thought).

After that, we should be able to work on stuff like this

JeremyRand · 2023-11-26T17:15:25Z

Sure, no worries. The iterator rewrite looks really cool, I'm fine with waiting on this PR if it gets the iterator rewrite done faster. :)

This comment was marked as outdated.

Sign in to view

stonerl force-pushed the ram-settings branch 5 times, most recently from 9cd9abf to 39012ae Compare August 14, 2023 09:51

stonerl force-pushed the ram-settings branch 4 times, most recently from f5f1ffa to 1c1996c Compare August 14, 2023 10:57

RunDevelopment reviewed Aug 14, 2023

View reviewed changes

backend/src/packages/chaiNNer_pytorch/pytorch/processing/upscale_image.py Outdated Show resolved Hide resolved

backend/src/packages/chaiNNer_pytorch/pytorch/processing/upscale_image.py Outdated Show resolved Hide resolved

stonerl force-pushed the ram-settings branch from 1c1996c to 343bac0 Compare August 14, 2023 11:32

stonerl commented Aug 14, 2023

View reviewed changes

src/renderer/components/SettingsModal.tsx Outdated Show resolved Hide resolved

stonerl force-pushed the ram-settings branch from 343bac0 to ca99c80 Compare August 14, 2023 12:10

RunDevelopment reviewed Aug 14, 2023

View reviewed changes

backend/src/packages/chaiNNer_pytorch/pytorch/processing/upscale_image.py Outdated Show resolved Hide resolved

stonerl force-pushed the ram-settings branch from ca99c80 to 73d682c Compare August 14, 2023 13:05

stonerl force-pushed the ram-settings branch from 863f22d to 0dbd460 Compare August 14, 2023 15:33

stonerl force-pushed the ram-settings branch from 0dbd460 to 5126eb8 Compare August 14, 2023 15:36

This comment was marked as outdated.

Sign in to view

stonerl marked this pull request as ready for review August 14, 2023 19:52

stonerl changed the title ~~add a UI setting for the amount of ram reserved for the system~~ Setting for the amount of RAM used for upscaling on Apple Silicon Aug 14, 2023

stonerl force-pushed the ram-settings branch 2 times, most recently from 3ab5f42 to 2cef556 Compare August 18, 2023 22:36

stonerl force-pushed the ram-settings branch 2 times, most recently from 8a3b0b7 to 16a24fe Compare August 21, 2023 10:31

stonerl requested a review from RunDevelopment August 28, 2023 07:02

RunDevelopment reviewed Aug 30, 2023

View reviewed changes

src/renderer/components/SettingsModal.tsx Outdated Show resolved Hide resolved

src/renderer/components/SettingsModal.tsx Outdated Show resolved Hide resolved

src/renderer/components/SettingsModal.tsx Outdated Show resolved Hide resolved

stonerl force-pushed the ram-settings branch from 16a24fe to 4d6a53c Compare September 1, 2023 15:37

joeyballentine mentioned this pull request Sep 6, 2023

Add a way to set & get general settings #2189

Merged

stonerl force-pushed the ram-settings branch 4 times, most recently from 70ca2a6 to d4a0027 Compare September 7, 2023 16:51

stonerl marked this pull request as draft September 7, 2023 21:15

stonerl force-pushed the ram-settings branch from d4a0027 to b4566e5 Compare September 23, 2023 11:32

stonerl force-pushed the ram-settings branch from b4566e5 to f652d2b Compare September 23, 2023 11:32

JeremyRand mentioned this pull request Nov 29, 2023

Make ncnn memory budget configurable (v2) #2351

Merged

joeyballentine mentioned this pull request Jan 16, 2024

Add PyTorch memory budget limit setting #2472

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Setting for the amount of (V)RAM used for upscaling #2088

Setting for the amount of (V)RAM used for upscaling #2088

stonerl commented Aug 13, 2023 •

edited

Loading

This comment was marked as outdated.

stonerl commented Aug 14, 2023

RunDevelopment commented Aug 14, 2023

joeyballentine commented Aug 14, 2023

stonerl commented Aug 14, 2023

stonerl commented Aug 14, 2023

RunDevelopment commented Aug 14, 2023

stonerl commented Aug 14, 2023

This comment was marked as outdated.

This comment was marked as outdated.

stonerl commented Aug 14, 2023

stonerl commented Aug 18, 2023 •

edited

Loading

JeremyRand commented Aug 27, 2023

joeyballentine commented Aug 28, 2023

RunDevelopment left a comment

stonerl commented Sep 5, 2023

joeyballentine commented Sep 5, 2023

stonerl commented Oct 7, 2023

JeremyRand commented Oct 12, 2023

JeremyRand commented Oct 12, 2023

joeyballentine commented Oct 14, 2023

JeremyRand commented Nov 26, 2023

joeyballentine commented Nov 26, 2023

JeremyRand commented Nov 26, 2023

Setting for the amount of (V)RAM used for upscaling #2088

Are you sure you want to change the base?

Setting for the amount of (V)RAM used for upscaling #2088

Conversation

stonerl commented Aug 13, 2023 • edited Loading

This comment was marked as outdated.

stonerl commented Aug 14, 2023

RunDevelopment commented Aug 14, 2023

joeyballentine commented Aug 14, 2023

stonerl commented Aug 14, 2023

stonerl commented Aug 14, 2023

RunDevelopment commented Aug 14, 2023

stonerl commented Aug 14, 2023

This comment was marked as outdated.

This comment was marked as outdated.

stonerl commented Aug 14, 2023

stonerl commented Aug 18, 2023 • edited Loading

Windows and Linux

macOS

Log file

JeremyRand commented Aug 27, 2023

joeyballentine commented Aug 28, 2023

RunDevelopment left a comment

Choose a reason for hiding this comment

stonerl commented Sep 5, 2023

joeyballentine commented Sep 5, 2023

stonerl commented Oct 7, 2023

JeremyRand commented Oct 12, 2023

JeremyRand commented Oct 12, 2023

joeyballentine commented Oct 14, 2023

JeremyRand commented Nov 26, 2023

joeyballentine commented Nov 26, 2023

JeremyRand commented Nov 26, 2023

stonerl commented Aug 13, 2023 •

edited

Loading

stonerl commented Aug 18, 2023 •

edited

Loading