GPU Memory-Usage, full finetuning vs LoRA vs LoRA + CPU Offloading #1804
Replies: 1 comment 1 reply
-
It's great work! Hi, how do you calculate the GPU memory usage for the different methods?
In addition, with the LoRA method the base parameters go through the forward pass; do the base parameters also go through the backward pass?
-
Hi!
Since the backward pass is based on the chain rule, I would say so; however, gradients are only computed for the LoRA parameters.
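A minimal plain-PyTorch sketch of this point (the `LoRALinear` module below is hypothetical, not this project's implementation): the frozen base weight sits in the autograd graph, so the backward pass flows through it to earlier layers, but no gradient tensor is stored for it; only the LoRA factors get a `.grad`.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Hypothetical LoRA layer, for illustration only."""
    def __init__(self, in_features, out_features, rank=8):
        super().__init__()
        self.base = nn.Linear(in_features, out_features, bias=False)
        self.base.weight.requires_grad_(False)               # frozen base weight
        self.lora_a = nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(out_features, rank))

    def forward(self, x):
        # y = x W^T + (x A^T) B^T: the frozen W is part of the graph, so the
        # chain rule carries gradients *through* it, but none are stored *for* it.
        return self.base(x) + (x @ self.lora_a.T) @ self.lora_b.T

layer = LoRALinear(16, 16)
x = torch.randn(4, 16, requires_grad=True)
layer(x).sum().backward()

print(layer.base.weight.grad)    # None: no gradient stored for the frozen base weight
print(layer.lora_a.grad.shape)   # torch.Size([8, 16]): LoRA factor does get a gradient
print(x.grad.shape)              # torch.Size([4, 16]): backward still flowed through W
```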
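On the memory-calculation part of the question, here is a rough back-of-envelope sketch, under assumed conditions (bf16 weights and gradients, an Adam-style optimizer with two fp32 states per trainable parameter, illustrative model and adapter sizes; activations and framework overhead are ignored), of how the trainable-state footprint differs between full finetuning and LoRA.

```python
GB = 1024 ** 3

def trainable_state_gb(total_params, trainable_params,
                       weight_bytes=2,   # bf16 weights resident on GPU
                       grad_bytes=2,     # bf16 gradients, only for trainable params
                       optim_bytes=8):   # Adam-style: two fp32 states per trainable param
    weights = total_params * weight_bytes
    grads = trainable_params * grad_bytes
    optim_states = trainable_params * optim_bytes
    return (weights + grads + optim_states) / GB

total_params = 7e9   # assumed ~7B-parameter base model
lora_params = 20e6   # assumed ~20M trainable LoRA parameters

print(f"full finetuning: {trainable_state_gb(total_params, total_params):.1f} GB")
print(f"LoRA:            {trainable_state_gb(total_params, lora_params):.1f} GB")
# With CPU offloading, the optimizer states (and possibly the gradients) of the
# trainable parameters live in host RAM, shrinking the GPU-resident figure further.
```

Activation memory, which depends on batch size, sequence length, and whether activation checkpointing is enabled, comes on top of these figures in all three setups.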