
fix for itemsize => element_size() for torch backwards compat #30133

Merged: 6 commits into huggingface:main on Apr 23, 2024

Conversation

@winglian (Contributor) commented Apr 9, 2024

This is a similar fix to huggingface/peft#1630.

.itemsize on a tensor is only supported on torch>=2.1

Fixes: #30304

Edit by @younesbelkada:

This PR makes sure the checks are compatible with earlier versions of PyTorch, which I overlooked in #30162.
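
For reference, a minimal standalone sketch (not code from this PR) of the compatibility gap described above: Tensor.element_size() is the long-standing API, while Tensor.itemsize only exists as an alias on torch >= 2.1.

import torch

# Standalone illustration (the float16 tensor is just an example)
t = torch.zeros(4, dtype=torch.float16)

print(t.element_size())  # 2 bytes, available on older torch releases as well
if hasattr(t, "itemsize"):
    # Tensor.itemsize is an alias for element_size() on torch >= 2.1 only
    print(t.itemsize)    # 2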

@winglian (Contributor, Author) commented Apr 9, 2024

@pacman100 @BenjaminBossan

@@ -1160,7 +1160,7 @@ def num_parameters(self, only_trainable: bool = False, exclude_embeddings: bool
                 # used for the 4bit quantization (uint8 tensors are stored)
                 if is_loaded_in_4bit and isinstance(param, bnb.nn.Params4bit):
                     total_numel.append(
-                        param.numel() * 2 * self.hf_quantizer.quantization_config.bnb_4bit_quant_storage.itemsize
+                        param.numel() * 2 * self.hf_quantizer.quantization_config.bnb_4bit_quant_storage.element_size()
Contributor:

I meant to respond to the other one earlier, but it may be good to have an if/else to know which version to call? E.g. param.numel() * 2 * elem_size, where elem_size is defined earlier based on the torch version (see the sketch below).
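
A minimal sketch of that suggestion (the quant_storage and elem_size names follow the thread; the empty-tensor fallback is an assumption, not the PR's final code):

import torch

quant_storage = torch.uint8  # stands in for bnb_4bit_quant_storage (a torch.dtype)

if hasattr(quant_storage, "itemsize"):
    # torch.dtype.itemsize exists on torch >= 2.1
    elem_size = quant_storage.itemsize
else:
    # older torch: measure an empty tensor of that dtype instead
    elem_size = torch.tensor([], dtype=quant_storage).element_size()

# the real code would then compute param.numel() * 2 * elem_size
print(elem_size)  # 1 for uint8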


Correction: as @BenjaminBossan pointed out, this is a must for us to have.


Besides itemsize not being available in torch 2.0.1, I couldn't get element_size() to work either (it fails with an error that element_size is not an attribute of the dtype).

I put this 'helper' function at the top of the file to do it manually:

def get_dtype_size(dtype):
    if dtype == torch.float32:
        return 4
    elif dtype == torch.float64:
        return 8
    elif dtype == torch.float16:
        return 2
    elif dtype == torch.uint8:
        return 1
    elif dtype == torch.int8:
        return 1
    elif dtype == torch.int16:
        return 2
    elif dtype == torch.int32:
        return 4
    elif dtype == torch.int64:
        return 8
    else:
        raise ValueError("Unsupported dtype")

and then used this in place of the itemsize code:

if is_loaded_in_4bit and isinstance(param, bnb.nn.Params4bit):
    quant_storage = self.hf_quantizer.quantization_config.bnb_4bit_quant_storage
    nb_params = get_dtype_size(quant_storage)
    total_numel.append(param.numel() * 2 * nb_params)
else:
    total_numel.append(param.numel())

I've only tested it by using the qlora config on tiny-llama, and at least for this case it works:

accelerate launch -m axolotl.cli.train examples/tiny-llama/qlora.yml

I am using the image 'winglian/axolotl-base:main-base-py3.11-cu121-2.2.1' (which actually uses torch 2.0.1, not 2.2.1 as the name implies). Ubuntu 22.04 / laptop RTX 4090 (16 GB).


Thank you soooo much!!!

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@BenjaminBossan (Member)

Note that this PR fails on PEFT CI.

@winglian (Contributor, Author) commented Apr 9, 2024

my last change isn't correct either. I'll take a look at this tomorrow.

@younesbelkada (Contributor)

Hi @winglian, as pointed out by @hiyouga my fix did not cover all the cases. Would you be happy to rebase your PR on main so we can merge it?

# For compatibility with older PT version - see: https://github.com/huggingface/peft/pull/1635
nb_params = (
    quant_storage.itemsize if hasattr(quant_storage, "itemsize") else quant_storage.element_size()
)
if hasattr(param, "element_size"):
Contributor:

Why not simply use:

quant_storage = self.hf_quantizer.quantization_config.bnb_4bit_quant_storage
num_bytes = quant_storage.element_size()

element_size seems present from 1.9.1: https://pytorch.org/docs/1.9.1/search.html?q=element_size&check_keywords=yes&area=default to latest: https://pytorch.org/docs/2.2/search.html?q=element_size&check_keywords=yes&area=default


@hiyouga (Contributor) commented Apr 22, 2024

self.hf_quantizer.quantization_config.bnb_4bit_quant_storage is a torch.dtype instance, while only torch.Tensor has element_size().

huggingface/peft#1635
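
A small repro of that distinction (a sketch assuming the torch 2.0 behaviour reported in the linked issue; the uint8 storage dtype is illustrative):

import torch

quant_storage = torch.uint8                # a torch.dtype, like bnb_4bit_quant_storage
t = torch.tensor([], dtype=quant_storage)  # a torch.Tensor with that dtype

print(t.element_size())                    # 1 -- Tensor.element_size() works on old torch too
# quant_storage.element_size()             # AttributeError on torch 2.0: the dtype has no element_size
# quant_storage.itemsize                   # only available on torch >= 2.1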

@younesbelkada (Contributor) left a comment

Thanks for taking care of the backward compatibility for previous torch versions!

@amyeroberts (Collaborator) left a comment

Thanks for handling this fix!

Just a small nit

src/transformers/modeling_utils.py (review comment outdated, resolved)
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
@younesbelkada younesbelkada merged commit 57fc00f into huggingface:main Apr 23, 2024
21 checks passed

Successfully merging this pull request may close these issues.

AttributeError: 'torch.dtype' object has no attribute 'element_size'
9 participants