Bitpacking #291
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/291
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit b01550f with merge base 5485929: This comment was automatically generated by Dr. CI and updates every 15 minutes.
Inputs:
data: torch.Tensor - a tensor of unpacked elements of a small dtype.
container_size: int - the size of the large dtype in bits.
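For illustration, here is a minimal sketch of a packing routine with these inputs (the name pack, the element_size parameter, and the uint8-only container are assumptions for this sketch, not necessarily what the PR implements):

```python
import torch

def pack(data: torch.Tensor, container_size: int, element_size: int) -> torch.Tensor:
    # Sketch only: assumes container_size == 8, i.e. a uint8 container.
    # Element i of each group lands at bit offset i * element_size.
    elems_per_container = container_size // element_size
    data = data.reshape(-1, elems_per_container).to(torch.uint8)
    mask = (1 << element_size) - 1
    packed = torch.zeros(data.shape[0], dtype=torch.uint8)
    for i in range(elems_per_container):
        packed |= (data[:, i] & mask) << (i * element_size)
    return packed
```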
Just curious: container_size can be determined from data.dtype, right? e.g. uint8 -> 8, uint16 -> 16 (see also torch.iinfo: https://pytorch.org/docs/stable/type_info.html#torch.torch.iinfo). Also, is it assumed that data.dtype has container_size bits? What if data uses a larger or smaller bit-width than container_size, e.g. int4 values stored in int32, then a request to pack to int8? Depending on your assumptions about the inputs, some kind of type checking and/or type casting would be good.
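One way to implement the suggested check (check_container is a hypothetical helper; torch.iinfo is the real API):

```python
import torch

def check_container(data: torch.Tensor, container_size: int) -> None:
    # torch.iinfo reports the bit width of an integer dtype,
    # e.g. torch.uint8 -> 8, torch.int32 -> 32.
    bits = torch.iinfo(data.dtype).bits
    if bits != container_size:
        raise TypeError(
            f"data is stored in a {bits}-bit dtype but container_size={container_size}"
        )

check_container(torch.zeros(4, dtype=torch.uint8), 8)    # passes
# check_container(torch.zeros(4, dtype=torch.int32), 8)  # would raise TypeError
```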
Nice first step! Let's keep iterating on CUDA MODE to figure out how to promote this to a stable feature.
Based on this issue: #284
Adding this first iteration of packing/unpacking algorithms to support lower-bit dtypes, placed in prototype/.
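To make the round trip concrete, here is a sketch of the inverse operation plus a usage example, building on the hypothetical pack above (unpack is likewise an assumed name, not the PR's API):

```python
import torch

def unpack(packed: torch.Tensor, element_size: int) -> torch.Tensor:
    # Inverse of the pack sketch: extract each element_size-bit field
    # from the uint8 containers and restore the original flat order.
    elems_per_container = 8 // element_size
    mask = (1 << element_size) - 1
    fields = [(packed >> (i * element_size)) & mask for i in range(elems_per_container)]
    return torch.stack(fields, dim=-1).reshape(-1)

values = torch.randint(0, 16, (8,), dtype=torch.uint8)   # 4-bit values in a uint8 tensor
packed = pack(values, container_size=8, element_size=4)  # 4 bytes, two values each
assert torch.equal(unpack(packed, element_size=4), values)
```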