
Bitnet 1.58 prework, POC, and staging #281

Open
2 of 6 tasks
CoffeeVampir3 opened this issue May 26, 2024 · 2 comments
CoffeeVampir3 commented May 26, 2024

Bitnet 1.58 Groundwork

After some talks with Saroufim and the CUDA MODE team working on BitNet, we've outlined a strategy for implementing the BitNet 1.58 method in torch. This issue lays the groundwork for 2-bit ternary tensor quantization and the BitNet linear work for BitNet 1.58.
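As a rough sketch of what the ternary quantization step can look like, here is a minimal absmean-style quantizer in the spirit of the BitNet 1.58 paper. The function names and the per-tensor scaling choice are illustrative assumptions, not the staging repo's actual API:

```python
import torch

def quantize_ternary_absmean(w: torch.Tensor, eps: float = 1e-5):
    """Quantize a float weight tensor to {-1, 0, +1} with a per-tensor scale.

    Absmean scheme: divide by the mean absolute value of the tensor,
    then round and clamp each entry into [-1, 1].
    """
    scale = w.abs().mean().clamp(min=eps)
    q = (w / scale).round().clamp(-1, 1)
    return q, scale

def dequantize_ternary(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    # Recover an approximation of the original weights.
    return q * scale

w = torch.randn(4, 8)
q, s = quantize_ternary_absmean(w)
```

The resulting `q` holds only the values -1, 0, and +1, which is what makes the 2-bit packed representation possible downstream.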

I've set up a staging repo (Staging) with a number of items:

  • To-the-point minimal lib
  • Training notebook for creating a full model, up to the point where we quantize and pack
  • Cleaned up minimal training example for running as a script
  • Example of the compiled kernel

This covers the initial groundwork for getting working ternary networks into torch.

  • Example Quantization Method
  • POC layer quantization
  • Runnable example model with quantized layers (In progress Dtype and Runnable Model)
  • AO dtype
  • AO layer type (?) for bitnet linear
  • Runnable example model with full dtype + bitnet linear layer, shippable
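Once weights are ternary, the pack step in the roadmap above can be sketched roughly as follows. This is an illustrative assumption, not the staging repo's actual packing: each value takes 2 bits using the offset encoding -1 → 0, 0 → 1, +1 → 2 (3 unused), so four weights fit in one uint8:

```python
import torch

def pack_ternary_2bit(q: torch.Tensor) -> torch.Tensor:
    """Pack a flat {-1, 0, +1} integer tensor into one uint8 per 4 values."""
    assert q.numel() % 4 == 0, "pad to a multiple of 4 before packing"
    # Shift {-1, 0, +1} to {0, 1, 2} so each value fits in 2 unsigned bits.
    u = (q.to(torch.int16) + 1).to(torch.uint8).view(-1, 4)
    return u[:, 0] | (u[:, 1] << 2) | (u[:, 2] << 4) | (u[:, 3] << 6)

def unpack_ternary_2bit(packed: torch.Tensor) -> torch.Tensor:
    """Inverse of pack_ternary_2bit; returns a flat int8 ternary tensor."""
    shifts = torch.tensor([0, 2, 4, 6], dtype=torch.uint8)
    u = (packed.unsqueeze(-1) >> shifts) & 0x3
    return u.to(torch.int8).flatten() - 1
```

This halves storage relative to a naive int8 ternary tensor; a real AO dtype would additionally carry the scale and handle padding and layout.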

msaroufim commented May 26, 2024

Very cool! Thank you for writing up such a clear plan. We can start merging the bit-packing logic and the layer quantization, so feel free to send a PR whenever you're ready. This very much follows a playbook similar to the one @gau-nernst followed for fp6.

Related work

CoffeeVampir3 (Contributor, Author) commented

👍 #285. I've kept this separate from Andreas' commit for now; it encapsulates only the working bits.
