Change 8bit optimizer blocksize 2048->256; additional bf16 support #1365

matthewdouglas · 2024-09-19T00:00:51Z

This PR stacks on top of #1360 with an update to the blocksize for the 8bit blockwise optimizers. Additionally, bf16 support is added for RMSprop, Adagrad, and Momentum.

@TimDettmers @Titus-von-Koeller

github-actions · 2024-09-19T00:04:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

TimDettmers

This looks all good. The only thing that we might want to check is if the error boundaries in the tests are now improved. We want to keep the absolute and relative error tight, so that if we have a slight degradation, the tests fail again.

matthewdouglas · 2024-09-20T19:48:58Z

@TimDettmers I was able to tighten some of the tolerances.

…itsandbytes-foundation#1365) * Change 8bit optimizer blocksize 2048->256; additional bf16 support * Update tolerances for 8bit optimizer tests

matthewdouglas added 5 commits September 16, 2024 09:55

Add AdEMAMix optimizer

d8c4b39

Add PagedAdEMAMix32bit, AdEMAMix32bit

0911854

Add PagedAdEMAMix32bit, AdEMAMix32bit

a922dab

AdEMAMix: add support for alpha/beta3 scheduling

d4b92d1

Change 8bit optimizer blocksize 2048->256; additional bf16 support

c0cb4a3

matthewdouglas added the enhancement New feature or request label Sep 19, 2024

matthewdouglas requested a review from TimDettmers September 19, 2024 00:00

TimDettmers reviewed Sep 20, 2024

View reviewed changes

matthewdouglas added 3 commits September 20, 2024 14:54

Update paged AdEMAMix

1a3d3c6

Merge branch 'ademamix' into optim-blocksize-256

c2e7749

Update tolerances for 8bit optimizer tests

f8d206f

matthewdouglas changed the base branch from ademamix to main September 20, 2024 19:52

Fix merge conflicts

ccab70c

matthewdouglas merged commit aa57bd8 into main Sep 20, 2024
54 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change 8bit optimizer blocksize 2048->256; additional bf16 support #1365

Change 8bit optimizer blocksize 2048->256; additional bf16 support #1365

matthewdouglas commented Sep 19, 2024

github-actions bot commented Sep 19, 2024

TimDettmers left a comment

matthewdouglas commented Sep 20, 2024

Change 8bit optimizer blocksize 2048->256; additional bf16 support #1365

Change 8bit optimizer blocksize 2048->256; additional bf16 support #1365

Conversation

matthewdouglas commented Sep 19, 2024

github-actions bot commented Sep 19, 2024

TimDettmers left a comment

Choose a reason for hiding this comment

matthewdouglas commented Sep 20, 2024