DRY sampler improvements #6053

belladoreai · 2024-05-25T15:29:08Z

I was asked by @p-e-w to split some of the changes from #6047 into a separate PR here.

This PR contains the data type performance improvement for DRY and a minor fix to prevent a crash on large repetitive inputs.

Edit: This PR now also caps the maximum match length at 50.

See the main PR for more info.
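
To illustrate the match-length cap mentioned above, here is a minimal sketch. It is not the code from `modules/sampler_hijack.py`. It assumes the DRY penalty grows exponentially with match length, in the form `multiplier * base ** (match_length - allowed_length)`; the function names and the default values for `multiplier`, `base`, and `allowed_length` are hypothetical and chosen only for illustration.

```python
def capped_match_length(tokens, i, max_match_length=50):
    """Length of the backward match between the run ending at index i
    and the run ending at the last token, capped at max_match_length.

    Hypothetical helper for illustration; not the PR's implementation.
    """
    last = len(tokens) - 1
    if not 0 <= i < last:
        raise ValueError("i must index a token before the last one")
    length = 0
    # Walk backwards from both positions while the tokens agree.
    # The cap bounds the loop even on extremely repetitive input.
    while (
        length < max_match_length
        and i - length >= 0
        and tokens[i - length] == tokens[last - length]
    ):
        length += 1
    return length


def dry_penalty(match_length, multiplier=0.8, base=1.75, allowed_length=2):
    """Assumed exponential penalty; the default values are illustrative."""
    if match_length < allowed_length:
        return 0.0
    return multiplier * base ** (match_length - allowed_length)


# A highly repetitive input: the uncapped match would span 297 tokens.
repetitive = [1, 2, 3] * 100
capped = capped_match_length(repetitive, i=296)
penalty = dry_penalty(capped)
```

Under these assumptions, the cap keeps both the backward scan and the exponent bounded: with a cap of 50 the exponent never exceeds `50 - allowed_length`, however long the repeated run is.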

Checklist:

I have read the Contributing guidelines.

modules/sampler_hijack.py

p-e-w

LGTM modulo style nits.

modules/sampler_hijack.py

belladoreai · 2024-05-30T18:05:52Z

@oobabooga Ready for merge

p-e-w

Looks good now! Probably makes sense to merge this before DRY is merged into master.

Hunterius8 · 2024-05-31T15:53:15Z

I quickly compared this version to the previous two; it looks like it has gotten a little faster still.

The tokens per second only decrease by ~5% at a range of about 21500 tokens now, compared to the previous version's ~11%.

jojje · 2024-06-03T21:44:48Z

LGTM.

Seems @p-e-w is also good with it. What's your view @oobabooga ?

It is an improvement over the original. If further optimization turns out to be necessary, it can be done in separate PRs.

Vhallo · 2024-06-10T22:26:35Z

Good to see the performance issues being solved. Seems like it might be worthwhile to integrate this into Exllamav2 / TabbyAPI now as well?

oobabooga · 2024-06-13T02:27:07Z

Thanks for the reviews, merging now before merging DRY to the main branch.

belladoreai added 2 commits May 23, 2024 00:54

Improve DRY sampler performance 1 of 2 (simple data type changes).

7c03e4a

Fix DRY crash on very repetitive inputs.

3c51b95

belladoreai mentioned this pull request May 25, 2024

DRY sampler performance optimization #6047

Closed

1 task

p-e-w reviewed May 26, 2024

modules/sampler_hijack.py (outdated)

modules/sampler_hijack.py

modules/sampler_hijack.py (outdated)

belladoreai added 2 commits May 26, 2024 23:14

Rename variable.

62e76ed

Change DRY max match length.

918ff94

belladoreai requested a review from p-e-w May 27, 2024 13:38

p-e-w approved these changes May 28, 2024

modules/sampler_hijack.py (outdated)

modules/sampler_hijack.py (outdated)

Comment.

686e7a8

belladoreai requested a review from p-e-w May 28, 2024 09:39

belladoreai mentioned this pull request May 30, 2024

DRY: A modern repetition penalty that reliably prevents looping #5677

Merged

3 tasks

p-e-w approved these changes May 31, 2024

belladoreai mentioned this pull request Jun 3, 2024

Speed-up the DRY logits processor #6087

Closed

Vhallo mentioned this pull request Jun 10, 2024

Addition of DRY: A modern repetition penalty that reliably prevents looping turboderp/exllamav2#447

Closed

oobabooga added 2 commits June 12, 2024 19:36

Merge branch 'dev' into belladoreai-dev-dry-optimization2

ed43c5e

lint

b556ee2

oobabooga merged commit 3abafee into oobabooga:dev Jun 13, 2024

PoetOnTheRun pushed a commit to PoetOnTheRun/text-generation-webui that referenced this pull request Oct 22, 2024

DRY sampler improvements (oobabooga#6053)

834e2e9
