-
-
Notifications
You must be signed in to change notification settings - Fork 5.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DRY sampler improvements #6053
DRY sampler improvements #6053
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM modulo style nits.
@oobabooga Ready for merge |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good now! Probably makes sense to merge this before DRY is merged into master.
LGTM. Seems @p-e-w is also good with it. What's your view @oobabooga ? It is an improvement over the original. If further optimization turns out to be necessary, such can be done in separate PRs. |
Good to see the performance issues being solved. Seems like it might be worthwhile to integrate this into Exllamav2 / TabbyAPI now as well? |
Thanks for the reviews, merging now before merging DRY to the main branch. |
I was asked by @p-e-w to split some of the changes from #6047 into a separate PR here.
This PR contains the data type performance improvement for DRY, and a minor fix to prevent crash on large repetitive inputs.
Edit: now also contains change to cap the max match length to 50
See the main PR for more info.
Checklist: