Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add epsilon and eta sampling #475

Merged
merged 1 commit into from
Oct 14, 2023
Merged

Add epsilon and eta sampling #475

merged 1 commit into from
Oct 14, 2023

Conversation

StefanGliga
Copy link

I have implemented epsilon and eta sampling from https://arxiv.org/abs/2210.15191 .
In the UI I opted to expose it in units of 1e-4, to be similar to ooba.
Tested both pytorch/gpu and TPU(jax_static specifically) and everything seems to work fine.

@AlpinDale
Copy link

@henk717 this looks good to me, have you tested yet for a merge?

g4rg added a commit to g4rg/aphrodite-engine that referenced this pull request Oct 3, 2023
@henk717
Copy link
Owner

henk717 commented Oct 3, 2023

Have not been able to review it yet (See discord for details), its the first one on my list once I am able to review.

AlpinDale pushed a commit to aphrodite-engine/aphrodite-engine that referenced this pull request Oct 4, 2023
* KoboldAI endpoint

* KAI: fix loading, spec defaults, error handling

* ignore use_default_badwordsid

* prefix model name

* KAI: fix incorrect JSONResponse usage

* KAI: reduce schema

* KAI: implement kobolcpp style streaming

* KAI: use list comprehension

* KAI: lint

* KAI: add epilon+eta cutoff parameters according to henk717/KoboldAI#475
@henk717
Copy link
Owner

henk717 commented Oct 14, 2023

Can't reproduce the issue I had with it anymore, so ill merge it and hopefully we don't break userspace.

@henk717 henk717 merged commit 009c48a into henk717:united Oct 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants