Feat: add alternative choices selection methods #835

AidanCooper · 2024-07-30T18:23:51Z

This will likely need refinement and optimisation, so consider this a proposal that I'd like to seek feedback on.

Motivation

SGLang's current choices normalisation method (token length normalised) often performs poorly due to bias towards longer-token options. This arises in cases where the later tokens of an option with many tokens are highly predictable based on its earlier tokens. This is the most succinct example I can come up with that illustrates the flaw, which will trip up even highly capable models:

@sgl.function
def example(s):
    s += sgl.user("What is the capital of France?")
    s += sgl.assistant(sgl.gen("answer", choices=["Paris", "Antidisestablishmentarianism"]))

This PR provides solutions to the above example, and should resolve #523, #608, and possibly other open issues.

Modification

This PR enables the choices normalisation methodology to be configurable, and alongside the existing token length normalised strategy, introduces two new alternatives:

Greedy token selection
Unconditional likelihood normalised, as per this link that @merrymercy shared with me

Both of these implementations probably need to be further refined. One potential issue I've noticed with greedy selection is that it if there are differences in the tokens prepended to the options for token healing purposes, then the selection will be based on this rather than the actual option, which doesn't seem right. It's outside the scope of this PR, but the token healing process in general seems to have an outsized impact on the choices selection.

Checklist

Ensure pre-commit pre-commit run --all-files or other linting tools are used to fix potential lint issues.
Confirm that modifications are covered by complete unit tests. If not, please add more unit tests for correctness.
Modify documentation as needed, such as docstrings or example tutorials.

max99x · 2024-07-30T23:25:32Z

Really cool to see progress on this issue, as it's a major blocker we're facing. The most effective solution I've found is to use the probability of some kind of end token to distinguish the priority of choices which are prefixes of other choices. It does require me to specify which suffix token(s) to take into account. E.g. if I'm generating a JSON string, I look for a double quote; if I ask the model to answer with only the choice and nothing else, I look for EOT, etc. Would be nice to be able to support that through this same API.

merrymercy · 2024-08-01T23:13:41Z

Could you resolve the conflicts? I will review it later this week.

AidanCooper · 2024-08-02T10:04:24Z

Could you resolve the conflicts? I will review it later this week.

Done — thanks!

zhyncs · 2024-08-02T11:03:39Z

Hi @AidanCooper I've fixed the CI issue with the fork. Could you merge the latest main branch? Thanks.

Ying1123

This looks great! Although you call it a proposal, I like the overall design. I left some minor comments. We can merge this after you resolve them.

python/sglang/lang/ir.py

python/sglang/lang/backend/runtime_endpoint.py

python/sglang/test/test_choices.py

AidanCooper · 2024-08-05T10:32:30Z

Thanks @Ying1123! The downside to resolving the default behaviour at the API layer is that we can't specify backend-dependent values, but it's workable in this instance.

Thanks for merging this. I think it's possible that the new selection algorithms could be further optimised for real-world use with some further tweaking, but that will be easier done in follow-on PRs.

zhyncs requested review from Ying1123, merrymercy, zhyncs and hnyls2002 July 30, 2024 18:26

AidanCooper force-pushed the main branch from 2310247 to bc414d0 Compare July 30, 2024 18:55

Ying1123 assigned merrymercy Aug 1, 2024

AidanCooper force-pushed the main branch from 6d6cee6 to 506391c Compare August 2, 2024 10:03

AidanCooper force-pushed the main branch 4 times, most recently from 8ab9126 to 3a50179 Compare August 2, 2024 14:15

Ying1123 reviewed Aug 4, 2024

View reviewed changes

python/sglang/lang/ir.py Outdated Show resolved Hide resolved

python/sglang/lang/backend/runtime_endpoint.py Outdated Show resolved Hide resolved

python/sglang/test/test_choices.py Outdated Show resolved Hide resolved

Ying1123 added the high priority label Aug 4, 2024

AidanCooper added 2 commits August 5, 2024 09:37

Add alternative choices selection methods

8612e56

Address requested changes

a518b34

AidanCooper force-pushed the main branch from 3a50179 to a518b34 Compare August 5, 2024 10:18

Ying1123 enabled auto-merge (squash) August 5, 2024 10:27

Ying1123 approved these changes Aug 5, 2024

View reviewed changes

Ying1123 merged commit 94e0115 into sgl-project:main Aug 5, 2024
3 checks passed

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: add alternative choices selection methods #835

Feat: add alternative choices selection methods #835

AidanCooper commented Jul 30, 2024

max99x commented Jul 30, 2024

merrymercy commented Aug 1, 2024

AidanCooper commented Aug 2, 2024

zhyncs commented Aug 2, 2024

Ying1123 left a comment

AidanCooper commented Aug 5, 2024

Feat: add alternative choices selection methods #835

Feat: add alternative choices selection methods #835

Conversation

AidanCooper commented Jul 30, 2024

Motivation

Modification

Checklist

max99x commented Jul 30, 2024

merrymercy commented Aug 1, 2024

AidanCooper commented Aug 2, 2024

zhyncs commented Aug 2, 2024

Ying1123 left a comment

Choose a reason for hiding this comment

AidanCooper commented Aug 5, 2024