[FIX] Update EOS from config #2475
base: main
Conversation
Per @remixer-dec, model Mistral-Nemo-Instruct-2407-Q5_K_L.gguf has this issue.
```diff
@@ -247,6 +247,10 @@ async def _tokenize_one_request(
         # Parse sampling parameters
         sampling_params = SamplingParams(**obj.sampling_params)
         sampling_params.normalize(self.tokenizer)
+        sampling_params.update_from_config(
```
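The diff is cut off mid-call. For orientation only, here is a hedged guess at what such a `SamplingParams.update_from_config` helper could look like; the argument, field names, and merging logic are assumptions, not the PR's actual code.

```python
from dataclasses import dataclass, field


@dataclass
class SamplingParamsSketch:
    # Assumed field: user-provided stop token ids for this request.
    stop_token_ids: set[int] = field(default_factory=set)

    def update_from_config(self, hf_config) -> None:
        """Hypothetical sketch: copy EOS ids from the model's hf_config onto
        the per-request sampling params so the later stop check can see them."""
        eos = getattr(hf_config, "eos_token_id", None)
        if eos is None:
            return
        eos_ids = {eos} if isinstance(eos, int) else set(eos)
        self.stop_token_ids |= eos_ids
```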
Revert the changes in tokenizer_manager.py and sampling_params.py. Move all related changes to schedule_batch.py.
Reason: sampling_params is used to handle per-request configs, but the eos_token_ids from hf_config are the same for all requests, so we do not need to attach them to sampling_params.
To reduce the overhead, we should pre-process the hf_config and add some more conditions in check_finished.
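A minimal sketch of that pre-processing, done once when the model is loaded rather than per request; the helper name and the assumed shapes of hf_config.eos_token_id (absent, a single int, or a list of ints) are assumptions, not the repository's actual API.

```python
# Hypothetical one-time helper: normalize hf_config.eos_token_id into a set at
# startup, so that check_finished only does a cheap membership test per token.
def collect_config_eos_ids(hf_config) -> set[int]:
    eos = getattr(hf_config, "eos_token_id", None)
    if eos is None:
        return set()
    if isinstance(eos, int):
        return {eos}
    return set(eos)
```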
Add an input argument to check_finished, as Req cannot acquire the model info.
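A rough illustration of what that argument could look like; the class, field names, and finish-reason value are placeholders, since the real Req and check_finished live in schedule_batch.py.

```python
from dataclasses import dataclass


@dataclass
class ReqSketch:
    """Toy stand-in for Req; field names are assumptions."""
    output_ids: list[int]
    ignore_eos: bool = False
    finished_reason: str | None = None

    def check_finished(self, config_eos_ids: set[int]) -> None:
        # New argument: Req cannot see the model config, so the scheduler
        # passes the pre-computed EOS ids in at call time.
        if self.finished_reason is not None:
            return
        if not self.ignore_eos and self.output_ids[-1] in config_eos_ids:
            self.finished_reason = "eos"  # placeholder for the repo's finish-reason type
```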
Motivation
Correct the EOS handling: the default tokenizer eos_token_id may differ from the model config eos_token_id.
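For context, a small check with the public transformers API showing how the two values can disagree; the Hub id below is an assumption based on the GGUF name mentioned earlier in the thread, and gated models may additionally require an access token.

```python
from transformers import AutoConfig, AutoTokenizer

# Assumed Hub id for the quantized model named above.
model_id = "mistralai/Mistral-Nemo-Instruct-2407"

tokenizer = AutoTokenizer.from_pretrained(model_id)
config = AutoConfig.from_pretrained(model_id)

# If these two ids differ, stopping only on the tokenizer's EOS can let the
# model run past its real stop token, which is what this PR tries to fix.
print("tokenizer eos_token_id:", tokenizer.eos_token_id)
print("config    eos_token_id:", config.eos_token_id)
```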
Modifications
Checklist