[Bugfix] Guided decoding falls back to outlines when fails to import xgrammar #12976

terrytangyuan · 2025-02-09T04:03:42Z

This fixes the issue when xgrammar module cannot be imported successfully for some reason, e.g. triton is not available. This fallback allows users to still use guided decoding when xgrammar cannot be used.

  File "/usr/local/lib/python3.10/dist-packages/vllm/engine/multiprocessing/client.py", line 606, in _process_request
    params = await \
  File "/usr/local/lib/python3.10/dist-packages/vllm/engine/async_llm_engine.py", line 553, in build_guided_decoding_logits_processor_async
    processor = await get_guided_decoding_logits_processor(
  File "/usr/local/lib/python3.10/dist-packages/vllm/model_executor/guided_decoding/__init__.py", line 109, in get_guided_decoding_logits_processor
    return get_local_xgrammar_guided_decoding_logits_processor(
  File "/usr/local/lib/python3.10/dist-packages/vllm/model_executor/guided_decoding/xgrammar_decoding.py", line 38, in get_local_xgrammar_guided_decoding_logits_processor
    config = GrammarConfig.from_guided_params(guided_params=guided_params,
  File "/usr/local/lib/python3.10/dist-packages/vllm/model_executor/guided_decoding/xgrammar_decoding.py", line 174, in from_guided_params
    tokenizer_data = TokenizerDataCache.get_tokenizer_data(tokenizer)
  File "/usr/local/lib/python3.10/dist-packages/vllm/model_executor/guided_decoding/xgrammar_decoding.py", line 87, in get_tokenizer_data
    vocab_type = xgr.VocabType.RAW
NameError: name 'xgr' is not defined

…xgrammar Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

github-actions · 2025-02-09T04:03:53Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

terrytangyuan · 2025-02-09T04:04:53Z

This is easier to review when hiding the whitespaces: https://github.com/vllm-project/vllm/pull/12976/files?diff=unified&w=1

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

russellb

just some minor style feedback. thanks for the fix!

vllm/model_executor/guided_decoding/__init__.py

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

russellb

lgtm, thanks!

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

terrytangyuan · 2025-02-11T01:43:01Z

Ci failures seem unrelated: requests.exceptions.HTTPError: 504 Server Error: Gateway Time-out for url: https://huggingface.co/api/models/openai-community/gpt2/tree/main?recursive=True&expand=False

@mgoin Would you like to merge this manually?

…xgrammar (vllm-project#12976) Signed-off-by: Yuan Tang <terrytangyuan@gmail.com> Signed-off-by: SzymonOzog <szymon.ozog@aleph-alpha.com>

…xgrammar (vllm-project#12976) Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

[Bugfix] Guided decoding falls back to outlines when fails to import …

3a6a65e

…xgrammar Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

terrytangyuan requested a review from mgoin as a code owner February 9, 2025 04:03

mergify bot added the structured-output label Feb 9, 2025

terrytangyuan added 2 commits February 8, 2025 23:05

fix line length

a69201d

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

Fix check

984b42a

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

russellb suggested changes Feb 10, 2025

View reviewed changes

vllm/model_executor/guided_decoding/__init__.py Outdated Show resolved Hide resolved

terrytangyuan added 2 commits February 10, 2025 11:49

Address commments

9d5b3e8

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

Remove noqa

e94708d

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

russellb approved these changes Feb 10, 2025

View reviewed changes

terrytangyuan added 2 commits February 10, 2025 12:29

Fix line length

cac0110

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

Fix line length again

563cf4a

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

mgoin approved these changes Feb 10, 2025

View reviewed changes

mgoin enabled auto-merge (squash) February 10, 2025 18:18

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 10, 2025

mgoin merged commit 14ecab5 into vllm-project:main Feb 11, 2025
43 of 47 checks passed

terrytangyuan deleted the xgrammar-import branch February 11, 2025 18:23

kwang1012 pushed a commit to kwang1012/vllm that referenced this pull request Feb 12, 2025

[Bugfix] Guided decoding falls back to outlines when fails to import …

8102cd8

…xgrammar (vllm-project#12976) Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

panf2333 pushed a commit to yottalabsai/vllm that referenced this pull request Feb 18, 2025

[Bugfix] Guided decoding falls back to outlines when fails to import …

11ba74f

…xgrammar (vllm-project#12976) Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

kerthcet pushed a commit to kerthcet/vllm that referenced this pull request Feb 21, 2025

[Bugfix] Guided decoding falls back to outlines when fails to import …

53caffb

…xgrammar (vllm-project#12976) Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bugfix] Guided decoding falls back to outlines when fails to import xgrammar #12976

[Bugfix] Guided decoding falls back to outlines when fails to import xgrammar #12976

terrytangyuan commented Feb 9, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Feb 9, 2025

terrytangyuan commented Feb 9, 2025

russellb left a comment

russellb left a comment

terrytangyuan commented Feb 11, 2025

[Bugfix] Guided decoding falls back to outlines when fails to import xgrammar #12976

[Bugfix] Guided decoding falls back to outlines when fails to import xgrammar #12976

Conversation

terrytangyuan commented Feb 9, 2025 • edited by github-actions bot Loading

github-actions bot commented Feb 9, 2025

terrytangyuan commented Feb 9, 2025

russellb left a comment

Choose a reason for hiding this comment

russellb left a comment

Choose a reason for hiding this comment

terrytangyuan commented Feb 11, 2025

terrytangyuan commented Feb 9, 2025 •

edited by github-actions bot

Loading