benchmark json schema #2030
Please fix the lint error and paste some benchmark results here. Then we can merge this!
Sure! Here are some experiment results from my machine. Configuration:
This is basically done in all current implementations, including TensorRT LLM, and the
BTW, how about this: https://github.com/guidance-ai/llgtrt
Thanks for the suggestion! Currently, we plan to support xgrammar as a faster alternative grammar backend. Maybe we can give llgtrt a try in a future PR.
@zhyncs The disk cache is turned on by default. xgrammar will solve the preprocessing slowness.
Motivation
Modifications
Add a simple benchmark for JSON-schema-constrained decoding, based on the NousResearch/json-mode-eval dataset.
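For readers who want a feel for what such a benchmark measures, here is a minimal sketch. It is not the script added in this PR. The record fields `prompt` and `schema`, the `generate` callback, and the `fake_generate` stub are all assumptions made for illustration. The validity check is also deliberately simplified: it only verifies that the output parses as a JSON object and contains the schema's `required` keys.

```python
import json
import time
from typing import Any, Callable, Dict, List


def run_benchmark(
    records: List[Dict[str, Any]],
    generate: Callable[[Any, Dict[str, Any]], str],
) -> Dict[str, float]:
    """Time each constrained generation and count outputs that look schema-valid.

    `records` is assumed to hold a "prompt" and a "schema" (JSON string or dict)
    per entry; `generate` is a hypothetical stand-in for the backend call.
    """
    latencies = []
    valid = 0
    for rec in records:
        schema = rec["schema"]
        if isinstance(schema, str):
            schema = json.loads(schema)

        start = time.perf_counter()
        output = generate(rec["prompt"], schema)
        latencies.append(time.perf_counter() - start)

        # Simplified check: parseable JSON object with all required keys present.
        try:
            parsed = json.loads(output)
        except json.JSONDecodeError:
            continue
        required = schema.get("required", [])
        if isinstance(parsed, dict) and all(key in parsed for key in required):
            valid += 1

    n = len(records)
    return {
        "num_requests": n,
        "valid_ratio": valid / n if n else 0.0,
        "mean_latency_s": sum(latencies) / n if n else 0.0,
    }


def fake_generate(prompt: Any, schema: Dict[str, Any]) -> str:
    """Hypothetical backend stub: valid output for one prompt, broken JSON otherwise."""
    if prompt == "good":
        return json.dumps({key: "x" for key in schema.get("required", [])})
    return "{not valid json"


sample_records = [
    {"prompt": "good", "schema": '{"type": "object", "required": ["name"]}'},
    {"prompt": "bad", "schema": {"type": "object", "required": ["name"]}},
]
result = run_benchmark(sample_records, fake_generate)
```

In a real run, `generate` would wrap a request to the serving backend with the schema passed as the decoding constraint, and a full validator would replace the required-keys check.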
Checklist