
benchmark json schema #2030

Merged: 12 commits, Nov 15, 2024
Conversation

DarkSharpness
Contributor

Motivation

Modifications

Add a simple benchmark for JSON-schema-constrained decoding, based on the NousResearch/json-mode-eval dataset.
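The core loop of such a benchmark can be sketched as follows. This is a minimal stand-alone sketch, not the PR's actual `bench_sglang.py`: `generate_with_schema` is a hypothetical placeholder for the real constrained-decoding call, and the real script counts output tokens rather than characters.

```python
import json
import time

def generate_with_schema(prompt: str, schema: dict) -> str:
    # Hypothetical stand-in for the model call; the real benchmark issues
    # an sglang request whose output is constrained by the JSON schema.
    return json.dumps({k: "example" for k in schema.get("properties", {})})

def run_benchmark(samples):
    """Time schema-constrained generation over (prompt, schema) pairs."""
    start = time.perf_counter()
    total_output_chars = 0  # the real benchmark would count tokens instead
    for prompt, schema in samples:
        out = generate_with_schema(prompt, schema)
        json.loads(out)  # sanity check: output must be valid JSON
        total_output_chars += len(out)
    latency = time.perf_counter() - start
    return latency, total_output_chars

samples = [
    ("Describe a user.", {"properties": {"name": {}, "age": {}}}),
    ("Describe a city.", {"properties": {"city": {}}}),
]
latency, total = run_benchmark(samples)
print(f"latency={latency:.3f}s, output_chars={total}")
```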

Checklist

  • Format your code according to the Contributor Guide.
  • Add unit tests as outlined in the Contributor Guide.
  • Update documentation as needed, including docstrings or example tutorials.

@merrymercy
Contributor

Please fix the lint error and paste some benchmark results here. Then we can merge this!

@DarkSharpness
Contributor Author

> Please fix the lint error and paste some benchmark results here. Then we can merge this!

Sure! Here are some experimental results from my machine:

Configuration: AMD EPYC 7302 16-Core Processor + NVIDIA A100 40G

| Settings | Latency (s) | Overall output tokens |
| --- | --- | --- |
| outlines + disk_cache + jump_forward | 59.788 | 5580 |
| outlines + disk_cache - jump_forward | 57.645 | 5081 |
| outlines - disk_cache + jump_forward | 456.94 | 5580 |
| outlines - disk_cache - jump_forward | 451.79 | 5081 |
  1. The overall output tokens should be identical in all cases, but due to this bug the actual outputs differ slightly.
  2. `disk_cache` means the compiled regex/JSON schema is already in the disk cache before the run, so it does not need to be recompiled at runtime (which yields a significant speedup).
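The disk-cache idea can be illustrated with a small self-contained sketch. This is not outlines' actual cache implementation; `compile_schema_to_regex` is a hypothetical placeholder for the expensive compilation step, and a fresh temp directory stands in for the persistent on-disk cache.

```python
import hashlib
import json
import os
import pickle
import tempfile

# Fresh cache directory for this demo; a real cache would use a fixed path
# so compiled artifacts survive across runs.
CACHE_DIR = tempfile.mkdtemp(prefix="schema_cache_demo_")

def disk_cached(fn):
    """Memoize fn's result on disk, keyed by a hash of its JSON-serializable argument."""
    def wrapper(arg):
        key = hashlib.sha256(json.dumps(arg, sort_keys=True).encode()).hexdigest()
        path = os.path.join(CACHE_DIR, key + ".pkl")
        if os.path.exists(path):        # cache hit: skip recompilation
            with open(path, "rb") as f:
                return pickle.load(f)
        result = fn(arg)                # cache miss: compile once...
        with open(path, "wb") as f:     # ...and persist for later calls
            pickle.dump(result, f)
        return result
    return wrapper

calls = []

@disk_cached
def compile_schema_to_regex(schema):
    calls.append(schema)                # track how often we actually "compile"
    return "<compiled regex for %d properties>" % len(schema.get("properties", {}))

schema = {"properties": {"name": {}, "age": {}}}
first = compile_schema_to_regex(schema)
second = compile_schema_to_regex(schema)  # served from disk, no recompilation
print(first == second, len(calls))        # → True 1
```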

@zhyncs
Member

zhyncs commented Nov 15, 2024

> disk_cache means that the compiled regex/json_schema is already in disk cache before running, so we don't need to recompile that when running (which results in a significant speed up).

This is basically done in all current implementations, including TensorRT-LLM, and disk_cache should be enabled by default.

@zhyncs
Member

zhyncs commented Nov 15, 2024

BTW, how about this: https://github.com/guidance-ai/llgtrt?
It can reduce the first-time compilation overhead.

@DarkSharpness
Contributor Author

> BTW how about this https://github.com/guidance-ai/llgtrt It can reduce the first time compilation overhead.

Thanks for the suggestion!

Currently, we plan to support xgrammar as a faster alternative grammar backend.

Maybe we can give llgtrt a try in future PRs.

@merrymercy
Contributor

@zhyncs disk cache is turned on by default. xgrammar will solve the preprocessing slowness.

@merrymercy merrymercy merged commit 954f4e6 into sgl-project:main Nov 15, 2024
1 check passed