Include queue time in benchmark report #2854
Merged
Description
This PR adds queue time to the benchmark report.
Tests run:
Ran the benchmark locally with resnet50.yaml, using batch sizes 1, 2, 4, 8, and 16 and concurrency 4. 10000 requests were used for inference. The AB report and metrics can be found in this tar file: resnet_reports.tar.gz
As the reports show, the P50, P90, and P99 of the queue times are included in the ab_report.csv and stats_metrics.json files.
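
For readers unfamiliar with the percentile figures, here is a minimal sketch of how P50, P90, and P99 could be derived from a list of per-request queue times. It is not the code from this PR: the function names, the `queue_time_p*` keys, and the nearest-rank method are all assumptions made for illustration, and the benchmark tooling may compute or name these values differently.

```python
import math


def percentile(samples, pct):
    """Return the nearest-rank percentile of `samples` (pct in 1..100)."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Nearest-rank: smallest value with at least pct% of samples at or below it.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def summarize_queue_times(queue_times_ms):
    """Build a dict of P50/P90/P99 queue times (key names are hypothetical)."""
    return {
        f"queue_time_p{p}": percentile(queue_times_ms, p)
        for p in (50, 90, 99)
    }


if __name__ == "__main__":
    # Made-up queue times in milliseconds, for illustration only.
    example = [0.4, 0.6, 0.5, 1.2, 0.7, 3.1, 0.5, 0.9, 0.6, 0.8]
    print(summarize_queue_times(example))
```

Nearest-rank is used here because it always returns an observed sample; an interpolating method would give slightly different values on small inputs.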