Add Llama2 inference benchmark under a new "benchmarks" section #435

dacorvo · 2024-01-23T15:50:11Z

This is basically the content of the benchmark paragraph in the corresponding blog post.

I have also added the scripts to run the benchmark and generate the images.

HuggingFaceDocBuilderDev · 2024-01-23T16:01:28Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

philschmid

Great idea to have a separate section for benchmarks!

philschmid · 2024-01-24T09:01:14Z

benchmark/text-generation/gen_barcharts.py

Can we include the instance type/number of neuron cores in the title?

There is no guarantee that all models use the same number of cores. The core count is saved in the JSON results file for each model, though, so we could eventually use it.
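As a rough illustration of that idea, the sketch below builds a chart title that mentions the core count only when every model reports the same value, and otherwise falls back to per-model labels. It is not taken from `gen_barcharts.py`: the `num_cores` key and the function names are hypothetical, and the real results files may store this value under a different name.

```python
def chart_title(metric, results, cores_key="num_cores"):
    """Return a title that includes the core count only if it is unambiguous.

    `results` maps a model name to its parsed JSON results dict.
    `cores_key` is a hypothetical field name, not confirmed by the PR.
    """
    cores = {r.get(cores_key) for r in results.values()}
    cores.discard(None)  # ignore results that do not record a core count
    if len(cores) == 1:
        return f"{metric} ({cores.pop()} neuron cores)"
    # Models disagree (or nothing is recorded): keep the title generic.
    return metric


def series_label(name, result, cores_key="num_cores"):
    """Return a per-model label carrying that model's own core count, if any."""
    n = result.get(cores_key)
    return f"{name} ({n} cores)" if n is not None else name
```

With this split, a mixed set of results keeps a generic title while each bar can still carry its own core count through `series_label`.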

docs/source/_toctree.yml

docs/source/benchmarks/inferentia-llama2.mdx

Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>

dacorvo force-pushed the benchmark branch from 09eec0a to 2dcdb5c Compare January 23, 2024 15:57

dacorvo force-pushed the benchmark branch from 2dcdb5c to 05d06de Compare January 23, 2024 16:08

dacorvo marked this pull request as ready for review January 23, 2024 16:10

dacorvo requested review from philschmid, michaelbenayoun and JingyaHuang January 23, 2024 16:10

dacorvo added 2 commits January 24, 2024 09:00

chore: added text-generation benchmark scripts

f88d555

doc: added llama2 benchmark

24ae113

dacorvo force-pushed the benchmark branch from 05d06de to 24ae113 Compare January 24, 2024 09:01

philschmid approved these changes Jan 24, 2024

Apply suggestions from code review

77c9aa9

Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>

dacorvo merged commit 2709183 into main Jan 24, 2024
1 check passed

dacorvo deleted the benchmark branch January 24, 2024 15:16
