Bump Inference APIs to Neuron 2.13 #206
Conversation
JingyaHuang commented Aug 29, 2023 (edited)
- Unblock VAE encoder
- Remove optimized cross-attention score workaround
- Unblock dynamic batching test
- Update documentation (see the export sketch below)
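
For context, a minimal sketch of the Python-side export flow these changes feed into, assuming optimum-neuron's `NeuronStableDiffusionPipeline` interface (the class and argument names follow the library's documented API and are assumptions here, not taken from this PR):

```python
# Sketch only: compile a Stable Diffusion checkpoint for Neuron and run it.
# Input shapes are fixed at export time, matching the CLI snippet below.
from optimum.neuron import NeuronStableDiffusionPipeline

pipe = NeuronStableDiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1-base",
    export=True,   # compile on the fly instead of loading precompiled artifacts
    batch_size=1,  # static input shapes are required for Neuron compilation
    height=512,
    width=512,
)
pipe.save_pretrained("sd_neuron/")  # reuse the compiled artifacts on later runs
image = pipe("a photo of an astronaut riding a horse").images[0]
```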
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
LGTM!
@@ -226,6 +231,7 @@ optimum-cli export neuron --model stabilityai/stable-diffusion-2-1-base \
   --batch_size 1 \
   --height 512 `# height in pixels of generated image, eg. 512, 768` \
   --width 512 `# width in pixels of generated image, eg. 512, 768` \
+  --num_image_per_prompt 4 `# number of images to generate per prompt, defaults to 1` \
is this needed with dynamic batching support now?
If we turn on dynamic batching, we can input any batch size / number of images per prompt, but dynamic batching is turned off by default, so I think we can keep the snippet this way.
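
To make the distinction concrete, here is a sketch of what enabling dynamic batching looks like at export time, assuming optimum-neuron's `dynamic_batch_size` flag (treat the exact names as assumptions based on the discussion above):

```python
# Sketch: with dynamic batching enabled at compile time, the model accepts
# inputs whose batch size is a multiple of the compilation batch size, so
# the number of images per prompt no longer has to be fixed in advance.
# It is off by default, which is why the CLI snippet above keeps the
# --num_image_per_prompt option.
from optimum.neuron import NeuronStableDiffusionPipeline

pipe = NeuronStableDiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1-base",
    export=True,
    dynamic_batch_size=True,  # assumed flag; off by default per the comment above
    batch_size=1,
    height=512,
    width=512,
)
# Any multiple of the compiled batch size now works at inference time:
images = pipe("a photo of an astronaut", num_images_per_prompt=4).images
```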
LGTM