Large model inference #2215
Conversation
Codecov Report
```diff
@@            Coverage Diff             @@
##           master    #2215      +/-   ##
==========================================
- Coverage   71.47%   70.31%   -1.17%
==========================================
  Files          73       75       +2
  Lines        3341     3392      +51
  Branches       57       57
==========================================
- Hits         2388     2385       -3
- Misses        950     1004      +54
  Partials        3        3
```
... and 1 file with indirect coverage changes
I added comments for Huggingface_pippy. They also apply to Huggingface_accelerate and Deepspeed_mii.
docs/large_model_inference.md
Outdated

> This document explains how TorchServe supports large model serving. Here, a large model refers to a model that cannot fit into one GPU, so it must be split into multiple partitions across multiple GPUs.
>
> ## PiPPY (PyTorch Native solution for large model inference)
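The partitioning idea the doc describes can be illustrated with a minimal, naive sketch in plain PyTorch. This is not PiPPy's actual API; the `partition` and `forward` helpers below are hypothetical, and the sketch uses CPU devices so it runs anywhere (in practice the stages would be placed on separate GPUs, e.g. `"cuda:0"` and `"cuda:1"`).

```python
# Naive pipeline partitioning sketch (NOT PiPPy's API): a model too large
# for one device is split into contiguous stages, each placed on its own
# device, and activations are moved between devices at stage boundaries.
import torch
import torch.nn as nn


def partition(model: nn.Sequential, devices):
    """Split a Sequential model into len(devices) contiguous stages."""
    layers = list(model.children())
    per_stage = (len(layers) + len(devices) - 1) // len(devices)
    stages = []
    for i, dev in enumerate(devices):
        chunk = layers[i * per_stage:(i + 1) * per_stage]
        stages.append(nn.Sequential(*chunk).to(dev))
    return stages


def forward(stages, devices, x):
    # Move the activation to each stage's device before running it.
    for stage, dev in zip(stages, devices):
        x = stage(x.to(dev))
    return x


model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 4))
# Two CPU "devices" keep the sketch runnable without GPUs.
devices = ["cpu", "cpu"]
stages = partition(model, devices)
out = forward(stages, devices, torch.randn(2, 8))
print(out.shape)  # torch.Size([2, 4])
```

PiPPy automates this splitting and additionally overlaps micro-batches across stages, which this sequential sketch does not do.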
Could we move this readme under the Huggingface_pippy directory, since this doc is about PiPPy?
Can we add common instructions/steps in this doc?
### Step 0: Install torchserve from src
```bash
python ts_scripts/install_from_src.py
```
yes
LGTM, please just make sure lint jobs are green and we can merge
Thanks @msaroufim, sure, I will address the failing spell checks and lints
Description
Adds PiPPy large model inference with a Hugging Face example.
Note: waiting on PiPPy binaries later this week (04/10) to update the dev requirements.
This PR also moves all our large-model-related examples (DeepSpeed, Accelerate) under the example/large_model folder.
Type of change
Please delete options that are not relevant.
Feature/Issue validation/testing
Please describe the Unit or Integration tests that you ran to verify your changes and relevant result summary. Provide instructions so it can be reproduced.
Please also list any relevant details for your test configuration.
Test A
Logs for regression tests
https://drive.google.com/file/d/1k7iBydhIv2MpBXmwDuaPkFuerXige5Tm/view?usp=share_link
Test B
Logs for Test B
Checklist: