fix(pipelines): support passing decoder model + tokenizer #319

dacorvo · 2023-11-13T12:46:39Z

When passing explicitly a neuron model to a pipeline, we check the model class. This modifies the check to accept not only NeuronBaseModel but also NeuronModelForCausalLM.

Update: cherry-picked @glegendre01 modifications to github workflows to reactivate INF1 CI.

HuggingFaceDocBuilderDev · 2023-11-13T12:50:44Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

We also remove driver reinstallation step

michaelbenayoun · 2023-11-14T10:23:52Z

optimum/neuron/pipelines/transformers/base.py

@@ -123,7 +123,7 @@ def load_pipeline(
            model, export=export, **compiler_args, **input_shapes, **hub_kwargs, **kwargs
        )
    # uses neuron model
-    elif isinstance(model, NeuronBaseModel):
+    elif isinstance(model, (NeuronBaseModel, NeuronModelForCausalLM)):


QQ: why is NeuronModelForCausalLM not a sublass of NeuronBseModel?

Because NeuronBaseModel is based on JIT models and implements the corresponding conversion logic. NeuronModelForCausalLM is a subclass of NeuronDecoderModel that uses transformers-neuronx models instead.
We could refactor to add a common class to both though, since the latest subclasses of NeuronBaseModel are overriding pretty much all its methods now.

Wouldn't it make more sense to use NeuronDecoderModel here?

I guess it would be good to refactor it if possible. But also it's not pressing and can be postponed to when we have more bandwidth!

I will wait until @JingyaHuang 's pull-request on T5 is merged as it is a new subclass, and any change I make before that to the base class will result in nightmarish conflicts.

JingyaHuang

LGTM, thanks for the fix!

dacorvo marked this pull request as ready for review November 13, 2023 12:51

dacorvo requested review from JingyaHuang, michaelbenayoun and philschmid November 13, 2023 12:51

dacorvo mentioned this pull request Nov 13, 2023

Cannot pass NeuronModelForCausalLM to pipeline #318

Closed

dacorvo and others added 2 commits November 13, 2023 16:46

fix(pipelines): support passing decoder model + tokenizer

c29def2

Remove EC2 jobs

3d8e2e6

We also remove driver reinstallation step

dacorvo force-pushed the fix_pipeline_load_model branch from 786e975 to 3d8e2e6 Compare November 13, 2023 16:53

dacorvo requested a review from glegendre01 November 13, 2023 17:06

michaelbenayoun reviewed Nov 14, 2023

View reviewed changes

JingyaHuang approved these changes Nov 14, 2023

View reviewed changes

dacorvo merged commit d47741f into main Nov 14, 2023
7 of 9 checks passed

dacorvo deleted the fix_pipeline_load_model branch November 14, 2023 12:28

dacorvo mentioned this pull request Nov 14, 2023

Remove EC2 jobs #314

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(pipelines): support passing decoder model + tokenizer #319

fix(pipelines): support passing decoder model + tokenizer #319

dacorvo commented Nov 13, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Nov 13, 2023

michaelbenayoun Nov 14, 2023

dacorvo Nov 14, 2023

JingyaHuang Nov 14, 2023

michaelbenayoun Nov 15, 2023

dacorvo Nov 15, 2023

JingyaHuang left a comment

fix(pipelines): support passing decoder model + tokenizer #319

fix(pipelines): support passing decoder model + tokenizer #319

Conversation

dacorvo commented Nov 13, 2023 • edited Loading

HuggingFaceDocBuilderDev commented Nov 13, 2023

michaelbenayoun Nov 14, 2023

Choose a reason for hiding this comment

dacorvo Nov 14, 2023

Choose a reason for hiding this comment

JingyaHuang Nov 14, 2023

Choose a reason for hiding this comment

michaelbenayoun Nov 15, 2023

Choose a reason for hiding this comment

dacorvo Nov 15, 2023

Choose a reason for hiding this comment

JingyaHuang left a comment

Choose a reason for hiding this comment

dacorvo commented Nov 13, 2023 •

edited

Loading