
Use pre-downloaded models with the Transformers backend #3776

Closed
joshbtn opened this issue Oct 9, 2024 · 1 comment · Fixed by #3777
Labels
enhancement New feature or request

Comments

Contributor

joshbtn commented Oct 9, 2024

Is your feature request related to a problem? Please describe.
I have a use case where I pre-download models for use with the transformers backend and drop them into my models directory. My server does not have internet access. When I specify the model for the transformers backend to use, it tries to download the model rather than finding the copy that already exists on the file system.

Describe the solution you'd like
I'd like to detect if the model has already been downloaded and tell transformers to use that path.
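A minimal sketch of the detection described above, assuming the models directory contains one subdirectory per pre-downloaded model; `resolve_model_path` and `models_dir` are hypothetical names for illustration, not LocalAI's actual API:

```python
import os


def resolve_model_path(model_name: str, models_dir: str) -> str:
    """Return the local path if the model is already downloaded,
    otherwise return the name unchanged so transformers downloads it.

    `models_dir` stands in for the LocalAI models directory (assumption).
    """
    local_path = os.path.join(models_dir, model_name)
    if os.path.isdir(local_path):
        # Found on disk: hand transformers the path so no download is attempted.
        return local_path
    # Not found locally: fall back to the Hugging Face model name.
    return model_name
```

The returned value could then be passed straight to `from_pretrained`, which accepts either a Hub model ID or a local directory path.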

Describe alternatives you've considered
I considered passing the full path as the model name, but this fails validation, both in the model YAML config and when passing it in a request to the completions endpoint.

Additional context
I'd like this to work with the transformers backend.

```yaml
name: some-model
backend: transformers
parameters:
    model: "some-model-from-huggingface"
type: AutoModelForCausalLM
...
```

This was a workaround I tried; however, it fails on some validation of the model name in the API. It would have worked had the model name made its way all the way to the transformers backend. Seeing as there's already a ModelFile provided as part of the request object, it seems like we should check that first rather than changing the model name validation.

```yaml
name: some-model
backend: transformers
parameters:
    model: "/full/path/to/some-model-from-huggingface"
type: AutoModelForCausalLM
...
```
@joshbtn joshbtn added the enhancement New feature or request label Oct 9, 2024
Contributor Author

joshbtn commented Oct 9, 2024

This may be related to, or may even fix, issue #3594.
