FLAN-T5 combines two components: FLAN (Finetuned LAnguage Net), an instruction-finetuning approach, and T5, a language model developed and published by Google in 2020. FLAN-T5 improves on the original T5 by significantly boosting zero-shot performance. The model comes in several variants by parameter count: FLAN-T5 Small (80M), FLAN-T5 Base (250M), FLAN-T5 Large (780M), FLAN-T5 XL (3B), and FLAN-T5 XXL (11B). This deployment uses the XXL variant with 11B parameters.
Hugging Face repository: https://huggingface.co/google/flan-t5-xxl
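For reference, here is a minimal sketch of querying this checkpoint with the Hugging Face transformers library (the prompt text is only an illustration; this is not the deployment's actual serving code):

```python
# Minimal sketch: load google/flan-t5-xxl with transformers.
# Requires: pip install transformers accelerate sentencepiece
# device_map="auto" spreads the 11B weights across available GPUs
# (and offloads to CPU if VRAM is insufficient).
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("google/flan-t5-xxl")
model = T5ForConditionalGeneration.from_pretrained(
    "google/flan-t5-xxl", device_map="auto"
)

# Example instruction-style prompt.
input_ids = tokenizer(
    "Translate English to German: How old are you?",
    return_tensors="pt",
).input_ids.to(model.device)

outputs = model.generate(input_ids, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```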
Minimum GPU: NVIDIA RTX 3090. On first start, model files totaling about 45 GB are downloaded. After that, the checkpoint shards are loaded, which also takes some time. To run, simply deploy this SDL and wait for the model to finish loading, following the progress in the logs.
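The SDL itself is not reproduced here. As a rough illustration only, a minimal Akash SDL with a GPU compute profile might look like the sketch below; the image name, resource sizes, and pricing are placeholders, not the actual deployment values:

```yaml
# Hypothetical sketch only: image, ports, resource sizes, and pricing
# are placeholders, not the values from the actual deployment.
version: "2.0"

services:
  app:
    image: example/flan-t5-xxl-app:latest   # placeholder image
    expose:
      - port: 8080
        as: 80
        to:
          - global: true

profiles:
  compute:
    app:
      resources:
        cpu:
          units: 8
        memory:
          size: 64Gi
        gpu:
          units: 1
          attributes:
            vendor:
              nvidia:
                - model: rtx3090
        storage:
          size: 100Gi   # room for the ~45 GB of model files
  placement:
    dcloud:
      pricing:
        app:
          denom: uakt
          amount: 100000

deployment:
  app:
    dcloud:
      profile: app
      count: 1
```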
The logs in the screenshot show that model loading has completed, the application web server has started, and the application is ready to work: