chore: Update readme (#41)
Signed-off-by: Ce Gao <cegao@tensorchord.ai>
gaocegege authored May 25, 2023
1 parent 59e2524 commit ef5d369
Showing 1 changed file with 7 additions and 6 deletions: README.md
@@ -16,7 +16,7 @@ Modelz LLM is an inference server that facilitates the utilization of open source ...
- **OpenAI compatible API**: Modelz LLM provides an OpenAI compatible API for LLMs, which means you can use the OpenAI python SDK to interact with the model.
- **Self-hosted**: Modelz LLM can be easily deployed on either local or cloud-based environments.
- **Open source LLMs**: Modelz LLM supports open source LLMs, such as FastChat, LLaMA, and ChatGLM.
- **Modelz integration**: Modelz LLM can be easily integrated with [Modelz](https://docs.modelz.ai), which is a serverless inference platform for LLMs and other foundation models.
- **Cloud native**: We provide Docker images for different LLMs, which can be easily deployed on Kubernetes or other cloud-based environments (e.g. [Modelz](https://docs.modelz.ai)).

## Quick Start

@@ -33,23 +33,24 @@ pip install git+https://github.com/tensorchord/modelz-llm.git[gpu]
Please first start the self-hosted API server:

```bash
modelz-llm -m "THUDM/chatglm-6b-int4"
```
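
Before pointing a client at the API, it can help to confirm that the server is actually accepting connections. The snippet below is a minimal readiness check, assuming the server listens on `http://localhost:8000`; the address and port are assumptions about your local setup, not values documented here.

```python
import time

import requests

BASE_URL = "http://localhost:8000"  # assumed local address; adjust to match how you started the server


def wait_for_server(url: str, timeout: float = 120.0) -> None:
    """Poll the server until it answers any HTTP request or the timeout expires."""
    deadline = time.time() + timeout
    while time.time() < deadline:
        try:
            # Any HTTP response (even a 404) means the process is up and listening.
            requests.get(url, timeout=5)
            print(f"Server at {url} is up.")
            return
        except requests.ConnectionError:
            time.sleep(2)
    raise RuntimeError(f"Server at {url} did not respond within {timeout} seconds.")


if __name__ == "__main__":
    wait_for_server(BASE_URL)
```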

Currently, we support the following models:

| Model Name | Huggingface Model | Docker Image |
| ---------- | ----------- | ---------------- |
| Vicuna 7B Delta V1.1 | `lmsys/vicuna-7b-delta-v1.1` | [modelzai/llm-vicuna-7b](https://hub.docker.com/repository/docker/modelzai/llm-vicuna-7b/general) |
| LLaMA 7B | `decapoda-research/llama-7b-hf` | [modelzai/llm-llama-7b](https://hub.docker.com/repository/docker/modelzai/llm-llama-7b/general) |
| ChatGLM 6B INT4 | `THUDM/chatglm-6b-int4` | [modelzai/llm-chatglm-6b-int4](https://hub.docker.com/repository/docker/modelzai/llm-chatglm-6b-int4/general) |
| ChatGLM 6B | `THUDM/chatglm-6b` | [modelzai/llm-chatglm-6b](https://hub.docker.com/repository/docker/modelzai/llm-chatglm-6b/general) |
| Bloomz 560M | `bigscience/bloomz-560m` | |
| Bloomz 1.7B | `bigscience/bloomz-1b7` | |
| Bloomz 3B | `bigscience/bloomz-3b` | |
| Bloomz 7.1B | `bigscience/bloomz-7b1` | |

<!-- | FastChat T5 3B V1.0 | `lmsys/fastchat-t5-3b-v1.0` | `lmsys/fastchat-t5-3b-v1.0` | -->

You can also set the `MODELZ_MODEL` environment variable to specify the model and tokenizer.

### Use the OpenAI Python SDK

Then you can use the OpenAI Python SDK to interact with the model.
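
As a rough illustration, a request with the pre-1.0 `openai` Python SDK might look like the sketch below. The base URL, the placeholder API key, and the model name are assumptions for a local deployment rather than values taken from this README.

```python
import openai

# Point the SDK at the self-hosted server instead of api.openai.com.
# The host and port are assumptions; use whatever address your server listens on.
openai.api_base = "http://localhost:8000"
openai.api_key = "any"  # placeholder; a self-hosted server typically does not check the key

completion = openai.ChatCompletion.create(
    model="THUDM/chatglm-6b-int4",  # assumed to match the model the server was started with
    messages=[
        {"role": "user", "content": "Hello! What can you do?"},
    ],
)
print(completion.choices[0].message.content)
```

Because the API is OpenAI compatible, the same pattern should work for the other models in the table above by changing only the `model` argument.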
