
Add Dockerfile + build workflow #73

Merged
merged 5 commits into abetlen:main on May 2, 2023

Conversation

Niek
Contributor

@Niek Niek commented Apr 12, 2023

Fixes #70

This PR adds a Dockerfile and updates the release workflow to build the latest Docker image as well. Both amd64 and arm64 architectures are built.
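For context, a multi-arch image build in a GitHub Actions release workflow typically looks something like the sketch below. This is an illustrative reconstruction, not necessarily the exact workflow in this PR; the job name and image tag are assumptions.

```yaml
# Hypothetical sketch of a multi-arch Docker build job (names are illustrative)
jobs:
  docker:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v3
      - uses: docker/setup-qemu-action@v2     # QEMU emulation so arm64 can build on x86 runners
      - uses: docker/setup-buildx-action@v2   # Buildx enables multi-platform builds
      - uses: docker/build-push-action@v4
        with:
          context: .
          platforms: linux/amd64,linux/arm64  # both architectures in one build
          push: true
          tags: ghcr.io/abetlen/llama-cpp-python:latest  # assumed tag
```

With `setup-qemu-action` in place, Buildx cross-builds the arm64 layers under emulation and pushes a single multi-arch manifest, so `docker pull` resolves the right image on either architecture.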

@abetlen
Owner

abetlen commented Apr 12, 2023

@Niek do you mind moving this to the build release workflow?

@Niek
Contributor Author

Niek commented Apr 12, 2023

@abetlen are you referring to build-and-release.yml? If we move the Docker step into that workflow, it can't use pip install, though; it will have to download the build artifacts and use those instead. I'm not sure if that's what you intend.
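Concretely, the artifact-based variant being described would look roughly like the fragment below. This is a hypothetical sketch: the artifact name, wheel directory, and build-arg are assumptions, not the PR's actual workflow.

```yaml
      # Hypothetical: fetch the wheels built earlier in build-and-release.yml
      - uses: actions/download-artifact@v3
        with:
          name: wheels        # assumed artifact name from the build job
          path: dist/
      # Build the image from the local wheel instead of pip-installing from PyPI
      - run: docker build --build-arg WHEEL_DIR=dist -t llama-cpp-python .
```

The trade-off is exactly what the comment raises: the Dockerfile would then need a matching `ARG WHEEL_DIR` and a `pip install dist/*.whl` step rather than a plain `pip install llama-cpp-python[server]`.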

@jmtatsch

Maybe we should directly add OpenBLAS support? It would need these two lines:

```dockerfile
RUN apt update && apt install -y libopenblas-dev
RUN LLAMA_OPENBLAS=1 pip install llama-cpp-python[server]
```

@Niek
Contributor Author

Niek commented Apr 15, 2023

Good idea @jmtatsch, added now.

@jmtatsch

jmtatsch commented Apr 21, 2023

Here is a Dockerfile for a cuBLAS-capable container that should bring huge speed-ups for CUDA GPU owners after the next sync with upstream:

```dockerfile
FROM nvidia/cuda:12.1.0-devel-ubuntu22.04

EXPOSE 8000
ENV MODEL=/models/ggml-vicuna-13b-1.1-q4_0.bin
# allow non-local connections to the API
ENV HOST=0.0.0.0

RUN apt update && apt install -y python3 python3-pip && LLAMA_CUBLAS=1 pip install llama-cpp-python[server]

ENTRYPOINT [ "python3", "-m", "llama_cpp.server" ]
```
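For anyone trying that image: running it requires GPU access to be passed through to the container (via the NVIDIA Container Toolkit on the host). A typical invocation would look something like the following; the image name and model path are examples, not names from this PR.

```shell
# Assumes nvidia-container-toolkit is installed on the host
docker build -t llama-cpp-cuda .
docker run --gpus all -p 8000:8000 \
  -v /path/to/models:/models \
  -e MODEL=/models/ggml-vicuna-13b-1.1-q4_0.bin \
  llama-cpp-cuda
```

Without `--gpus all` (or an equivalent `--runtime=nvidia` setup) the CUDA libraries inside the container have no device to talk to, and the server falls back to failing at startup rather than running on CPU.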

@gjmulder
Contributor

> Here is a Dockerfile for a cuBLAS-capable container that should bring huge speed-ups for CUDA GPU owners after the next sync with upstream:

@jmtatsch where is requirements.txt coming from?

@jmtatsch

jmtatsch commented Apr 22, 2023

> @jmtatsch where is requirements.txt coming from?

Good catch, it isn't necessary at all. I cleaned it up above.
In 0.1.36 cuBLAS is broken for me anyhow; waiting for ggerganov/llama.cpp#1128.

@Niek
Contributor Author

Niek commented Apr 24, 2023

@abetlen do you need any other changes?

@abetlen
Owner

abetlen commented Apr 24, 2023

@Niek if possible, can we include @jmtatsch's nvidia-docker container example in this PR as well? The ability to docker pull and run a GPU-accelerated container would be very helpful.

@jmtatsch

@abetlen We should make these two different containers then, because the nvidia container with cuBLAS is quite fat and not everyone has an Nvidia card.
I will make a pull request once this one is merged.
Sorry for hijacking your pull request, @Niek.

@abetlen abetlen mentioned this pull request May 2, 2023
@abetlen abetlen merged commit 8476b32 into abetlen:main May 2, 2023
@abetlen
Owner

abetlen commented May 2, 2023

@Niek finally got a chance to merge this, great work! We now have a Docker image.

@jmtatsch if you're still interested it would be awesome to get that cuBLAS-based image, happy to help there also.

Development

Successfully merging this pull request may close these issues.

Problems when i try to use this inside the default python 3.10 docker container
4 participants