From 3cfb429a2fbbfd7b7aa2d19c59d9ffbd5b34ad26 Mon Sep 17 00:00:00 2001 From: katopz Date: Wed, 24 Jul 2024 11:24:20 +0700 Subject: [PATCH] feat: add ollama --- src/SUMMARY.md | 1 + src/ml/infer/llama.cpp.md | 1 - src/ml/infer/mod.md | 2 +- src/ml/infer/ollama.md | 20 ++++++++++++++++++++ 4 files changed, 22 insertions(+), 2 deletions(-) delete mode 100644 src/ml/infer/llama.cpp.md create mode 100644 src/ml/infer/ollama.md diff --git a/src/SUMMARY.md b/src/SUMMARY.md index 0e5520e..71e0bf1 100644 --- a/src/SUMMARY.md +++ b/src/SUMMARY.md @@ -67,6 +67,7 @@ - [ML](ml/mod.md) - [Large Language Model](ml/large-language-model.md) - [Inferences](ml/infer/mod.md) + - [ollama](ml/infer/ollama.md) - [TabbyML](ml/infer/tabbyml.md) - [WasmEdge](ml/infer/wasmedge.md) - [Hugging Face - Text Generation Inference](ml/infer/huggingface-text-generation-inference.md) diff --git a/src/ml/infer/llama.cpp.md b/src/ml/infer/llama.cpp.md deleted file mode 100644 index 0e0e5cd..0000000 --- a/src/ml/infer/llama.cpp.md +++ /dev/null @@ -1 +0,0 @@ -# llama.cpp diff --git a/src/ml/infer/mod.md b/src/ml/infer/mod.md index 689b6c1..c723c10 100644 --- a/src/ml/infer/mod.md +++ b/src/ml/infer/mod.md @@ -5,4 +5,4 @@ - [x] [WasmEdge](./wasmedge.md) // MacOS/Windows/WSL2 - [x] [TabbyML](./tabbyml.md) // MacOS/Windows/WSL2 - [x] [llama.cpp](https://github.com/ggerganov/llama.cpp) // MacOS/Windows // 13GB // 34.01 tokens/sec -- [x] [ollama](https://ollama.ai/) // MacOS +- [x] [ollama](./ollama.md) // MacOS/Windows/WSL2 diff --git a/src/ml/infer/ollama.md b/src/ml/infer/ollama.md new file mode 100644 index 0000000..2479c2f --- /dev/null +++ b/src/ml/infer/ollama.md @@ -0,0 +1,20 @@ +# ollama + +## Setup + +### Windows + +Follow the step at [ollama](https://ollama.ai/) + +``` +ollama run llama3.1 +``` + +## Serve (optional via 13434 PORT) + +### Windows + +``` +SET OLLAMA_HOST=0.0.0.0:13434 +ollama serve +```