fetch_llama_cpp

This Python module automates downloading and setting up the latest and best binary distribution of llama.cpp for your system and graphics card (if present).

It fetches the latest release from GitHub, detects your system's specifications, and selects the most suitable binary for your setup.

Features

Automatic Detection: Detects your operating system, architecture, and GPU (NVIDIA or AMD).
CUDA Compatibility: Checks for CUDA and driver versions to ensure compatibility.
AVX Support: Checks for AVX, AVX2, and AVX512 support on your CPU.
Download and Extraction: Downloads the appropriate binary and extracts it.
Verification: Runs the binary with --version to verify the setup.

Requirements

Python 3.x
requests library
cpuinfo library
zipfile and tarfile modules
subprocess and platform modules

Usage

There are several ways to run fetch_llama_cpp:

1. As a module

% python3 -m fetch_llama_cpp

2. As a script

% python3 fetch_llama_cpp.py

3. As an import

import fetch_llama_cpp

fetch_llama_cpp.fetch()

4. As a container

% podman run -v $PWD:/app fetch_llama_cpp
% docker run -v $PWD:/app fetch_llama_cpp

Environment

fetch_llama_cpp is designed to be run in a Python 3 environment:

POSIX (Linux, macOS, etc.):

% python -m venv .venv
% source ./.venv/bin/activate
(.venv) % pip install -r requirements.txt
(.venv) % ./fetch_llama_cpp.py

Windows:

> python -m venv .venv
> .\.venv\Scripts\avtivate.ps1
> pip install -r requirements.txt
> python fetch_llama_cpp.py

How It Works

Fetch Latest Release: The script fetches the latest release information from the llama.cpp GitHub repository.
System Information: It detects your operating system and architecture.
GPU Detection: Checks for NVIDIA or AMD GPUs and their respective CUDA and driver versions.
AVX Support: Checks if your CPU supports AVX, AVX2, or AVX512.
Select Best Asset: Based on the detected information, it selects the most suitable binary asset.
Download and Extract: Downloads the selected binary and extracts it to the specified directory.
Run Verification: Runs the binary with --version to ensure it was set up correctly.

Notes

Ensure you have the necessary permissions to run nvidia-smi and lspci commands.
The script assumes a standard directory structure for the downloaded and extracted files.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
__pycache__		__pycache__
build/lib/fetch_llama_cpp		build/lib/fetch_llama_cpp
fetch_llama_cpp		fetch_llama_cpp
samples		samples
tests		tests
.gitignore		.gitignore
Containerfile		Containerfile
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fetch_llama_cpp

Features

Requirements

Usage

Environment

How It Works

Notes

License

About

Releases

Packages

Languages

License

pAI-OS/fetch_llama_cpp

Folders and files

Latest commit

History

Repository files navigation

fetch_llama_cpp

Features

Requirements

Usage

Environment

How It Works

Notes

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages