
We believe in AI democratization. llama-node is a Node.js library backed by llama-rs, llama.cpp, and rwkv.cpp that runs locally on your laptop CPU. It supports LLaMA, Alpaca, GPT4All, Vicuna, and RWKV models.


LLaMA Node

llama-node: Node.js Library for Large Language Model


(Cover image: a llama, generated by Stable Diffusion.)



Introduction

This project is in an early stage and is not production-ready; we do not follow semantic versioning. The Node.js API may change in the future, so use it with caution.

This is a Node.js library for running inference on LLaMA, RWKV, and LLaMA-derived models. It is built on top of llm (originally llama-rs), llama.cpp, and rwkv.cpp. It uses napi-rs to pass messages between the Node.js thread and the LLaMA inference thread.

Supported models

llama.cpp backend supported models (in GGML format):

llm (llama-rs) backend supported models (in GGML format):

  • GPT-2
  • GPT-J
  • LLaMA: LLaMA, Alpaca, Vicuna, Koala, GPT4All v1, GPT4-X, Wizard
  • GPT-NeoX: GPT-NeoX, StableLM, RedPajama, Dolly v2
  • BLOOM: BLOOMZ

rwkv.cpp backend supported models (in GGML format):

Supported platforms

  • darwin-x64
  • darwin-arm64
  • linux-x64-gnu (glibc >= 2.31)
  • linux-x64-musl
  • win32-x64-msvc

Node.js version: >= 16


Installation

  • Install the llama-node npm package:

    npm install llama-node

  • Install at least one of the inference backends:

    • llama.cpp:
    npm install @llama-node/llama-cpp
    • or llm:
    npm install @llama-node/core
    • or rwkv.cpp:
    npm install @llama-node/rwkv-cpp
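Once llama-node and a backend are installed, inference can be run from Node.js. The sketch below uses the llama.cpp backend and is adapted from the project's example code; the model path, file name, and tuning parameters (context size, sampling settings) are placeholders you should adjust for your own setup.

```typescript
// Minimal usage sketch with the llama.cpp backend.
// Assumes a GGML-format model file is available locally; the path below is a placeholder.
import { LLM } from "llama-node";
import { LLamaCpp, LoadConfig } from "llama-node/dist/llm/llama-cpp.js";
import path from "path";

const modelPath = path.resolve(process.cwd(), "./ggml-vicuna-7b-q4_0.bin");
const llama = new LLM(LLamaCpp);

const config: LoadConfig = {
    modelPath,
    enableLogging: true,
    nCtx: 1024,       // context window size
    seed: 0,
    f16Kv: false,
    logitsAll: false,
    vocabOnly: false,
    useMlock: false,
    embedding: false,
    useMmap: true,
    nGpuLayers: 0,    // CPU-only inference
};

const run = async () => {
    await llama.load(config);
    await llama.createCompletion(
        {
            prompt: "What is the capital of France?",
            nThreads: 4,
            nTokPredict: 128,
            topK: 40,
            topP: 0.1,
            temp: 0.2,
            repeatPenalty: 1,
        },
        // Tokens are streamed back through this callback as they are generated.
        (response) => {
            process.stdout.write(response.token);
        }
    );
};

run();
```

The other backends follow the same pattern: swap the imported adapter (e.g. from @llama-node/core or @llama-node/rwkv-cpp) and adjust the load config for that backend.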

Manual compilation

Please see our contribution guide to get started with manual compilation.


CUDA support

Please read the documentation on our site to get started with manual compilation for CUDA support.


Acknowledgments

This library is published under the MIT/Apache-2.0 license. However, we strongly recommend that you cite our work and the work of our dependencies if you reuse code from this library.

Models/Inferencing tools dependencies

Some source code comes from


Community

Join the llama-node Discord community!
