hertz-dev

Hertz-dev is an open-source, first-of-its-kind base model for full-duplex conversational audio.

See our blog post for more details: https://si.inc/hertz-dev/

Setup

Inference is known to work on Python 3.10 and CUDA 12.1; other versions have not been tested as thoroughly. To use CUDA 12.1, first install torch with pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121, then run pip install -r requirements.txt.
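Before installing the remaining requirements, it can help to confirm which Python and CUDA build are actually in use. The following is a minimal sketch, not part of the repository: the helper name check_environment is hypothetical, and it only reports versions without enforcing them.

```python
import importlib.util
import sys


def check_environment():
    """Collect the interpreter version and, if installed, PyTorch/CUDA details."""
    info = {
        "python": f"{sys.version_info.major}.{sys.version_info.minor}",
        "torch": None,
        "cuda_build": None,
        "cuda_available": False,
    }
    # Import torch only when it is installed, so the check itself never crashes.
    if importlib.util.find_spec("torch") is not None:
        import torch

        info["torch"] = torch.__version__
        info["cuda_build"] = torch.version.cuda  # None for CPU-only builds
        info["cuda_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

On the tested configuration, one would expect python to report 3.10 and cuda_build to report 12.1; other values do not necessarily mean inference will fail, only that the combination is less tested.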

On Ubuntu you may need to install libportaudio: sudo apt-get install libportaudio2

All three scripts (inference.ipynb, inference_client.py, and inference_server.py) automatically download the models to the ./ckpt directory. The checkpoints are also listed at https://ckpt.si.inc/hertz-dev/index.txt
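To see which checkpoints are available without running the scripts, the index file can be fetched directly. This is a rough sketch under an assumption: the format of index.txt is not documented here, so the code assumes one entry per line. The function names parse_index and fetch_index are hypothetical and do not come from the repository.

```python
import urllib.request

INDEX_URL = "https://ckpt.si.inc/hertz-dev/index.txt"  # URL given in this README


def parse_index(text):
    """Split the index into entries (assumed format: one entry per line)."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def fetch_index(url=INDEX_URL, timeout=30):
    """Download the index file and return its non-empty lines."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return parse_index(response.read().decode("utf-8"))


if __name__ == "__main__":
    for entry in fetch_index():
        print(entry)
```

If the index turns out to use a different layout, only parse_index would need to change.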

Usage

We recommend starting by using inference.ipynb to generate one- or two-channel completions from a prompt.

Then you can use inference_client.py and inference_server.py to talk to the model live through your microphone. These scripts are currently experimental and have primarily been tested with Ubuntu on the server and macOS on the client.

