Welcome to the Live-Audio repository! This project hosts two exciting applications that leverage advanced audio understanding and speech generation models to bring your audio experiences to life:
Voice Chat: This application provides an interactive, natural chatting experience, making it easier to adopt sophisticated AI-driven dialogue in a variety of settings.
For SenseVoice, visit the SenseVoice repo and the SenseVoice space.
Clone and install
- Clone the repo and submodules
#0 source code
apt update
apt-get install vim ffmpeg git-lfs -y
mkdir -p /asset
chmod 777 /asset/
cd /workspace
git clone https://github.com/zwong91/Live-Audio.git
cd /workspace/Live-Audio
git pull
git submodule update --init --recursive
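If the repo or any submodule tracks large assets with Git LFS (git-lfs was installed above), you may also need to enable it and fetch the objects once:
git lfs install
git lfs pull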
#1 pre_install.sh
# Install Miniconda and set up a conda environment for PyTorch/CUDA
mkdir -p ~/miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh
bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3
rm -rf ~/miniconda3/miniconda.sh
~/miniconda3/bin/conda init bash && source ~/miniconda3/bin/activate
conda config --set auto_activate_base false
conda create -n rt python=3.10 -y
conda activate rt
#2 Live-Audio
cd /workspace/Live-Audio
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple
#3 xtts
cd /workspace/Live-Audio/src/xtts
pip install -e .[all,dev,notebooks] -i https://pypi.tuna.tsinghua.edu.cn/simple
#4. download xtts-v2
HF_ENDPOINT=https://hf-mirror.com huggingface-cli download coqui/XTTS-v2 --local-dir XTTS-v2
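After the download finishes, the XTTS-v2 directory should contain the model weights and config (the exact file list may change upstream):
ls XTTS-v2
# expect at least model.pth, config.json and vocab.json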
# Sanity-check GPU, CUDA toolkit and PyTorch versions
nvidia-smi
nvcc --version
pip show torch
If you hit a "`GLIBCXX_3.4.32' not found" error at runtime (GCC 13.2.0), see:
https://stackoverflow.com/questions/76974555/glibcxx-3-4-32-not-found-error-at-runtime-gcc-13-2-0
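A common workaround (a sketch, assuming the error comes from the conda environment shipping an older libstdc++ than the one your extensions were built against) is to check which GLIBCXX versions the active libstdc++ exports and, if 3.4.32 is missing, install a newer libstdc++ from conda-forge:
# list the GLIBCXX symbol versions provided by the env's libstdc++
strings "$CONDA_PREFIX/lib/libstdc++.so.6" | grep GLIBCXX
# pull a newer libstdc++ into the rt env
conda install -n rt -c conda-forge libstdcxx-ng -y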
This README does not cover how to use CUDA in Docker in detail; see the NVIDIA Container Toolkit documentation for that. Still, these are the commands for Linux:
distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
&& curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
&& curl -s -L https://nvidia.github.io/libnvidia-container/$distribution/libnvidia-container.list | \
sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker
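To confirm that Docker can now see the GPU, run a quick sanity check (the CUDA image tag is only an example; any CUDA base image will do):
sudo docker run --rm --gpus all nvidia/cuda:12.4.1-base-ubuntu22.04 nvidia-smi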
You can build the container image with:
sudo docker build -t live-audio .
After getting your VAD token (see the next sections), run:
sudo docker volume create huggingface
sudo docker run --gpus all -p 8765:8765 -v huggingface:/root/.cache/huggingface -e PYANNOTE_AUTH_TOKEN='VAD_TOKEN_HERE' live-audio
The volume keeps the downloaded Hugging Face models across container runs, so they are not re-downloaded every time you restart the container. If you don't need this, just use:
sudo docker run --gpus all -p 8765:8765 -e PYANNOTE_AUTH_TOKEN='VAD_TOKEN_HERE' live-audio
Prepare
You will need an OpenAI API token, plus a Hugging Face token for the pyannote VAD model (passed via --vad-args below).
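For example, assuming the server reads the key from the standard OPENAI_API_KEY environment variable (check src/main.py for the exact variable your build expects):
export OPENAI_API_KEY=YOUR_OPENAI_KEY_HERE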
HF_ENDPOINT=https://hf-mirror.com python3 -m src.main --certfile cf.pem --keyfile cf.key --tts-type xtts-v2 --vad-type pyannote --vad-args '{"auth_token": "VAD_TOKEN_HERE"}'
Test
export PYANNOTE_AUTH_TOKEN=VAD_TOKEN_HERE
ASR_TYPE=sensevoice python -m unittest test.server.test_server