Skip to content

A web app that lets you play around with TalkNet models

License

Notifications You must be signed in to change notification settings

Randy-H0/ControllableTalkNet

 
 

Repository files navigation

Controllable TalkNet

Controllable TalkNet is a web application that lets you synthesize speech, which mimics the pitch and pacing of an existing audio clip. It's based on NVIDIA's implementation of TalkNet 2, with some changes to support singing synthesis and higher audio quality.

Requirements

  • A Google account to run Colab, or...
  • An NVIDIA GPU with 4+ GB of VRAM
  • 10 GB of free space

How to run

Google Colab

TalkNet Offline (Windows)

Docker (Linux)

  • Install Docker and NVIDIA Container Toolkit.
  • Download the Dockerfile. Open a terminal, and navigate to the directory where you saved it.
  • Run docker build -t talknet-offline . to build the image. Add sudo if you're not using rootless Docker.
  • Run docker run -it --gpus all -p 8050:8050 talknet-offline to start TalkNet on http://127.0.0.1:8050/.

About

A web app that lets you play around with TalkNet models

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 60.9%
  • Jupyter Notebook 26.4%
  • CSS 11.7%
  • Dockerfile 1.0%