text-to-sound

Here are 5 public repositories matching this topic...

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

audio music semantic text-to-speech tokenizer speech sound codec audio-codec gpt language-model self-supervised-learning text-to-music vall-e text-to-sound speech-language-model

Create a Movie animation plus Audio plus Subtitle from a text file

ffmpeg text-to-video chatgpt text-to-sound

Exploring Bark, the Open-Source Text-to-Audio Generative Model

Create .wav audio samples with text-to-sound generative AI

python music windows macos cli ai music-composition samples wav synthesis cli-app music-generation generative electronic-music sound-synthesis sample-generation prompt-engineering generative-ai text-to-sound

Add a description, image, and links to the text-to-sound topic page so that developers can more easily learn about it.

To associate your repository with the text-to-sound topic, visit your repo's landing page and select "manage topics."