This project is a toolkit for audio processing tasks.
The project provides functionalities for:
- Speech to Text: Convert audio files to text.
- Text to Speech: Generate speech audio from text.
- Audio Input: Capture audio input from various sources.
- Speech to text conversion with high accuracy.
- Text to speech generation with natural-sounding voices.
- Support for various audio input sources.
This project utilizes a virtual environment for managing dependencies. To install, follow these steps:
- Create a virtual environment:
python3 -m venv .venv
- Activate the virtual environment:
- Linux/macOS:
source .venv/bin/activate
- Windows:
.venv\Scripts\activate
- Linux/macOS:
- Install dependencies:
pip install -r requirements.txt
from utils import speech2text
audio_file = "data/output.wav"
text = speech2text(audio_file)
print(text)
from utils import text2speech
text = "This is a test of the text to speech functionality."
audio_file = "data/output.wav"
text2speech(text, audio_file)
from utils import audio_in
# Capture audio from the default microphone
audio_data = audio_in()
# Process the audio data
# ...
This project is licensed under the [License Name] license.