Skip to content

A voice Assistance tool employing LLMs and Seq2Seq Encoder Decoder Models.

License

Notifications You must be signed in to change notification settings

ppap1771/voice-assistance

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Audio Processing Toolkit

This project is a toolkit for audio processing tasks.

Overview

The project provides functionalities for:

  • Speech to Text: Convert audio files to text.
  • Text to Speech: Generate speech audio from text.
  • Audio Input: Capture audio input from various sources.

Features

  • Speech to text conversion with high accuracy.
  • Text to speech generation with natural-sounding voices.
  • Support for various audio input sources.

Installation

This project utilizes a virtual environment for managing dependencies. To install, follow these steps:

  1. Create a virtual environment:
    python3 -m venv .venv
  2. Activate the virtual environment:
    • Linux/macOS:
      source .venv/bin/activate
    • Windows:
      .venv\Scripts\activate
  3. Install dependencies:
    pip install -r requirements.txt

Usage

Speech to Text

from utils import speech2text

audio_file = "data/output.wav"
text = speech2text(audio_file)

print(text)

Text to Speech

from utils import text2speech

text = "This is a test of the text to speech functionality."
audio_file = "data/output.wav"
text2speech(text, audio_file)

Audio Input

from utils import audio_in

# Capture audio from the default microphone
audio_data = audio_in()

# Process the audio data
# ...

License

This project is licensed under the [License Name] license.

About

A voice Assistance tool employing LLMs and Seq2Seq Encoder Decoder Models.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages