Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
-
Updated
Dec 20, 2024 - Python
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Pybind11 bindings for Whisper.cpp
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
A static site demonstrating real-time audio transcription via Amazon Transcribe over a WebSocket.
Free speech to text
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
Streamlit Audio Transcription with OPENAI's Whisper Ai: An interactive Streamlit app demonstrating real-time audio transcription using OPENAI's Whisper Ai.
Transcription and annotation interface for recorded audio or video files
Generate subtitles for long movies / podcasts with OpenAI Whisper API.
Speakscribe is a web application that allows users to transcribe audios using OpenAI and also interact with a chat bot. The web application is created in Python using NiceGUI.
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.
Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.
Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the transcriptions as PDF documents, enhanced by the gpt-3.5-turbo model.
The GroqCloud API wrapper for Delphi provides access to models from Meta, OpenAI, MistralAI and Google on Groq’s LPUs, offering chat, text generation, image analysis, audio transcription, JSON output, tool integration, and content moderation capabilities.
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
Add a description, image, and links to the audio-transcription topic page so that developers can more easily learn about it.
To associate your repository with the audio-transcription topic, visit your repo's landing page and select "manage topics."