This repository contains scripts for audio processing. The main features are:
- Video/audio to wav
- Audio vocal separation
- Automatic audio slicing
- Audio loudness matching
- Audio data statistics (supports determining audio length)
- Audio resampling
- Audio transcription (.lab)
- Audio transcription via FunASR (pass `--model-type funasr` to enable; see the code for detailed usage)
- Audio transcription via WhisperX
([ ] indicates not completed, [x] indicates completed)
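
As an illustration of what the audio-length statistic in the list above involves, here is a minimal sketch using only Python's standard `wave` module. It is not this repo's implementation: the function names `wav_duration_seconds` and `summarize_durations` are hypothetical, and it assumes uncompressed PCM WAV input.

```python
import wave
from pathlib import Path


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds (hypothetical helper)."""
    with wave.open(str(path), "rb") as wav:
        # Duration is the number of frames divided by the sample rate.
        return wav.getnframes() / wav.getframerate()


def summarize_durations(folder):
    """Collect count, total, min, and max duration for all .wav files under folder."""
    durations = [wav_duration_seconds(p) for p in sorted(Path(folder).rglob("*.wav"))]
    if not durations:
        return {"count": 0, "total": 0.0, "min": None, "max": None}
    return {
        "count": len(durations),
        "total": sum(durations),
        "min": min(durations),
        "max": max(durations),
    }
```

Calling `summarize_durations("data/")` on a folder of WAV files returns a small dictionary of durations in seconds; the folder name here is only an example.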
This code has been tested on Ubuntu 22.04 / 20.04 + Python 3.10. If you encounter problems on other versions, feedback is welcome.
To install, run `pip install -e .` from the repository root.
Then run `fap --help` to list the available commands.