Techmo Sp. z o.o. module for audio features extraction
pip install techmo-wavelet
from techmo.feature_extraction import calculate_wavelet_fft
features = calculate_wavelet_fft("path/to/audio_file")
- Speech segment is processed by the Hann window,
- The analyzed segment is normalized,
- Speech segment is processed by the wavlet transform,
- Each sub band is subjected to the Fast Fourier Transform,
- Triangular filtration,
- Logarithm of filter outputs.
A detailed presentation of the algorithm is presented in the paper M.Ziołko, M.Kucharski, S.Pałka, B.Ziołko, K.Kaminski, I.Kowalska, A.Szpakowicz, J.Jamiołkowski, M.Chlabicz, M.Witkowski: Fourier-Wavelet Voice Analysis Applied to Medical Screening Tests. Proceedings of the INTERSPEECH 2021 (under review).