Inverse operations, wiener filter, softmask #5
Comments
A Wiener filter would be incredible. I would love to have it.
Do all the audio transforms have to work on For
import librosa
import numpy as np

def stft(signal, window_size, hop_size, window='hann', axis=-1):
    # Apply librosa.stft to every 1-D slice taken along `axis`.
    fft_config = dict(n_fft=window_size, hop_length=hop_size,
                      win_length=window_size, window=window)
    return np.apply_along_axis(librosa.stft, axis, signal, **fft_config)

def istft(spectrogram, window_size, hop_size, window='hann', axis=-2):
    ifft_config = dict(hop_length=hop_size, win_length=window_size,
                       window=window)
    # ... some reshaping code (presumably defines D, f and t)
    return np.apply_along_axis(_istft, axis, D, f, t, **ifft_config)

def _istft(spectrogram, frequency_bins, time_bins, **config):
    # Restore the (frequency, time) layout before inverting.
    spectrogram = np.reshape(spectrogram, [frequency_bins, time_bins])
    return librosa.istft(spectrogram, **config)
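For readers following along, here is a minimal sketch of what `np.apply_along_axis` does to the array shape in a wrapper like `stft` above. `toy_stft` is a hypothetical stand-in for `librosa.stft` (a plain Hann-windowed framed FFT, no padding), used only so the example runs with NumPy alone. The point is that the sample axis is replaced by the two axes the function returns, which may be why the inverse wrapper needs a reshaping step.

```python
import numpy as np

def toy_stft(signal, window_size=8, hop_size=4):
    """Hypothetical stand-in for librosa.stft (no centering or padding).

    Takes a 1-D signal and returns a complex array of shape
    (window_size // 2 + 1, n_frames).
    """
    n_frames = 1 + (len(signal) - window_size) // hop_size
    window = np.hanning(window_size)
    frames = np.stack([
        signal[i * hop_size:i * hop_size + window_size] * window
        for i in range(n_frames)
    ])  # (n_frames, window_size)
    return np.fft.rfft(frames, axis=-1).T  # (freq_bins, n_frames)

# Two-channel signal with 64 samples per channel: (channels, samples).
stereo = np.random.default_rng(0).standard_normal((2, 64))

# The last axis (samples) is removed and replaced by (freq_bins, n_frames).
spec = np.apply_along_axis(toy_stft, -1, stereo)
print(spec.shape)  # (2, 5, 15)
```

Each channel is transformed independently, so `spec[0]` equals `toy_stft(stereo[0])`.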
Yes, the aim is that all audio transforms (not to be confused with augmentation transforms) should be able to run on the GPU as part of the model. In any case, we would need to wait for the istft
Ah, ok. I use all my audio transforms as augmentation transforms.
Yes, currently I do the same. The dataset transforms are bound to the CPU, but maybe that will change. I think we would still benefit from a pure PyTorch implementation.
Concerning additional operators, the most valuable at this point would be inverse operators. However, for this, we would have to wait until ISTFT is implemented. But maybe it would be nice to implement some operations already, such as Wiener filtering, soft masking, and binary masking, preferably all for multichannel spectrograms.
I could add this, if you like the idea.
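To make the proposal concrete, here is a minimal NumPy sketch of soft and binary masking. Everything in it is an assumption for illustration, not an existing API of this project: the function names `soft_mask` and `binary_mask`, the `(sources, channels, freq, time)` layout, and the `power` and `eps` parameters. With `power=2` the soft mask is the classic single-channel Wiener gain applied to each channel independently. A true multichannel Wiener filter would also model spatial covariances, which this sketch does not attempt.

```python
import numpy as np

def soft_mask(source_mags, power=2.0, eps=1e-10):
    """Ratio (Wiener-like) masks from per-source magnitude estimates.

    source_mags: non-negative array, shape (sources, channels, freq, time).
    Returns masks of the same shape that sum to roughly 1 over the source axis.
    """
    powered = source_mags ** power
    return powered / (powered.sum(axis=0, keepdims=True) + eps)

def binary_mask(source_mags):
    """One-hot masks: each bin is assigned to the source with the largest magnitude."""
    winner = source_mags.argmax(axis=0)  # (channels, freq, time)
    source_ids = np.arange(source_mags.shape[0]).reshape(-1, 1, 1, 1)
    return (source_ids == winner).astype(source_mags.dtype)

rng = np.random.default_rng(0)
mags = rng.random((2, 2, 5, 7))  # 2 sources, stereo, 5 bins, 7 frames
mixture = rng.standard_normal((2, 5, 7)) + 1j * rng.standard_normal((2, 5, 7))

# Masks broadcast against the complex mixture spectrogram, giving one
# complex estimate per source with the mixture phase.
soft_estimates = soft_mask(mags) * mixture
hard_estimates = binary_mask(mags) * mixture
print(soft_estimates.shape)  # (2, 2, 5, 7)
```

Because both kinds of mask sum to one over the source axis, the per-source estimates add back up to the mixture, which is a handy sanity check for any implementation.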