- Speech Commands from
torchaudio
, with35
class.
- Using AlexNet with Mel Spectrogram
1
channel. - Output is a softmax with
35
nodes (35
class).
- Edit your configuration in
conf/configs.yaml
- Train model using
python main.py -cp conf -cn configs
- You guys should install
PySoundFile
on windows orsox
on linux, for torchaudio I/O backend.