- Speech Commands from
torchaudio, with35class.
- Using AlexNet with Mel Spectrogram
1channel. - Output is a softmax with
35nodes (35class).
- Edit your configuration in
conf/configs.yaml - Train model using
python main.py -cp conf -cn configs
- You guys should install
PySoundFileon windows orsoxon linux, for torchaudio I/O backend.
