I’m trying to classify many keywords to build a speech command interface on an STM32. I first tried the 1D Conv model: training takes about 4 s/epoch, but the performance is very poor (about 60% accuracy). I’m therefore trying to increase the model complexity, for example by using the 2D Conv model.
Is there any way for me to speed up NN training?
Training with the 2D Conv model is slow (about 18 s/epoch), and I exceed my compute time limit. The model specifications are as follows:
- DSP: default MFCC
- number of classes: 18
- model: default 2D Conv