Question/Issue: I have an already existing project which uses CNNs in an ESP S3 for keyword spotting. Up to this point, I’ve been using the tensorflow libraries to compute the mel spectrogram. Since all the tensorflow libraries are already integrated within the project, I just swapped the neural networks hoping it would work.
While the mobilnets work, their accuracy is much lower than the training value. Some of the keywords get mixed up, and the results are much worse than I expected. From testing, I found that the mel spectrogram I get from tensorflow within the esp32 is not exactly like the one I get from Edge Impulse libraries:
Seeing this I tried to train the moblinet locally with the spectrograms coming from tensorflow lite, but accuracy went from 99% to 74%. Is this because the mobilnet is already pre-trained with features coming from tensorflow? Why is there such a difference from tensorflow to edge impulse? Are there standalone dsp libraries that I could integrate with my project?
Project ID: 265677
Context/Use case: Keyword spotting algorithm on the edge.