Using .tflite file in other project

Sina · October 17, 2022, 4:07pm

Hi,

Recently, I designed a KWS model and trained that model, the project is:
https://studio.edgeimpulse.com/studio/146655

Now that the trained model (.lite file) is created, I tried to take this model as a replacement to an existing ARM KWS model. I modified some specific parts of their code to match with my EdgeImpulse tflite model that I could successfully build the project with this replacement. However, I see different behavior. I was wondering how much the preprocessing step is different that the model is trained based on that and does not work well in other projects. I mean whether the MFCC calculation is different than others? or even other parts of preprocessing, such as shifting, normalization, and etc.
I would appreciate any feedback.

Thank you,
Sina

shawn_edgeimpulse · October 18, 2022, 7:59pm

Hi Sina,

Yes, the preprocessing step plays a big part in determining which features are sent to the neural network (i.e. prior to calling inference with the .lite file). To make sure that your custom preprocessing code is similar to what’s in Edge Impulse, I recommend using a known-good sample from your Edge Impulse project. If you go to the “MFCC” page, you can click on individual samples to see both the raw input and the raw MFCCs. Copy the raw features (time-series data that makes up the audio data) into your custom DSP code. Run it, and compare the output to the “processed features” that I’ve highlighted in red in the image. Try this for a few samples from different classes to make sure that the MFCCs are close.

Sina · October 24, 2022, 6:56pm

Hi Shawn,

Thank you for your comment and I tried to capture MFCC outputs in the ARM project and compare those with the one I see in Edge Impulse with the same input vector (taken from raw features). They are not the same as can be seen in the following shot:

I take that raw features and feed them into an ARM internal KWS model to see if it could spot the keyword correctly and seemingly it did correctly.
So, my question is why the MFCC is different from the one in other platforms? could someone help me realize the way Edge Impulse calculates MFCC?

Thank you in advance,
Sina

shawn_edgeimpulse · October 30, 2022, 2:22pm

Hi @Sina,

You can see the C++ SDK code for how Edge Impulse computes the MFCCs here: inferencing-sdk-cpp/feature.hpp at master · edgeimpulse/inferencing-sdk-cpp · GitHub. I recommend using that as a reference when comparing against your own MFCC code.

That being said, sometimes features don’t need to be exact (considering there are a different techniques used to compute the various parts of the MFCCs, like different FFT libraries). You just need to be “close enough” for KWS to work. Keep in mind that “close enough” is relative–the more you deviate from how EI computed MFCCs, the lower your accuracy will likely be.