The MFCC building block is using Mel-frequency cepstrum which is based on how humans hear.
Using this building block makes sense for human speech but I think for other things like door bells, faucets, showers a linear scale (or other scale) might make more sense. So is there a building block that is implementing the spectogram on a linear scale ?
I am especially asking this as the linear spectogram of my doorbell in audacity looks more “distinguishable” than the mel spectogram as you can see in below screenshot for the same audio fragment:
I am still thinking that MFCC is specifically designed for speech recognition. Which also means that it might not be the best tool to recognize other sound patterns. E.g. my doorbell shows a clear band around 9500 Herz which is beyond the range of typical speech analysis.
I think that for my case a feature extraction based on the linear spectogram scale (see audacity diagram above) would be a much better approach as I can easily recognize when the doorbell is ringing in these spectograms.
In terms of implementation I am wondering if the existing MFCC block can not be reused as starting point. In that case you basically need to remove steps and assure that the scale is linear instead of logarithmic.