This is sort of a two-part question.
We have a keyword recognition 'net that now seems to perform quite well. However, its Achille’s heel is background music. In the presence of any sort of background music the success rate falls to zero. It would seem to make sense to add background music during training but I don’t know if this is considered best-practice or a realistic approach.
If it is, would it be possible at some point to add background music or possibly just “custom” noise sources to the data augmentation option?