So far I am impressed by the Studio. User friendly, allows access to the python code for optimization, command line tools for the uploading of training and test data.
These suggestions reflect a small subset of features I usually have in my ML pipeline…
During Training Data Augmentation
- This could be adding random noise, random time shifts etc… We are dealing with impulse signals which can be as shorts as 10->30ms.
- Audio mixing, mixing different background sounds with the training data (That would require a background noise data set).
- Filters, low, high, band pass filters
- Sampling frequency of up to 48kHz. Frequencies of 8->16kHz provide quite important feature information for applications we are currently looking at.
The below suggestion is probably too much.
I guess you are familiar with Audacity and wavesurfer based tools. They are very useful for splitting up audio data for labelling. You have a great tool for data capture, display and playback. Combine that with a limited editing capability, such as audio file splitting and cutting. I think some companies might abandon developing their own in house tools, if they saw that capability.