1 min readFeb 22, 2020
That’s an interesting idea!
If I would go down that path, I would make sure all mel-spectrogram-images were generated using the sane parameters, including total length. Coming up an annotated data set might be a challenge as well, but I have a hunch that an appropriate data set exists somewhere out there…
If you ever get to try that please share your results! 🤓