
What will SunoAI Foundation Model be trained on?

Open hu-po opened this issue 1 year ago • 1 comments

It seems like the biggest hurdle for SunoAI is that the Encoder/Decoder is under a non-commercial license from Facebook. My guess is the team wants to train their own foundational AI model from scratch. What dataset will you use? For context, this is the dataset according to the Facebook paper:

image

hu-po avatar Apr 24 '23 14:04 hu-po

Perhaps https://www.robots.ox.ac.uk/~vgg/data/voxceleb/

It is CC BY 4.0

hu-po avatar Apr 24 '23 14:04 hu-po

a continuously growing mix of proprietary and open datasets :)

gkucsko avatar Apr 26 '23 01:04 gkucsko

@gkucsko Could you provide the list of open datasets you use?

vivasvan1 avatar Oct 07 '23 00:10 vivasvan1