audiolm-pytorch icon indicating copy to clipboard operation
audiolm-pytorch copied to clipboard

[feature request] add trained model with saved weights

Open amirgamil opened this issue 2 years ago • 5 comments

Would be great if the repo also included a trained version of this model with weights people can just download and use :)

amirgamil avatar Feb 04 '23 07:02 amirgamil

After the training is stable for Soundstream, I can share my weights for it :)

ckwdani avatar Feb 14 '23 12:02 ckwdani

@BlackFox1197 what corpus are you training on?

turian avatar Feb 14 '23 22:02 turian

@BlackFox1197 what corpus are you training on?

Im Training on Librispeach.

ckwdani avatar Feb 15 '23 06:02 ckwdani

@BlackFox1197 Would be wonderful if you would train on audioset since that is the most diverse corpus. Or FSD50K which is smaller but diverse. One concern I have is that soundstream reported poor results on music, as opposed to encodec

image

That's soundstream, you can see the music numbers (green) are all worse.

Versus encodec:

image

turian avatar Feb 15 '23 06:02 turian

@BlackFox1197 Are the model weights ready to go? :)

lancer1256 avatar May 17 '23 02:05 lancer1256