open-musiclm The question about training CLAP RVQ

The question about training CLAP RVQ

Open yangdongchao opened this issue 1 year ago • 4 comments

It seems that in the ClapRVQTrainer code, you donot use any gradient backward? How to understood this?

Jul 17 '23 12:07 yangdongchao

Hey, thanks for your interest in this repo! The RVQ parameters are updated via ema. See this relevant thread

Jul 17 '23 18:07 zhvng

Hey, thanks for your interest in this repo! The RVQ parameters are updated via ema. See this relevant thread

thanks for your reply. I want to ask how to judge the training is successful? When we can stop the training?

Jul 18 '23 01:07 yangdongchao

I stopped when the eval loss stopped going down. seems to work good enough.

Jul 18 '23 05:07 zhvng

I stopped when the eval loss stopped going down. seems to work good enough.

Thanks for your relpy. Lastly ,I want to ask, whether you only use audio to extract embedding and use RVQ to quantize it in the training proces. The text is only used in the inference stage?

Jul 19 '23 14:07 yangdongchao

open-musiclm open-musiclm copied to clipboard

The question about training CLAP RVQ

open-musiclm
open-musiclm copied to clipboard