open-musiclm icon indicating copy to clipboard operation
open-musiclm copied to clipboard

The question about training CLAP RVQ

Open yangdongchao opened this issue 1 year ago • 4 comments

It seems that in the ClapRVQTrainer code, you donot use any gradient backward? How to understood this?

yangdongchao avatar Jul 17 '23 12:07 yangdongchao

Hey, thanks for your interest in this repo! The RVQ parameters are updated via ema. See this relevant thread

zhvng avatar Jul 17 '23 18:07 zhvng

Hey, thanks for your interest in this repo! The RVQ parameters are updated via ema. See this relevant thread

thanks for your reply. I want to ask how to judge the training is successful? When we can stop the training?

yangdongchao avatar Jul 18 '23 01:07 yangdongchao

I stopped when the eval loss stopped going down. seems to work good enough.

zhvng avatar Jul 18 '23 05:07 zhvng

I stopped when the eval loss stopped going down. seems to work good enough.

Thanks for your relpy. Lastly ,I want to ask, whether you only use audio to extract embedding and use RVQ to quantize it in the training proces. The text is only used in the inference stage?

yangdongchao avatar Jul 19 '23 14:07 yangdongchao