ChatRWKV
ChatRWKV copied to clipboard
add text condition for gen music
The music cannot be controlled through text. It's better with text constraint like text2img, I think that.
it's possible by using such data to train / finetune the model