align_sd icon indicating copy to clipboard operation
align_sd copied to clipboard

When will the training code be released?

Open howardgriffin opened this issue 1 year ago • 12 comments

Nice work, but when will the training code be released? I'm hoping for it.

howardgriffin avatar Apr 12 '23 09:04 howardgriffin

Thank you for your interest in our work! Which part of code are you wishing for?

tgxs002 avatar Apr 12 '23 09:04 tgxs002

This work is really interesting. I'm interested in both 'training hpc classifier code' and 'training lora code'. I can't wait for this code being released.

howardgriffin avatar Apr 12 '23 09:04 howardgriffin

HPC training code can be ready soon. I can also upload the LoRA training code, but the training data construction is relatively complex, and may take considerable time for me to add comments / refactor the code / write the doc.

tgxs002 avatar Apr 12 '23 10:04 tgxs002

HPC training code can be ready soon. I can also upload the LoRA training code, but the training data construction is relatively complex, and may take considerable time for me to add comments / refactor the code / write the doc.

You can release the training code first if possible. I think this will be very helpful for some people. As for the training data construction, you can release it later when you are ready. Thank you for your nice work!

howardgriffin avatar Apr 12 '23 10:04 howardgriffin

Hi Howard, I have just uploaded the LoRA training code. Please note that it hasn't been sanitized, and lacks necessary training environment specifications. I will provide the full training guide when time is available, stay tuned!

tgxs002 avatar Apr 12 '23 14:04 tgxs002

Hi Blakey @tgxs002, Any updates on this please? Thank you very much

VigneshBaskar avatar May 08 '23 12:05 VigneshBaskar

@VigneshBaskar I'm preparing the training script for the classifier. It will be ready in a few days.

tgxs002 avatar May 10 '23 09:05 tgxs002

I have updated the instructions for adapting SD, please check it out! I will also release the training code of the preference classifier in a few days, maybe also with a much stronger preference classifier checkpoint.

tgxs002 avatar May 10 '23 11:05 tgxs002

Thanks, have been waiting for this

sachinnitw1317 avatar May 10 '23 17:05 sachinnitw1317

@VigneshBaskar I'm preparing the training script for the classifier. It will be ready in a few days.

Thank you very much @tgxs002. Please keep us posted. I am super excited for the code

VigneshBaskar avatar May 10 '23 17:05 VigneshBaskar

Is the code of reinforce learning part available now?

R2Bb1T avatar Aug 25 '23 03:08 R2Bb1T

Here it is: https://github.com/tgxs002/align_sd#training . @R2Bb1T

tgxs002 avatar Aug 25 '23 03:08 tgxs002