OpenChatKit
[feature] Do you support RLHF training?
After reading your code, I found that you don't support RLHF training yet. Your code is mainly about distributed training using pipeline and data parallelism. Do you plan to support RLHF training? Do you think it is necessary?
I like it.
@zhangce, what do you think?
It's impressive to see an open-source version of ChatGPT. I can only imagine how much work went into releasing this. Nevertheless, IMHO, based on my experience the current performance still leaves a lot to be done. I don't know the exact cause, but I suspect one factor is the lack of RLHF, assuming the quality and distribution of the instruction dataset and the pretrained language model (Neo-20B) can be trusted.