janoserdody
Results
1
issues of
janoserdody
### 🥰 Feature Description We are running an on-premise LLM and would like to further fine-tune the model using Reinforcement Learning from Human Feedback (RLHF). To facilitate this, we propose...
enhancement