human-feedback topic
LaMDA-rlhf-pytorch
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT, but with PaLM.
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
instructGOOSE
Implementation of Reinforcement Learning from Human Feedback (RLHF)
beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
ParroT
The ParroT framework enhances and regulates translation abilities during chat, building on open-source LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human-written translation and evaluation data.
trubrics-sdk
Product analytics for AI Assistants
d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
data-is-better-together
Let's build better datasets, together!
prism-alignment
The Prism Alignment Project