human-feedback topic

List human-feedback repositories

LaMDA-rlhf-pytorch

462
Stars
76
Forks
Watchers

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

PaLM-rlhf-pytorch

7.7k
Stars
668
Forks
94
Watchers

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

awesome-RLHF

3.3k
Stars
201
Forks
32
Watchers

A curated list of reinforcement learning with human feedback resources (continually updated)

instructGOOSE

168
Stars
20
Forks
Watchers

Implementation of Reinforcement Learning from Human Feedback (RLHF)

beavertails

105
Stars
3
Forks
Watchers

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

ParroT

166
Stars
24
Forks
Watchers

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

trubrics-sdk

129
Stars
25
Forks
Watchers

Product analytics for AI Assistants

d3po

162
Stars
14
Forks
Watchers

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

data-is-better-together

196
Stars
29
Forks
Watchers

Let's build better datasets, together!