human-feedback topic

List human-feedback repositories

LaMDA-rlhf-pytorch

460
Stars
76
Forks
Watchers

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

PaLM-rlhf-pytorch

7.6k
Stars
664
Forks
94
Watchers

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

awesome-RLHF

2.9k
Stars
184
Forks
32
Watchers

A curated list of reinforcement learning with human feedback resources (continually updated)

instructGOOSE

164
Stars
20
Forks
Watchers

Implementation of Reinforcement Learning from Human Feedback (RLHF)

beavertails

82
Stars
3
Forks
Watchers

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

ParroT

166
Stars
23
Forks
Watchers

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

trubrics-sdk

125
Stars
24
Forks
Watchers

Product analytics for AI Assistants

d3po

122
Stars
9
Forks
Watchers

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

data-is-better-together

156
Stars
26
Forks
Watchers

Let's build better datasets, together!