human-feedback topics

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

wxjiao

bloomz

chatgpt

contrastive

error-guided

trubrics-sdk

129

Stars

25

Forks

Watchers

Product analytics for AI Assistants

trubrics

human-feedback

llm

llmops

machine-learning

d3po

162

Stars

14

Forks

Watchers

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

yk7333

diffusion-models

human-feedback

reinforcement-learning

data-is-better-together

196

Stars

29

Forks

Watchers

Let's build better datasets, together!

huggingface

community

datasets

human-feedback

machine-learning

prism-alignment

32

Stars

1

Forks

Watchers

The Prism Alignment Project

HannahKirk

alignment

dataset

human-feedback

human-feedback-data