preference-learning topic

List preference-learning repositories

tournesol

314 stars, 46 forks

Free and open-source code of the https://tournesol.app platform. Meet the community on Discord: https://discord.gg/WvcSG55Bf3

magical

73 stars, 11 forks

The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)

SAN-NaviSTAR

47 stars, 5 forks

This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refe...

reward-bench

367 stars, 45 forks

RewardBench: the first evaluation tool for reward models.

metis

34 stars, 8 forks

Python-based GUI for collecting chemists' feedback on molecules

ICSFSurvey

137 stars, 3 forks

A comprehensive survey on Internal Consistency and Self-Feedback in Large Language Models.

prelude

19 stars, 0 forks

Aligning LLM Agents by Learning Latent Preference from User Edits

dice

34 stars, 1 fork

Official implementation of "Bootstrapping Language Models via DPO Implicit Rewards"