preference-learning topic

tournesol

Stars: 314, Forks: 46

Free and open source code of the https://tournesol.app platform. Meet the community on Discord https://discord.gg/WvcSG55Bf3

magical

Stars: 73, Forks: 11

The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)

SAN-NaviSTAR

Stars: 47, Forks: 5

This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refe...

reward-bench

Stars: 367, Forks: 45

RewardBench: the first evaluation tool for reward models.

metis

Stars: 53, Forks: 13, Watchers: 53

Python-based GUI for collecting chemists' feedback on molecules

ICSFSurvey

Stars: 171, Forks: 4, Watchers: 171

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

prelude

Stars: 19, Forks: 0

Aligning LLM Agents by Learning Latent Preference from User Edits

dice

Stars: 34, Forks: 1

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards