preference-learning topics

This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refe...

SMARTlab-Purdue

machine-learning

preference-learning

reinforcement-learning

robot-navigation

reward-bench

367

Stars

45

Forks


RewardBench: the first evaluation tool for reward models.

allenai

preference-learning

rlhf

metis

34

Stars

8

Forks


Python-based GUI for collecting chemists' feedback on molecules

JanoschMenke

de-novo-drug-design

drug-discovery

generative-ai

human-in-the-loop

ICSFSurvey

137

Stars

3

Forks


A comprehensive survey on Internal Consistency and Self-Feedback in Large Language Models.

IAAR-Shanghai

attention-head

chain-of-thought

data-augmentation

decoding

prelude

19

Stars

0

Forks


Aligning LLM Agents by Learning Latent Preference from User Edits

gao-g

alignment

edits

gpt4

human-feedback

dice

34

Stars

1

Forks


Official implementation of "Bootstrapping Language Models via DPO Implicit Rewards"

sail-sg

alignment

large-language-models

preference-learning

rlhf