Hao Sun
Results: 5 repositories owned by Hao Sun
PanelGPT
117
Stars
11
Forks
Watchers
We introduce new zero-shot prompting magic words that improve the reasoning ability of language models: panel discussion!
RewardShifting
26
Stars
2
Forks
Watchers
Code for the NeurIPS 2022 paper "Exploiting Reward Shifting in Value-Based Deep RL"
PCHID_code
15
Stars
0
Forks
Watchers
Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
Prompt-OIRL
28
Stars
5
Forks
Watchers
Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
RewardModelingBeyondBradleyTerry
70
Stars
4
Forks
70
Watchers
Official implementation of the ICLR'2025 paper "Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives"