rlhf topic

List rlhf repositories

argilla

3.8k
Stars
358
Forks
Watchers

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Open-Assistant

37.0k
Stars
3.2k
Forks
Watchers

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

awesome-RLHF

3.3k
Stars
201
Forks
Watchers

A curated list of reinforcement learning with human feedback resources (continually updated)

LLaMA-Factory

61.5k
Stars
7.4k
Forks
Watchers

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

alpaca_eval

1.5k
Stars
231
Forks
Watchers

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

WebGLM

1.6k
Stars
134
Forks
Watchers

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

ChatGLM-Efficient-Tuning

3.7k
Stars
471
Forks
Watchers

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

LLMSurvey

10.1k
Stars
795
Forks
Watchers

The official GitHub page for the survey paper "A Survey of Large Language Models".

Cornucopia-LLaMA-Fin-Chinese

582
Stars
61
Forks
Watchers

聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

log10

94
Stars
8
Forks
Watchers

Python client library for improving your LLM app accuracy