rlhf topics

argilla

3.8k

Stars

358

Forks

Watchers

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

argilla-io

active-learning

annotation-tool

artificial-intelligence

data-science

Open-Assistant

37.0k

Stars

3.2k

Forks

Watchers

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

LAION-AI

ai

assistant

chatgpt

discord-bot

awesome-RLHF

3.3k

Stars

201

Forks

Watchers

A curated list of reinforcement learning with human feedback resources (continually updated)

opendilab

deep-learning

deep-reinforcement-learning

human-feedback

large-language-models

LLaMA-Factory

62.8k

Stars

7.6k

Forks

62.8k

Watchers

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

hiyouga

bloom

fine-tuning

language-model

llama

alpaca_eval

1.5k

Stars

231

Forks

Watchers

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

tatsu-lab

deep-learning

evaluation

foundation-models

instruction-following

WebGLM

1.6k

Stars

134

Forks

Watchers

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

THUDM

chatgpt

llm

rlhf

webglm

ChatGLM-Efficient-Tuning

3.7k

Stars

471

Forks

Watchers

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

hiyouga

alpaca

chatglm

chatglm2

chatgpt

LLMSurvey

10.1k

Stars

795

Forks

Watchers

The official GitHub page for the survey paper "A Survey of Large Language Models".

RUCAIBox

chain-of-thought

chatgpt

in-context-learning

instruction-tuning

Cornucopia-LLaMA-Fin-Chinese

582

Stars

61

Forks

Watchers

聚宝盆(Cornucopia): 中文金融系列开源可商用大模型，并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

jerry1993-tech

chinese

finance

large-language-models

llama

log10

94

Stars

8

Forks

Watchers

Python client library for improving your LLM app accuracy

log10-io

agents

ai

artificial-intelligence

autonomous-agents