rlhf topic
argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets.
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.
awesome-RLHF
A curated list of reinforcement learning from human feedback (RLHF) resources (continually updated)
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
ChatGLM-Efficient-Tuning
Efficient fine-tuning of ChatGLM-6B with PEFT
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Cornucopia-LLaMA-Fin-Chinese
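Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, together with an efficient, lightweight framework for training vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.)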
log10
Python client library for improving the accuracy of your LLM app