alignment topic

List alignment repositories

agent-ci

353
Stars
44
Forks
353
Watchers

Deploy once. Continuously improve your AI agents in production.

csl

16
Stars
0
Forks
16
Watchers

Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts

realign

18
Stars
1
Forks
18
Watchers

Realign is a testing and simulation framework for AI applications.

activation-steering

129
Stars
22
Forks
129
Watchers

[ICLR 2025] General-purpose activation steering library

wfa

32
Stars
0
Forks
32
Watchers

Wavefront alignment algorithm (WFA) in Golang

WFGY

1.3k
Stars
107
Forks
1.3k
Watchers

WFGY 2.0. Semantic Reasoning Engine for LLMs (MIT). Fixes RAG/OCR drift, collapse & “ghost matches” via symbolic overlays + logic patches. Autoboot; OneLine & Flagship. ⭐ Star if you explore semantic...

24-Game-Reasoning

33
Stars
2
Forks
33
Watchers

超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of DeepSeek R1-Zero, DeepSeek R1

STAR-1

32
Stars
1
Forks
32
Watchers

[AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

ddro

35
Stars
3
Forks
35
Watchers

We introduce the direct document relevance optimization (DDRO) for training a pairwise ranker model. DDRO encourages the model to focus on document-level relevance during generation