alignment topic

List alignment repositories

magpie

Stars: 793, Forks: 69, Watchers: 793

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

awesome-representation-engineering

Stars: 44, Forks: 1

A resource repository for representation engineering in large language models

fuzzig

Stars: 16, Forks: 1

Fuzzy finder algorithms à la Smith-Waterman for Zig.

beta-DPO

Stars: 23, Forks: 0

[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$

filtered-dpo

Stars: 15, Forks: 1

Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lower-quality samples compared to those generated by the learning...

Jailbreak-In-Pieces

Stars: 77, Forks: 6, Watchers: 77

[ICLR 2024 Spotlight 🔥] - [Best Paper Award, SoCal NLP 2023 🏆] - Jailbreak in Pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

clip4dm

Stars: 25, Forks: 0, Watchers: 25

Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)

llms-resist-alignment

Stars: 38, Forks: 1, Watchers: 38

[ACL 2025 Best Paper] Language Models Resist Alignment

BBTools

Stars: 36, Forks: 2, Watchers: 36

BBTools: Official suite of fast, multithreaded bioinformatics tools for DNA/RNA analysis. BBMap aligner, BBDuk trimmer, BBMerge, and 90+ other tools. Actively maintained by Brian Bushnell.