notus
notus copied to clipboard

→

Metadata

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

Reame
Issues

Results 2 notus issues

Sort by recently updated

feat: add .ipynb for adding and updating models to `ollama`

davidberenstein1957

Curate UltraFeedack dataset's overall_score

Based on our curation efforts, we spotted a bug in the `overall_score` of UltraFeedback AI Critique score. TLDR: Responses getting the lowest score (1 or less) become a high score...

dvsrepo

team: ml

About

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

fine-tuning

zephyr

dpo

alignment-handbook

lm-alignment

trl

preference-data

159

Stars

Forks

Watchers

Owner

argilla-io

← Metadata

159

Stars

Forks

Watchers

Owner

argilla-io

Metadata

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

Back

notus notus copied to clipboard

Metadata

feat: add .ipynb for adding and updating models to `ollama`

Curate UltraFeedack dataset's overall_score

← Metadata

Owner

Metadata

notus
notus copied to clipboard