Summer

Results 5 issues of Summer

This is a potential plan for cleaning the Red Teaming data from Anthropics. - Step 1: Splitting data into Evil-Harmful and Harmless data. This is fairly easy. The Anthropics dataset...

data

add a demo for RankGen Classification as proposed in: https://github.com/LAION-AI/Open-Assistant/issues/382#issue-1519347873 Doesn't use https://github.com/anthropics/hh-rlhf just yet.

ml

This issue pertains to the ideas around watermarking Open Assistant generations via Linguistic Stenography. Linguistic Stenography is a field of active research however a few methods have emerged that make...

safety

Currently, the OA models have a fairly short window of context (Compared to other models). While efforts are underway to expand the context size, I suggest that we should use...

feature
ml