Summer issues

Repositories
Issues
Comments

Results 5 issues of


Summer

Cleaning data of Evil prompts for safer model training

This is a potential plan for cleaning the Red Teaming data from Anthropics. - Step 1: Splitting data into Evil-Harmful and Harmless data. This is fairly easy. The Anthropics dataset...

data

Add RankGen Classification

add a demo for RankGen Classification as proposed in: https://github.com/LAION-AI/Open-Assistant/issues/382#issue-1519347873 Doesn't use https://github.com/anthropics/hh-rlhf just yet.

using Linguistic Stenography in Open Assistant.

This issue pertains to the ideas around watermarking Open Assistant generations via Linguistic Stenography. Linguistic Stenography is a field of active research however a few methods have emerged that make...

safety

Add SciQ dataset

data

Using Hidden Engrams for long context

Currently, the OA models have a fairly short window of context (Compared to other models). While efforts are underway to expand the context size, I suggest that we should use...

feature