thu-coai

Results 40 repositories owned by thu-coai

BPO

241
Stars
14
Forks
Watchers

CharacterGLM-6B

249
Stars
18
Forks
Watchers

CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

COLDataset

161
Stars
15
Forks
Watchers

The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection

CritiqueLLM

96
Stars
0
Forks
Watchers

DiaSafety

22
Stars
2
Forks
Watchers

This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark

PICL

101
Stars
4
Forks
Watchers

Code for ACL2023 paper: Pre-Training to Learn in Context

SafetyBench

81
Stars
3
Forks
Watchers

Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety.

ShieldLM

41
Stars
0
Forks
Watchers

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

TaiLr

18
Stars
0
Forks
Watchers

ICLR2023 - Tailoring Language Generation Models under Total Variation Distance

Targeted-Data-Extraction

16
Stars
0
Forks
Watchers

Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation"