PKU-Alignment
omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
AlignmentSurvey
AI Alignment: A Comprehensive Survey
safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
ProAgent
ProAgent: Building Proactive Cooperative Agents with Large Language Models
SafeDreamer
ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models
safe-sora
SafeSora is a human preference dataset designed to support safety alignment research in text-to-video generation, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).
align-anything
Align Anything: Training All-modality Model with Feedback