Costa Huang

Results 7 repositories owned by Costa Huang

cleanrl

4.6k
Stars
544
Forks
Watchers

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

portwarden

552
Stars
31
Forks
Watchers

Create Encrypted Backups of Your Bitwarden Vault with Attachments

gym-microrts-paper

37
Stars
3
Forks
Watchers

The source code for the gym-microrts paper.

invalid-action-masking

120
Stars
18
Forks
Watchers

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

a2c_is_a_special_case_of_ppo

17
Stars
2
Forks
Watchers

A2C is a special case of PPO!

cleanba

87
Stars
8
Forks
Watchers

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

lm-human-preference-details

101
Stars
3
Forks
Watchers