Sang Michael Xie

Results: 2 repositories owned by Sang Michael Xie

giraffe

Stars: 32
Forks: 11

Learning from Expert Data, Approximate IRL, and TD-Leaf for Deep Reinforcement Learning Chess, built on the recent Giraffe engine

doremi

Stars: 254
Forks: 31

PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets