Sang Michael Xie
Results
2
repositories owned by
Sang Michael Xie
giraffe
32
Stars
11
Forks
Watchers
Learning from Expert Data, Approximate IRL, and TD-Leaf for Deep Reinforcement Learning Chess, built on the recent Giraffe engine
doremi
254
Stars
31
Forks
Watchers
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets