Wesley Tansey
Wesley Tansey
hrt
The holdout randomization test: feature selection using black box predictive models
nurse-scheduling
A nurse scheduling app to help mental health nurses with daily staffer assignments
pycfr
A python implementation of Counterfactual Regret Minimization for poker
rl-tictactoe
A reinforcement learning agent for tic-tac-toe. Implements the example from Chapter 1 of Sutton and Barto.
sdp
Deep nonparametric estimation of discrete conditional distributions via smoothed dyadic partitioning
td_cfr
An implementation of Counterfactual Regret Minimization (CFR) via Temporal Difference (TD) learning
tstd0
An experiment with Thompson sampling and TD(0) on a grid world variant