bandit-algorithm topics

privacy-preserving-bandits

22 stars, 7 forks

Privacy-Preserving Bandits (MLSys'20)

mmalekzadeh

bandit-algorithm

bandit-algorithms

bandit-learning

contextual-bandits

thompson

45 stars, 18 forks

Thompson Sampling Tutorial

andrecianflone

bandit

bandit-algorithm

reinforcement-learning

thompson-sampling

thomas

23 stars, 8 forks

Another A/B test library

iheartradio

ab-testing

bandit

bandit-algorithm

bandits

LinUCB

28 stars, 11 forks

LinUCB (Linear Upper Confidence Bound), a contextual bandit algorithm proposed by Li, Chu, Langford and Schapire

thunfischtoast

bandit-algorithm

bandit-learning

contextual-bandits

java

ReinforcementLearning_Sutton-Barto_Solutions

19 stars, 4 forks

Solutions and figures for problems from Reinforcement Learning: An Introduction by Sutton & Barto

raklokesh

bandit-algorithm

batch-update

blackjack-montecarlo

dynaq