bandit-algorithm topic

List bandit-algorithm repositories

thomas

23
Stars
8
Forks
Watchers

Another A/B test library

LinUCB

28
Stars
11
Forks
Watchers

Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire

Solutions and figures for problems from Reinforcement Learning: An Introduction Sutton&Barto