thompson

Published 20 hours ago


Thompson Sampling Tutorial


Thompson Sampling

Thompson Sampling is a Bayesian approach to multi-armed bandits. This notebook reviews the theory, then walks through my implementation and some experiments. The experiments should give you a good understanding of how Thompson Sampling behaves in comparison to epsilon-greedy and UCB. To run the notebook online, click this link and open with Colab.

For a more extensive review of the theory, check out A Tutorial on Thompson Sampling by Russo et al., 2017.
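To make the idea concrete before opening the notebook, here is a minimal sketch of Thompson Sampling on a simulated Bernoulli bandit. It is not the notebook's implementation: the function name `thompson_sampling`, the arm probabilities, and the uniform Beta(1, 1) prior are all assumptions chosen for illustration. Each round, the agent samples a plausible success rate for every arm from its Beta posterior, pulls the arm with the highest sample, and updates that arm's posterior with the observed reward.

```python
import random


def thompson_sampling(true_probs, n_rounds, seed=0):
    """Run Beta-Bernoulli Thompson Sampling on a simulated bandit.

    true_probs: hypothetical success probability of each arm
                (used only to simulate rewards, never read by the agent).
    Returns (total_reward, pulls_per_arm).
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    # Assumed Beta(1, 1) prior per arm:
    # alpha = successes + 1, beta = failures + 1.
    alpha = [1] * n_arms
    beta = [1] * n_arms
    pulls = [0] * n_arms
    total_reward = 0

    for _ in range(n_rounds):
        # Draw one plausible success rate per arm from its posterior.
        samples = [rng.betavariate(alpha[a], beta[a]) for a in range(n_arms)]
        # Act greedily with respect to the sampled rates.
        arm = max(range(n_arms), key=lambda a: samples[a])
        # Simulate a Bernoulli reward from the environment.
        reward = 1 if rng.random() < true_probs[arm] else 0
        # Conjugate update of the chosen arm's posterior.
        alpha[arm] += reward
        beta[arm] += 1 - reward
        pulls[arm] += 1
        total_reward += reward

    return total_reward, pulls


if __name__ == "__main__":
    reward, pulls = thompson_sampling([0.2, 0.5, 0.8], n_rounds=2000)
    print("total reward:", reward)
    print("pulls per arm:", pulls)
```

Because exploration comes from posterior uncertainty rather than a fixed exploration rate, the pull counts should concentrate on the best arm as evidence accumulates, which is the behaviour the experiments contrast with epsilon-greedy and UCB.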

About

Thompson Sampling Tutorial

Topics: reinforcement-learning, bandit, thompson-sampling, bandit-algorithm

Stars: 45

Forks: 18

Owner: andrecianflone
