contextual-bandits topic

List contextual-bandits repositories

LinUCB

28
Stars
11
Forks
Watchers

Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire

sinkhorn-policy-gradient.pytorch

38
Stars
10
Forks
Watchers

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

FairMachineLearning

15
Stars
4
Forks
Watchers

Implementation of provably Rawlsian fair ML algorithms for contextual bandits.

blocks

40
Stars
14
Forks
Watchers

Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)

MiniVox

27
Stars
5
Forks
Watchers

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

banditml

67
Stars
10
Forks
Watchers

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

implement basic and contextual MAB algorithms for recommendation system

python-ranker

21
Stars
1
Forks
Watchers

Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions

Neural-Thompson-Sampling

20
Stars
5
Forks
Watchers

Study of the paper 'Neural Thompson Sampling' published in October 2020