contextual-bandits topics

LinUCB

28 Stars

11 Forks

LinUCB (Linear Upper Confidence Bound), a contextual bandit algorithm proposed by Li, Chu, Langford, and Schapire

thunfischtoast

bandit-algorithm

bandit-learning

contextual-bandits

java

sinkhorn-policy-gradient.pytorch

38 Stars

10 Forks

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

pemami4911

combinatorial-optimization

contextual-bandits

deep-learning

permutation-algorithms

FairMachineLearning

15 Stars

4 Forks

Implementation of provably Rawlsian fair ML algorithms for contextual bandits.

jtcho

contextual-bandits

jupyter

machine-learning

multi-armed-bandits

blocks

40 Stars

14 Forks

Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)

lil-lab

contextual-bandits

machine-learning

natural-language-processing

natural-language-understanding

MiniVox

27 Stars

5 Forks

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

doerlbh

acml

bandit-algorithms

contextual-bandits

interspeech

banditml

67 Stars

10 Forks

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

banditml

bandits

contextual-bandits

neural-networks

personalization

multi-armed-bandits-for-recommendation-systems

32 Stars

8 Forks

Implements basic and contextual multi-armed bandit (MAB) algorithms for recommendation systems

Heewon-Hailey

contextual-bandits

epsilon-greedy

matplotlib

multiarmed-bandits

python-ranker

21 Stars

1 Fork

Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions

improve-ai

ab-testing

ai

contextual-bandits

improve-ai

Neural-Thompson-Sampling

20 Stars

5 Forks

Study of the paper 'Neural Thompson Sampling' published in October 2020

RonyAbecidan

contextual-bandits

multi-armed-bandits

neural-network

neural-tangent-kernel