banditpylib

time-varying signals

Open htcml opened this issue 5 years ago • 3 comments

Can you add MAB algorithms that can handle time-varying signals? Perhaps non-stationary MAB algorithms are meant for this purpose?

htcml avatar Mar 03 '20 19:03 htcml

I can implement one of the exponentially weighted methods, such as EXP3 or a variant. There are also simple modifications of existing algorithms, such as sliding-window UCB, or any of the adversarial strategies. I think the main part of the implementation will be the environment.
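For illustration only, here is a minimal sketch of what sliding-window UCB and a time-varying environment could look like. It is not banditpylib code: the names `SlidingWindowUCB` and `simulate_switching_bandit`, the exploration constant `xi=0.6`, and the two-arm Bernoulli environment with a single change point are all assumptions made for this example.

```python
import math
import random
from collections import deque


class SlidingWindowUCB:
    """UCB whose statistics only use the most recent `window` pulls."""

    def __init__(self, n_arms, window, xi=0.6):
        self.n_arms = n_arms
        self.window = window
        self.xi = xi  # exploration constant (assumed value, tune as needed)
        self.history = deque()  # (arm, reward) pairs currently in the window
        self.counts = [0] * n_arms  # pulls per arm inside the window
        self.sums = [0.0] * n_arms  # reward sums per arm inside the window

    def select_arm(self):
        # Play any arm that has no observation inside the current window.
        for arm in range(self.n_arms):
            if self.counts[arm] == 0:
                return arm
        t = len(self.history)  # equals min(rounds played, window)

        def index(arm):
            mean = self.sums[arm] / self.counts[arm]
            bonus = math.sqrt(self.xi * math.log(t) / self.counts[arm])
            return mean + bonus

        return max(range(self.n_arms), key=index)

    def update(self, arm, reward):
        self.history.append((arm, reward))
        self.counts[arm] += 1
        self.sums[arm] += reward
        # Forget the oldest observation once the window is full.
        if len(self.history) > self.window:
            old_arm, old_reward = self.history.popleft()
            self.counts[old_arm] -= 1
            self.sums[old_arm] -= old_reward


def simulate_switching_bandit(horizon=4000, change_at=2000, window=400, seed=0):
    """Hypothetical environment: the two Bernoulli arms swap means once."""
    rng = random.Random(seed)
    learner = SlidingWindowUCB(n_arms=2, window=window)
    late_pulls = [0, 0]  # pulls counted once the window has passed the change
    for t in range(horizon):
        means = [0.9, 0.1] if t < change_at else [0.1, 0.9]
        arm = learner.select_arm()
        reward = 1.0 if rng.random() < means[arm] else 0.0
        learner.update(arm, reward)
        if t >= change_at + window:
            late_pulls[arm] += 1
    return late_pulls


if __name__ == "__main__":
    print(simulate_switching_bandit())
```

The window length trades off adaptation speed against estimation noise: a short window reacts quickly to a change but keeps fewer samples per arm.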

choltz95 avatar Mar 03 '20 20:03 choltz95

Just pointing this out for your reference: Daniel Russo outlines his approach to non-stationary systems on p. 43, Section 6.3, of this tutorial: https://web.stanford.edu/~bvr/pubs/TS_Tutorial.pdf
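As a rough sketch of that idea for Bernoulli rewards: before each update, every arm's Beta parameters are pulled back toward the prior by a factor `gamma`, so old observations gradually lose influence. This is my paraphrase of the approach rather than a copy of the tutorial's pseudocode, and the class name `DiscountedBetaTS`, the default `gamma=0.01`, and the Beta(1, 1) prior are assumptions.

```python
import random


class DiscountedBetaTS:
    """Thompson sampling for Bernoulli arms with posteriors that decay to the prior."""

    def __init__(self, n_arms, gamma=0.01, prior_alpha=1.0, prior_beta=1.0):
        self.gamma = gamma  # forgetting rate in (0, 1]; assumed default
        self.prior_alpha = prior_alpha
        self.prior_beta = prior_beta
        self.alpha = [prior_alpha] * n_arms
        self.beta = [prior_beta] * n_arms

    def select_arm(self, rng):
        # Sample a plausible mean for each arm and play the largest sample.
        samples = [rng.betavariate(a, b) for a, b in zip(self.alpha, self.beta)]
        return max(range(len(samples)), key=samples.__getitem__)

    def update(self, arm, reward):
        g = self.gamma
        # Every arm forgets a little, whether or not it was played.
        for k in range(len(self.alpha)):
            self.alpha[k] = (1 - g) * self.alpha[k] + g * self.prior_alpha
            self.beta[k] = (1 - g) * self.beta[k] + g * self.prior_beta
        # Only the played arm receives the new observation (reward in {0, 1}).
        self.alpha[arm] += reward
        self.beta[arm] += 1 - reward


if __name__ == "__main__":
    rng = random.Random(0)
    learner = DiscountedBetaTS(n_arms=2)
    for t in range(2000):
        means = [0.8, 0.2] if t < 1000 else [0.2, 0.8]
        arm = learner.select_arm(rng)
        learner.update(arm, 1.0 if rng.random() < means[arm] else 0.0)
    print(learner.alpha, learner.beta)
```

With this update, the evidence an arm can accumulate is capped at roughly `1 / gamma` observations, so a larger `gamma` adapts faster but explores more.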

htcml avatar Mar 07 '20 01:03 htcml

EXP3 is now implemented for the ordinary multi-armed bandit.

Alanthink avatar Apr 22 '21 21:04 Alanthink