Maxime RICHE issues

Results 7 issues of


Maxime RICHE

Why deleting the content ?

Can I ask why have you deleted this content ? A legal problem ? (I implemented it too...)

Set of hyper-parameters to reproduce LOLA DICE

Are the current default hyper-parameters the one used to produce the results of the [DICE paper](https://arxiv.org/pdf/1802.05098.pdf)? Current default HP are (from [scripts/run_lola_dice.py](https://github.com/alshedivat/lola/blob/master/scripts/run_lola_dice.py)): ``` batch-size=64 runs=5 epochs=200 use_dice=True gamma=.96, lr_inner=.1, lr_outer=.2,...

Possible error in reported confidence interval used in the DICE paper

In the notebook [notebooks/dice/analysis.ipynb](https://github.com/alshedivat/lola/blob/master/notebooks/dice/analysis.ipynb) which is used to analyse the results and reproduce the fig.5 from the paper [DiCE: The Infinitely Differentiable Monte Carlo Estimator](https://arxiv.org/pdf/1802.05098.pdf), the confidence interval used is...

Maxime RICHE

Why deleting the content ?

Set of hyper-parameters to reproduce LOLA DICE

Possible error in reported confidence interval used in the DICE paper

Player blue and red are not currently symmetrical

Upgrade to RLLib 1.3 and improvements around amTFT

Upgrade to use RLLib master (almost v1.3).

Add quick and long end-to-end test for alt_offers