lola
lola copied to clipboard
Set of hyper-parameters to reproduce LOLA DICE
Are the current default hyper-parameters the one used to produce the results of the DICE paper? Current default HP are (from scripts/run_lola_dice.py):
batch-size=64
runs=5
epochs=200
use_dice=True
gamma=.96,
lr_inner=.1,
lr_outer=.2,
lr_value=.1,
lr_om=.1,
inner_asymm=True,
n_agents=2,
n_inner_steps=2,
value_batch_size=16,
value_epochs=0,
om_batch_size=16,
om_epochs=0,
use_baseline=False,
Or should we use the default from lola_dice/rpg.py?
epochs=100,
gamma=.96,
lr_inner=1., # lr for the inner loop steps
lr_outer=1., # lr for the outer loop steps
lr_value=.1, # lr for the value function estimator
lr_om=.1, # lr for opponent modeling
n_agents=2,
n_inner_steps=1,
inner_asymm=True,
om_batch_size=64, # batch size used for fitting opponent models
om_epochs=5, # epochs per iteration to fit opponent models
value_batch_size=64, # batch size used for fitting the values
value_epochs=5, # epochs per iteration to fit value functions
use_baseline=True,
use_dice=True,
use_opp_modeling=False,