adaptive-transformers-in-rl
adaptive-transformers-in-rl copied to clipboard
Isn't param "--use_gate" important for Pong?
Hey, in the paper StablizingTransformer..., there is a gate unit in the moudle of the GTrxl, but in your default params, the --use_gate is False. Why?