R Devon Hjelm

53 comments by R Devon Hjelm

After adding some prints, it does appear that what I did sends the data correctly to the TF logger, but I'm not sure why it's not writing.

OK, I got it working with this factory:

```
def make_logger(label, steps_key, i):
    terminal_logger = TerminalLogger(label=label, print_fn=logging.info)
    tb_logger = TFSummaryLogger(summarydir, label=label, steps_key=steps_key)
    serialize_fn = base.to_numpy
    logger = aggregators.Dispatcher([terminal_logger, tb_logger], serialize_fn)
    ...
```
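
For readers unfamiliar with the pattern, here is a minimal, library-free sketch of what a dispatching logger of this kind does: serialize the values once, then forward them to every child logger. `ListLogger`, `FanOutLogger`, and `make_toy_logger` are hypothetical names invented for illustration; this is not Acme's implementation, and the child loggers just record what they receive instead of printing or writing TensorBoard summaries.

```python
from typing import Any, Callable, Dict, List, Optional, Sequence


class ListLogger:
    """Hypothetical stand-in for a terminal or TensorBoard logger."""

    def __init__(self, label: str) -> None:
        self.label = label
        self.records: List[Dict[str, Any]] = []

    def write(self, values: Dict[str, Any]) -> None:
        # A real logger would print or write a summary; this one just stores a copy.
        self.records.append(dict(values))


class FanOutLogger:
    """Hypothetical dispatcher: serialize once, then write to every child."""

    def __init__(
        self,
        loggers: Sequence[ListLogger],
        serialize_fn: Optional[Callable[[Dict[str, Any]], Dict[str, Any]]] = None,
    ) -> None:
        self._loggers = list(loggers)
        self._serialize_fn = serialize_fn

    def write(self, values: Dict[str, Any]) -> None:
        if self._serialize_fn is not None:
            values = self._serialize_fn(values)
        for logger in self._loggers:
            logger.write(values)


def to_floats(values: Dict[str, Any]) -> Dict[str, float]:
    """Toy serializer (assumption): convert every value to a plain float."""
    return {key: float(value) for key, value in values.items()}


def make_toy_logger(label: str):
    """Builds one dispatcher over two children, mirroring the factory's shape."""
    terminal = ListLogger(label)
    summary = ListLogger(label)
    return FanOutLogger([terminal, summary], to_floats), terminal, summary
```

If one child receives the values and the other does not, the problem is in that child's own `write` rather than in the dispatch step, which is one way to narrow down a logger that silently writes nothing.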

I'm running into this as well and it's not clear what the fix should be

Problem is the table creation. Should be:

```
replay_buffer = reverb.Table(
    name=adders.DEFAULT_PRIORITY_TABLE,
    max_size=1000000,
    remover=reverb.selectors.Fifo(),
    sampler=reverb.selectors.Uniform(),
    rate_limiter=reverb.rate_limiters.MinSize(min_size_to_sample=1),
    signature=(tf.TensorSpec(2),
               tf.TensorSpec(1),
               tf.TensorSpec(None),
               tf.TensorSpec(None),
               tf.TensorSpec(2)))
```
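
To make the table arguments concrete, here is a rough pure-Python sketch of the behaviour they ask for: a bounded buffer that drops the oldest item when full (FIFO remover), samples uniformly at random (uniform sampler), and refuses to sample until it holds a minimum number of items (min-size rate limiter). `ToyTable` is a hypothetical class written for illustration and does not use Reverb. One deliberate simplification: a real replay server would typically make the sampler wait until enough data arrives, while this toy raises an error instead.

```python
import random
from collections import deque
from typing import Any, List, Optional


class ToyTable:
    """Hypothetical in-memory table: FIFO removal, uniform sampling, min-size gate."""

    def __init__(self, max_size: int, min_size_to_sample: int = 1,
                 seed: Optional[int] = None) -> None:
        # A deque with maxlen silently discards the oldest item once it is full.
        self._items: deque = deque(maxlen=max_size)
        self._min_size_to_sample = min_size_to_sample
        self._rng = random.Random(seed)

    def insert(self, item: Any) -> None:
        self._items.append(item)

    def items(self) -> List[Any]:
        return list(self._items)

    def can_sample(self) -> bool:
        return len(self._items) >= self._min_size_to_sample

    def sample(self) -> Any:
        if not self.can_sample():
            # Simplification: raise instead of blocking until data arrives.
            raise RuntimeError("not enough items to sample yet")
        index = self._rng.randrange(len(self._items))
        return self._items[index]
```

With `min_size_to_sample=1`, sampling becomes possible as soon as the first item has been inserted, which is why a step-by-step tutorial can start training right after the first environment steps.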

The solution above removes that error, but it fails later in the tutorial when stepping through the trainer. I believe these are the correct buffer args: ```...

There's no information whatsoever on the site where the ROMs come from and it's ambiguous whether Atari can (or will) exercise their IP rights when you download the ROMs.

I would check out Ben Poole's paper, which does a little more analysis of the estimation part: http://proceedings.mlr.press/v97/poole19a.html

Apologies, I gave up on cortex on pip a while ago. It can be installed from: https://github.com/rdevon/cortex

Sorry for the delay (busy with other things). Are you still trying to figure out how to do this?