sunchipsster1
Hi guys! Was trying out the awesome Temporal value transport code you guys kindly provided at https://github.com/deepmind/deepmind-research/tree/master/tvt. While I got matching results to the Key-to-door --> with last phase reward...
Hi Aurelien, Thanks a lot for your book! It has been amazing. I have a question pertaining to policy gradients (Chapter 16 -- reinforcement learning). In the implementation, a policy...
Hello! I am very interested in looking into the architecture of the DQNs. I opened up the file the Dopamine group kindly provided in Discrete_Domains --> atari_lib.py which defines the...
Hello! Thank you so much for putting up this beautiful work! After training on the args.txt configuration, I proceeded to evaluate as per the instructions. I obtained the following error....
Hello, hope you are well, and thank you so much for writing this awesome resource! I have a question about the training procedure. When I ran it in command prompt...
Hello and happy 2020! I am trying to run A3C-Meta-Bandit and run into the error below. All the cells are running fine, except for the final cell of the Python...
Hello! I am currently running the task GoToRedBallGrey with the settings bow_endpool_res. According to the BabyAI paper, the observations should be 7x7x3 showing the pixels in front. However, here are...
Hello! Thank you so much for putting up this valuable resource! I was wondering if I may ask for some kind advice about replicating the results, which I have been...
Hi guys, I was building d4rl by the book. I ran: And obtained this error: I looked through the previous issues to see if anyone else had encountered this...