Matheus M. Centa
Hello! Hopefully, I can answer these questions :) 1. REINFORCE with Baseline can use any state-dependent baseline (you don't even have to use advantages, you can use the Q values...
I was thinking about recording this metadata by using Python's `logging` library since it has all the features we need. I also noticed today that you might need to fork...
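A minimal sketch of what recording run metadata with Python's standard `logging` module could look like. The helper name `make_metadata_logger`, the logger name, and the `seed` field are hypothetical and only serve as illustration; the comment above does not say which metadata is meant.

```python
import io
import logging


def make_metadata_logger(name, stream):
    """Build a logger that writes one formatted line per record to `stream`."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.propagate = False  # keep records out of the root logger's handlers
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter("%(levelname)s %(name)s %(message)s"))
    logger.addHandler(handler)
    return logger


# Usage: write to an in-memory buffer here; a file object works the same way.
buffer = io.StringIO()
metadata_logger = make_metadata_logger("run_metadata", buffer)
metadata_logger.info("seed=%d", 42)  # hypothetical metadata field
```

Swapping the stream for a `logging.FileHandler` would persist the same records to disk without changing the call sites.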
You're right, I hadn't thought about the use case of debugging agents. Currently, we have: - `benchmarks`: these seem to be toy problems for exploration and generalization, which I think...
> Thanks for working on this! It would be really helpful if you could document the new argument's usage in the README. Also, can we test it by:...
> For dimension match: yeah, I think we should assert that the loaded vectors have the same dimension as `--representation_size`, otherwise just abort the program. I was thinking about disabling...
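The dimension check described in the quote could be sketched roughly as follows. This assumes the pre-trained vectors are already loaded into a dict mapping node names to lists of floats; the function names `check_embedding_dim` and `load_or_abort` are hypothetical, and `representation_size` stands in for the value passed via `--representation_size`.

```python
import sys


def check_embedding_dim(embeddings, representation_size):
    """Raise ValueError if any vector's length differs from representation_size."""
    for node, vector in embeddings.items():
        if len(vector) != representation_size:
            raise ValueError(
                f"vector for node {node!r} has dimension {len(vector)}, "
                f"expected {representation_size}"
            )


def load_or_abort(embeddings, representation_size):
    """Return the embeddings unchanged, or exit non-zero on a dimension mismatch."""
    try:
        check_embedding_dim(embeddings, representation_size)
    except ValueError as err:
        sys.exit(f"error: {err}")  # prints the message to stderr, exit status 1
    return embeddings
```

Keeping the check separate from the abort makes the validation testable on its own, while the command-line entry point decides how to fail.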
> Also, how do we handle the case when vocabulary in the pre-trained embedding does not match the list of graph nodes? I'm not sure. I thought that it would...
Thanks for taking the time to help me out! I am kind of taking some time to study for my finals right now, but I will be back soon to...
I'm back from finals and vacation! I just implemented two of the improvements we talked about, and I wanted your opinion on this next one: the way the code is...
Hello, I just ran into this inconsistency while implementing a project this week and we basically copied the code from the FeedForwardPolicy from DQN to adapt the code for our...
@Miffyli Now that I read the issue a second time, I don't think it is worth it to change the behavior of the `layers` parameter now that v3 is on...