Ada Lovelace comments

Results 9 comments of


                                            Ada Lovelace

Making new predictions

The way this code is written you would not be able to access the model outside of the class object. There would be many ways we could about this but...

missing the initialization of target action value and refreshing the Qhat

1. There is a function updating the target model. Does this answer your question? 2. How about, state = [[a, b, c]] ?

model save and load does not work

ddqn is double here since there is no dueling implementation in the repo. The hyperparameters affect the performance of a model greatly. When comparing implementations please make sure to fix...

should update the weight every time step ?

1. Loading just the weights of the model then sampling from a new set of experiences may be leading to divergence. 2. How about saving after each episode? How about...

Would it make sense to restrict the action to what's possible?

The action shouldn't be restricted since it's the goal of the agent to learn what action to take for the most reward.

Predict the action for new environment - Inference

You would use the action associated with the most reward that is returned when you feed the state into the model.

Why are we training the neural network for only 1 epoch

The behavior we want from the Neural Network is to give us a similar output when given a similar state. As for the epoch, the model is constantly retrained after...

callbacks in autoencoder

Did this ever make it to a release?

CPU version with MKL

@dwSun @danqing I would also like to know the result of this as well. My 1060 is looking to be twice as fast than my Xeon Phi on a different...