Jason Liang

Results 3 issues of Jason Liang

Hi, If I have a policy (a tensorflow model) that has been trained using some other RL method, is there way for me to initalize ddpg with this policy?

Given a trained Seq2Seq (with 1 encoder and 1 decoder layer) model and an input sequence, how can I get the hidden state/vector of the encoder layer immediately after feeding...

Which one would you recommend to use to get the best possible performance?