Mehdi Mashayekhi
Mehdi Mashayekhi
can somebody please explain what are these parameters here https://github.com/huggingface/pytorch-openai-transformer-lm/blob/561d4096be7f66a49b7b989eff09e2ab6ba54bb7/model_pytorch.py#L303, e.g., offsets, init parameters, can you add some comments to this function?, thanks
Hi, quick question, don't we need to feed 'state' in decoder_body instead of 'output'? here in this line https://github.com/andrejonasson/dynamic-coattention-network-plus/blob/5182d91b2ff3707f9cafb308bf81f8bdd8bf5843/question_answering/networks/dcn_plus.py#L402 update: I run a toy experiment, output and state are the...
Hi, Thanks for sharing. I was wondering if you can explain why do we need two calls for ````apply_policy```` in the ````can_gru_policy_dynamics.py````, here https://github.com/openai/random-network-distillation/blob/f75c0f1efa473d5109d487062fd8ed49ddce6634/policies/cnn_gru_policy_dynamics.py#L69 and here https://github.com/openai/random-network-distillation/blob/f75c0f1efa473d5109d487062fd8ed49ddce6634/policies/cnn_gru_policy_dynamics.py#L83 Also, I have...
Hi, thanks for sharing this. I just noticed that the loss function used here https://github.com/gd-zhang/ACKTR/blob/9d61318117672262c78c06a976abf3cd47a54bd6/models/model.py#L93 is different than the loss function used to create fisher matrix in the original paper...