pytorch-openai-transformer-lm

Confused by multiplying both clf_logits and clf_losses by n?

Open Vimos opened this issue 6 years ago • 3 comments

I am confused by the code below.

https://github.com/huggingface/pytorch-openai-transformer-lm/blob/eafc28abdfadfa0732f03a0fc65805c5bfb2ffe7/train.py#L52

https://github.com/huggingface/pytorch-openai-transformer-lm/blob/eafc28abdfadfa0732f03a0fc65805c5bfb2ffe7/train.py#L54

Is this due to any normalization? Thanks!

Sep 04 '18 08:09 Vimos

Yes, I don't understand that myself either. It was in the original TensorFlow code so I kept it as is. I have removed it in a fork of this code that I use for another project and it doesn't really change the results. I may try to dig further one day.

Sep 04 '18 09:09 thomwolf
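
A note on the arithmetic behind the question: the linked lines are not reproduced in this thread, so the sketch below does not show the repository's code. It only illustrates two general facts under stated assumptions. First, scaling logits by a positive batch size `n` leaves the argmax prediction unchanged (though it would change any loss computed from the scaled logits). Second, if a loss is a per-batch mean, multiplying it by `n` gives a per-batch sum, which is what you need to average correctly over batches of unequal size. Whether either is the original intent is an assumption. All helper names here (`argmax`, `scale`, `weighted_mean_loss`) are hypothetical.

```python
# Hypothetical stand-ins; the thread does not reproduce the linked lines.

def argmax(row):
    # Index of the largest value in a list of floats.
    return max(range(len(row)), key=row.__getitem__)

def scale(rows, n):
    # Multiply every logit by n, as in "clf_logits *= n".
    return [[v * n for v in row] for row in rows]

def weighted_mean_loss(batch_means, batch_sizes):
    # Assumption: each entry of batch_means is a mean over its batch.
    # mean * n recovers the per-batch sum; dividing by the total number
    # of examples gives the mean over the whole dataset.
    total = sum(m * n for m, n in zip(batch_means, batch_sizes))
    return total / sum(batch_sizes)

logits = [[0.2, 1.5, -0.3], [2.0, 0.1, 0.4]]
n = len(logits)  # batch size, 2 in this toy example

preds_before = [argmax(row) for row in logits]
preds_after = [argmax(row) for row in scale(logits, n)]

# Two batches: 3 examples with mean loss 1.0, 1 example with mean loss 4.0.
dataset_mean = weighted_mean_loss([1.0, 4.0], [3, 1])
naive_mean = (1.0 + 4.0) / 2  # ignores batch sizes
```

In this toy case the predictions are identical before and after scaling, and the size-weighted mean differs from the naive mean only because the last batch is smaller. If the losses are already per-example values rather than a batch mean, this rationale does not apply.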

Hi Thomas,

Do you have any insights on this issue now?

Mar 11 '19 20:03 pranoy-k

I don't.

Mar 11 '19 20:03 thomwolf
