fengxin619
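seq2seq_model.py, line 108. The comments there say: "A special output mask needs to be built to block out the influence of sentence a. The predictions do not need the result at the final [SEP] token, hence the slice up to -1."
predictions = predictions[:, :-1].contiguous()
target_mask = token_type_id[:, 1:].contiguous()
---------------------
Why does target_mask drop the [CLS] position while predictions drops the [SEP] position? Doesn't that misalign the two when the loss is computed?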
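A small sketch may help show why the two slices line up rather than drift apart. It assumes the usual next-token setup, where the output at position t is scored against the token at position t+1, so the labels are the inputs shifted left by one. That label shift is an assumption here, since the quoted snippet does not show it. The toy tokens and the helper name `shift_for_loss` are made up for illustration and are not from the repository.

```python
# Toy sequence: sentence a (type 0) followed by sentence b (type 1).
tokens = ["[CLS]", "a1", "a2", "[SEP]", "b1", "b2", "[SEP]"]
token_type_id = [0, 0, 0, 0, 1, 1, 1]


def shift_for_loss(tokens, token_type_id):
    """Pair each predicting position with the token it should predict.

    Hypothetical helper: mirrors predictions[:, :-1] and token_type_id[:, 1:]
    on plain Python lists, assuming labels are the inputs shifted by one.
    """
    predicting_positions = tokens[:-1]  # like predictions[:, :-1]: the last [SEP] predicts nothing
    targets = tokens[1:]                # assumed labels: [CLS] is never a target
    target_mask = token_type_id[1:]     # like token_type_id[:, 1:]: aligned with the targets
    return [
        (src, tgt)
        for src, tgt, keep in zip(predicting_positions, targets, target_mask)
        if keep == 1  # only targets inside sentence b contribute to the loss
    ]


pairs = shift_for_loss(tokens, token_type_id)
for src, tgt in pairs:
    print(f"output at {src!r} is scored against {tgt!r}")
```

Under that assumption all three sequences have the same length, and the mask keeps exactly the steps whose target lies in sentence b, including the final [SEP].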
In the Keras version, I can't find the part that uses tf.train.ExponentialMovingAverage. Does this affect the model's performance?
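For readers unfamiliar with it, here is a minimal sketch of what an exponential moving average of a weight does: a shadow copy is nudged toward the current value after each training step, and the shadow copy is what gets used at evaluation time. The function name `ema_update` and the numbers are illustrative assumptions, and the sketch says nothing about whether the Keras code applies EMA or how much it matters for this model.

```python
def ema_update(shadow, value, decay=0.999):
    """One EMA step: move the shadow value a fraction (1 - decay) toward `value`."""
    return decay * shadow + (1.0 - decay) * value


# Illustrative run: a single scalar "weight" that stays at 1.0 for three steps.
shadow = 0.0
for weight in [1.0, 1.0, 1.0]:
    shadow = ema_update(shadow, weight, decay=0.5)

print(shadow)
```

A decay close to 1 makes the shadow value change slowly, which smooths out step-to-step noise in the weights.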
As the title says: where is the problem?
[image]
Could you tell me where the formula comes from?