Jiale Gao
Results
1
comments of
Jiale Gao
Maybe should change the size of position_base to (1 * segment_num) rather than (batch_size * segment_num) when initializing. It's because we can't expend the tensor with the size in dimension...