Jiale Gao

Results: 1 comment by Jiale Gao

Maybe the size of position_base should be changed to (1 * segment_num) rather than (batch_size * segment_num) when initializing. That's because we can't expand the tensor with the size in dimension...
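As a rough illustration of the point above, here is a minimal sketch of the shape rule that appears to be at issue. It assumes the comment refers to PyTorch-style `expand` semantics, where an existing dimension can only be expanded if its size is 1. The helper `can_expand`, the concrete sizes, and the `runtime_batch` scenario are hypothetical and not taken from the original code.

```python
def can_expand(shape, target):
    """Return True if a tensor of `shape` could be expanded to `target`
    under the singleton rule: every existing dimension must either
    already match the target size or have size 1."""
    if len(shape) > len(target):
        return False
    # Align trailing dimensions; any missing leading dimensions count as new.
    padded = (1,) * (len(target) - len(shape)) + tuple(shape)
    return all(s == t or s == 1 for s, t in zip(padded, target))


# Hypothetical sizes, chosen only for illustration.
batch_size, segment_num = 4, 8
runtime_batch = 3  # e.g. a final batch smaller than batch_size

# Initialized as (1, segment_num): dimension 0 is a singleton, so it can grow.
print(can_expand((1, segment_num), (runtime_batch, segment_num)))

# Initialized as (batch_size, segment_num): dimension 0 is fixed at 4.
print(can_expand((batch_size, segment_num), (runtime_batch, segment_num)))
```

If the tensor is a PyTorch tensor of shape (1, segment_num), the corresponding call would presumably be something like `position_base.expand(runtime_batch, -1)`, which returns a view without copying data.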