BILL Xu

Results 1 issues of BILL Xu

According to the Section 3.3.1 in the paper, the input of BERT consists of query and context whose length should be `seq_len = n+m+2` and the output drops the representations...