Yonghee Cheon
Results: 1 comment by Yonghee Cheon
@codertimo Since BERT uses learned positional embeddings, and this is one of the biggest differences between the original Transformer and BERT, I think it is quite urgent to modify the positional...
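To make the distinction in this comment concrete, here is a minimal sketch, assuming PyTorch. The class names, `d_model`, and `max_len` are hypothetical and not taken from the repository's code. It only contrasts a fixed sinusoidal table (original Transformer) with a trainable position table (the BERT approach); it is not a proposed patch.

```python
import math

import torch
import torch.nn as nn


class SinusoidalPositionalEmbedding(nn.Module):
    """Fixed sin/cos table in the style of the original Transformer (hypothetical name).

    Assumes d_model is even. The table is a buffer, so it has no trainable weights.
    """

    def __init__(self, d_model: int, max_len: int = 512):
        super().__init__()
        position = torch.arange(max_len, dtype=torch.float).unsqueeze(1)
        div_term = torch.exp(
            torch.arange(0, d_model, 2, dtype=torch.float)
            * (-math.log(10000.0) / d_model)
        )
        pe = torch.zeros(max_len, d_model)
        pe[:, 0::2] = torch.sin(position * div_term)  # even dimensions
        pe[:, 1::2] = torch.cos(position * div_term)  # odd dimensions
        self.register_buffer("pe", pe)  # stored with the model, never updated

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        seq_len = token_ids.size(1)
        return self.pe[:seq_len].unsqueeze(0)  # (1, seq_len, d_model)


class LearnedPositionalEmbedding(nn.Module):
    """Trainable position table, one vector per position (hypothetical name)."""

    def __init__(self, d_model: int, max_len: int = 512):
        super().__init__()
        self.embed = nn.Embedding(max_len, d_model)  # updated by the optimizer

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        seq_len = token_ids.size(1)
        positions = torch.arange(seq_len, device=token_ids.device)
        return self.embed(positions).unsqueeze(0)  # (1, seq_len, d_model)
```

Both modules return a tensor that broadcasts over the batch when added to token embeddings of shape `(batch, seq_len, d_model)`. The practical difference is that only the learned variant contributes parameters (`max_len * d_model` of them), and it cannot represent positions beyond `max_len`.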