Yonghee Cheon

Results 1 comments of Yonghee Cheon

@codertimo Since BERT uses learned positional embeddings and it is one of the biggest difference between original transformers and BERT, I think it is quite urgent to modify the positional...