XingWu_UCAS

Results 17 issues of XingWu_UCAS

https://www.paddlepaddle.org.cn Thanks ~

enhancement
help wanted

![image](https://user-images.githubusercontent.com/17399203/77849733-6b100580-7200-11ea-93c7-79ee8feee26a.png) I am not familiar with pytorch's DistributedDataParallel, and I am confused that why only teacher_model is applied DistributedDataParallel in general_distill.py ?

I tried the code on RTE/SNLI/MNLI tasks, but UDA's results are worse, did anyone tried it before ?

Hi, The dev result of coCondenser on MSMARCO-Passage-Ranking-Submissions leaderboard is 0.443. Is it the results on Large size model ? Thank you @luyug ![image](https://user-images.githubusercontent.com/17399203/180633815-3cabfc02-5638-4cd9-9803-2c949b459d1f.png)

I pretrained a condeser-roberta-base on the same data and hyperparameters, but the results on downstream tasks were not high. Have you ever tried condenser pretraining on RoBERTa-base ? Thank you

![image](https://user-images.githubusercontent.com/17399203/178095194-4e28c991-f93b-4ade-a1b1-43b718ef1edb.png) Hi, Have you tried the InfoNCE loss in Global-local Feature Alignment ? [CLS] and [MSK] in the same sentence constitute positive pairs [CLS] and [MSK] in different sentence constitute...

您好, 我下载预训练数据后发现里面有一些 &amp , &lt 这样被转义后的token,这些您有做 unescape 么? ![image](https://user-images.githubusercontent.com/17399203/170313679-b8f26560-dff4-444b-a9d7-d9229bdf9904.png) 感谢

sub.json is organized in the format: [{'image': '4385058960_b0f291553e.jpg', 'caption': 'a wooden chair in the living room', 'url': 'http://static.flickr.com/2723/4385058960_b0f291553e.jpg'}, ...} but the downloaded sbu_images.rar is extracted as: 0000/ 0001/ 0002/ 0003/...

bug

### 🐛 Describe the bug 使用 gemini,必须是2的幂的卡数,不然出现 assert chunk_size % self.pg_size == 0 打印 chunk_size 是 40MB ### Environment 多台 8x80G A100,使用最新的code

bug