XingWu_UCAS issues

Results 17 issues of


                                            XingWu_UCAS

Is it possible to use PaddlePaddle as backend for texar ?

https://www.paddlepaddle.org.cn Thanks ~

enhancement

help wanted

Why only teacher_model is applied DistributedDataParallel in general_distill.py ?

![image](https://user-images.githubusercontent.com/17399203/77849733-6b100580-7200-11ea-93c7-79ee8feee26a.png) I am not familiar with pytorch's DistributedDataParallel, and I am confused that why only teacher_model is applied DistributedDataParallel in general_distill.py ?

Results on RTE/SNLI/MNLI tasks

I tried the code on RTE/SNLI/MNLI tasks, but UDA's results are worse, did anyone tried it before ?

results on leaderboard

Hi, The dev result of coCondenser on MSMARCO-Passage-Ranking-Submissions leaderboard is 0.443. Is it the results on Large size model ? Thank you @luyug ![image](https://user-images.githubusercontent.com/17399203/180633815-3cabfc02-5638-4cd9-9803-2c949b459d1f.png)

Have you tried condenser pretraining on RoBERTa ?

I pretrained a condeser-roberta-base on the same data and hyperparameters, but the results on downstream tasks were not high. Have you ever tried condenser pretraining on RoBERTa-base ? Thank you

Global-local Feature Alignment

![image](https://user-images.githubusercontent.com/17399203/178095194-4e28c991-f93b-4ade-a1b1-43b718ef1edb.png) Hi, Have you tried the InfoNCE loss in Global-local Feature Alignment ? [CLS] and [MSK] in the same sentence constitute positive pairs [CLS] and [MSK] in different sentence constitute...

预训练数据集中的 &amp &lt 需要做 unescape 么？

您好，我下载预训练数据后发现里面有一些 &amp , &lt 这样被转义后的token，这些您有做 unescape 么？ ![image](https://user-images.githubusercontent.com/17399203/170313679-b8f26560-dff4-444b-a9d7-d9229bdf9904.png) 感谢

Will you please fix the link in dataset_instruction/instruction? Thank you.

sbu caption dataset format

sub.json is organized in the format: [{'image': '4385058960_b0f291553e.jpg', 'caption': 'a wooden chair in the living room', 'url': 'http://static.flickr.com/2723/4385058960_b0f291553e.jpg'}, ...} but the downloaded sbu_images.rar is extracted as: 0000/ 0001/ 0002/ 0003/...

bug

[BUG]: 使用 gemini，必须是2的幂的卡数，不然出现 assert chunk_size % self.pg_size == 0

### 🐛 Describe the bug 使用 gemini，必须是2的幂的卡数，不然出现 assert chunk_size % self.pg_size == 0 打印 chunk_size 是 40MB ### Environment 多台 8x80G A100，使用最新的code

bug