GLUE-baselines
GLUE-baselines copied to clipboard
fail in downloading MRPC
I cannot download MRPC dataset and should have not been blocked from s3.amazonaws.com (l am in US). I have already tried
git clone https://github.com/wasiahmad/paraphrase_identification.git
python download_glue_data.py --data_dir glue_data --tasks all --path_to_mrpc=paraphrase_identification/dataset/msr-paraphrase-corpus
I noticed that there is actually no "MRPC in TASK2PATH, then why are we doing urlretrieve?
Following the history, I added
"MRPC":'https://firebasestorage.googleapis.com/v0/b/mtl-sentence-representations.appspot.com/o/data%2Fmrpc_dev_ids.tsv?alt=media&token=ec5c0836-31d5-48f4-b431-7480817f1adc'
to TASK2PATH and it is solved now.