
Zipped File

Open raequan opened this issue 5 years ago • 1 comments

I am having an issue running BERT with the new dataset. Looking into it, I see that gzip refers to a zipped file; however, mine is showing up as an unzipped file. I think my file is unzipped, but in any case, what should I do? Here is an image of my terminal. [Image: Screen Shot 2020-05-12 at 8 30 32 PM]

raequan, May 13 '20 03:05

The filename should be corona.multilang100.jsonl.gz, but your filename is corona.multilang100.jsonl. So it looks like you're right that you've unzipped the file. Running

$ gzip corona.multilang100.jsonl

will recompress the file, replacing it with corona.multilang100.jsonl.gz. If that doesn't work, you can redownload the file.
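
For anyone who wants to check the file's state before rerunning, the steps above can be sketched as a small shell helper. This is only a sketch: the function name `ensure_gzipped` is hypothetical and not part of the course repository, and it assumes the standard `gzip` command-line tool is installed. It tests an existing `.gz` file with `gzip -t`, and otherwise compresses the uncompressed copy if one is present.

```shell
#!/bin/sh
# Hypothetical helper (not from the course repo): make sure the path given
# as $1, which should end in .gz, exists as a valid gzip file.
ensure_gzipped() {
    gz_path="$1"
    raw_path="${gz_path%.gz}"   # the same path without the .gz suffix

    if [ -f "$gz_path" ]; then
        # gzip -t checks the compressed file's integrity without extracting it
        if gzip -t "$gz_path" 2>/dev/null; then
            echo "ok: $gz_path is a valid gzip file"
            return 0
        fi
        echo "error: $gz_path exists but is not valid gzip data" >&2
        return 1
    fi

    if [ -f "$raw_path" ]; then
        # gzip replaces the input file with a compressed copy named input.gz
        gzip "$raw_path" && echo "recompressed: $gz_path"
        return
    fi

    echo "error: found neither $gz_path nor $raw_path; redownload the file" >&2
    return 1
}

# Usage, with the filename from this thread:
# ensure_gzipped corona.multilang100.jsonl.gz
```

If the helper reports that neither file exists, or that the `.gz` file is not valid gzip data, redownloading is probably the simpler fix.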

mikeizbicki, May 13 '20 04:05