CodeSearchNet

How big is the dataset?

Open skye95git opened this issue 4 years ago • 0 comments

The Setup section says: "The datasets you will download (most of them compressed) have a combined size of only ~ 3.5 GB."

The Downloading Data from S3 section says: "The size of the dataset is approximately 20 GB."

Both describe the data downloaded by running script/setup. Why do the two sizes differ, and which one is right? Does 3.5 GB refer only to the size of the dataset for a single programming language?
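To see which figure matches what actually ends up on disk, something like the following could be run after script/setup finishes. It is only a rough sketch: `dir_sizes_gb` is a hypothetical helper written for this question, and it reports on-disk file sizes (1 GB taken as 10^9 bytes), so compressed archives count at their compressed size.

```python
import os
from collections import defaultdict


def dir_sizes_gb(root):
    """Return {top-level entry under root: total file size in GB}.

    Files sitting directly in root are grouped under ".".
    Symbolic links are skipped so nothing is counted twice.
    """
    totals = defaultdict(int)
    for dirpath, _dirnames, filenames in os.walk(root):
        rel = os.path.relpath(dirpath, root)
        top = "." if rel == "." else rel.split(os.sep)[0]
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):
                totals[top] += os.path.getsize(path)
    # Convert bytes to GB (decimal, 1 GB = 1e9 bytes).
    return {name: size / 1e9 for name, size in totals.items()}
```

Calling `dir_sizes_gb("resources/data")` (the path is an assumption about where the data lands, not something stated in the sections quoted above) and summing the values would give a total to compare against the 3.5 GB and 20 GB figures; the per-directory breakdown would also show whether 3.5 GB could be a per-language number.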

skye95git, Aug 22 '21 08:08