AccessDeniedException: 403 does not have storage.objects.list access to the Google Cloud Storage bucket.

Open zwang92 opened this issue 1 year ago • 2 comments

I am trying to follow the data download instructions at https://github.com/mlcommons/training/blob/master/large_language_model/megatron-lm/README.md#data-download to download data from gs://mlperf-llm-public2, using the following command: gsutil cp -r gs://mlperf-llm-public2/c4/en_val_subset_json/c4-validation_24567exp.json .

It fails with the following error message: "AccessDeniedException: 403 [email protected] does not have storage.objects.list access to the Google Cloud Storage bucket. Permission 'storage.objects.list' denied on resource (or it may not exist)."
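For context, the message shows that the refused request was a list operation (storage.objects.list). Reading a single object by its full name is a separate permission (storage.objects.get), so a direct GET of the object may behave differently. The following is only a minimal sketch: it assumes the object is publicly readable over HTTPS, which nothing in this thread confirms, and the helper names gs_to_https and try_download are hypothetical, not part of any MLCommons tooling.

```python
import shutil
import urllib.error
import urllib.parse
import urllib.request


def gs_to_https(gs_uri: str) -> str:
    """Map gs://bucket/object to the storage.googleapis.com URL for that object."""
    prefix = "gs://"
    if not gs_uri.startswith(prefix):
        raise ValueError(f"not a gs:// URI: {gs_uri!r}")
    bucket, _, obj = gs_uri[len(prefix):].partition("/")
    if not bucket or not obj:
        raise ValueError(f"expected gs://bucket/object, got {gs_uri!r}")
    # quote() keeps "/" by default, so the object path stays intact.
    return f"https://storage.googleapis.com/{bucket}/{urllib.parse.quote(obj)}"


def try_download(gs_uri: str, dest: str) -> bool:
    """Attempt an anonymous GET of one object; return True on success.

    Assumption: the object allows public reads. If it does not, the
    server answers with an HTTP error and this function returns False.
    """
    url = gs_to_https(gs_uri)
    try:
        with urllib.request.urlopen(url, timeout=30) as resp, open(dest, "wb") as out:
            shutil.copyfileobj(resp, out)
        return True
    except urllib.error.HTTPError as err:
        print(f"GET {url} failed with HTTP {err.code}")
        return False
    except urllib.error.URLError as err:
        print(f"GET {url} failed: {err.reason}")
        return False
```

Calling try_download("gs://mlperf-llm-public2/c4/en_val_subset_json/c4-validation_24567exp.json", "c4-validation_24567exp.json") would attempt the fetch. A False result with an HTTP 403 would suggest the object itself is not publicly readable, rather than only the listing being restricted.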

Could anyone suggest how to download gs://mlperf-llm-public2/c4/en_val_subset_json/c4-validation_24567exp.json?

Thanks a lot

zwang92 avatar Jul 09 '23 01:07 zwang92

All required data can be downloaded using instructions in the S3 artifacts download section of the README.

ShriyaPalsamudram avatar Jul 31 '24 19:07 ShriyaPalsamudram

@zwang92 can you try again with the updated instructions in the README?

hiwotadese avatar Aug 14 '24 21:08 hiwotadese