training
training copied to clipboard
AccessDeniedException: 403 does not have storage.objects.list access to the Google Cloud Storage bucket.
I am trying to follow https://github.com/mlcommons/training/blob/master/large_language_model/megatron-lm/README.md#data-download to download data on gs://mlperf-llm-public2 as following: gsutil cp -r gs://mlperf-llm-public2/c4/en_val_subset_json/c4-validation_24567exp.json .
It fails with error message as following: "AccessDeniedException: 403 [email protected] does not have storage.objects.list access to the Google Cloud Storage bucket. Permission 'storage.objects.list' denied on resource (or it may not exist)."
Could anyone give any suggestion on how to download gs://mlperf-llm-public2/c4/en_val_subset_json/c4-validation_24567exp.json ?
Thanks a lot
All required data can be downloaded using instructions in the S3 artifacts download section of the README.
@zwang92 can you try with the updated instruction in the readme?