llm-foundry icon indicating copy to clipboard operation
llm-foundry copied to clipboard

MosaicML commands with S3

Open ashoksmavd opened this issue 2 years ago • 3 comments

I have try to implement MPT-7b-chat based on MosaicML platform. I have executed the first step to convert c4 data set to steaming type and store my shards files on s3 path, after that I have executed composer command to train my MPT-7B model with the shards stored under S3. It's showing some errors. I have change yaml file to commented the data local path, - and provide data-remote path to execute its always says index.json file took too long to download, bailing out.

--Any one help me out with this. I need to execute the composer command with my shards on s3

ashoksmavd avatar Jun 16 '23 10:06 ashoksmavd

HI ashoksmavd,

Can you provide your yaml so I can have a look?

codestar12 avatar Jun 16 '23 14:06 codestar12

hi @codestar12

Here I have attached my yaml file.

Building train loader... ERROR:composer.cli.launcher:Rank 1 crashed with exit code 1.

mpt-7b.txt

ashoksmavd avatar Jun 17 '23 05:06 ashoksmavd

Have you double checked the correctness of your s3 path and aws permissions? I was going to take a look at how you formated your s3 path but I assume you kept it generic for privacy reasons.

codestar12 avatar Jun 22 '23 03:06 codestar12

Closing as stale -- please re-open if you continue to have issues!

hanlint avatar Jul 24 '23 06:07 hanlint