aws-step-functions-data-science-sdk-python
aws-step-functions-data-science-sdk-python copied to clipboard
Problem with CheckpointConfig s3uri in Tensorflow estimator
Values from estimator not passing to the step function definition (JSON). Had to manually set it in json by editing the step function from console to make the training job work.
"CheckpointConfig": { "S3Uri": "s3://bucket_name/location/for/checkpoint/" },
S3Uri is taken from checkpoint_s3_uri right? but in my case this is not working. CheckpointConfig dictionary is not generated.
Thank you for taking a look at it.
Hello @roshsuta,
Sorry for the late response!
I was able to repro the issue by passing a Tensorflow Estimator with CheckpointConfig to TrainingStep constructor and creating a workflow with that step. The workflow definition is indeed missing the CheckpointConfig property.
Thank you for bringing this to our attention - tagging this as a bug.