Juan Acevedo

Results: 15 comments by Juan Acevedo

The reason your job completes is that the base TPU image knows to find main.sh as the entrypoint. Add this to the end of your Dockerfile: `ENTRYPOINT ["./main.sh"]` This...

@StateGovernment take a look at [this](https://github.com/entrpn/serving-model-cards/tree/main/serving-stable-diffusion) example. It is PyTorch only, so you'll have to convert the model to PyTorch as demonstrated in the DreamBooth training repo. In your case,...

This is a deployment pattern for Vertex endpoints using custom containers. The README explains how to deploy it to the endpoint, following the Vertex AI documentation: https://cloud.google.com/vertex-ai/docs/predictions/use-custom-container
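For anyone unfamiliar with the pattern, here is a minimal sketch of what a custom serving container generally has to expose: a health route and a prediction route that accepts `{"instances": [...]}` and returns `{"predictions": [...]}`. This is not the code from the linked example. The `predict` function is a hypothetical placeholder, the fallback port and route values are assumptions for local testing, and only the `AIP_*` environment variable names and the request/response shape come from the Vertex AI custom container requirements.

```python
import json
import os
from http.server import BaseHTTPRequestHandler, HTTPServer

# Vertex AI sets these environment variables in the serving container.
# The fallback values are assumptions, used only when running locally.
PORT = int(os.environ.get("AIP_HTTP_PORT", "8080"))
HEALTH_ROUTE = os.environ.get("AIP_HEALTH_ROUTE", "/health")
PREDICT_ROUTE = os.environ.get("AIP_PREDICT_ROUTE", "/predict")


def predict(instances):
    """Hypothetical placeholder model: echoes each instance back."""
    return [{"echo": instance} for instance in instances]


def handle_predict(body: bytes) -> bytes:
    """Map a {"instances": [...]} request body to a {"predictions": [...]} response."""
    request = json.loads(body)
    response = {"predictions": predict(request["instances"])}
    return json.dumps(response).encode("utf-8")


class Handler(BaseHTTPRequestHandler):
    def _send(self, status, payload=b""):
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)

    def do_GET(self):
        # Health check: any 200 response marks the container as ready.
        if self.path == HEALTH_ROUTE:
            self._send(200, b"{}")
        else:
            self._send(404)

    def do_POST(self):
        if self.path != PREDICT_ROUTE:
            self._send(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        self._send(200, handle_predict(self.rfile.read(length)))


if __name__ == "__main__":
    HTTPServer(("0.0.0.0", PORT), Handler).serve_forever()
```

In a real deployment you would replace `predict` with the actual model call and load the model once at startup rather than per request.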

@sayakpaul please help me review. I fixed the per-step time calculations, which were previously wrong. Thanks!

@sayakpaul I'm still working on this.