terraform-provider-iterative
terraform-provider-iterative copied to clipboard
☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes
``` TPI [INFO] LOG 0 >> 2022-05-21 16:50:59 tpi-task.service: Failed with result 'timeout'. TPI [INFO] Status: completed with errors • ``` It would be nice for the final logline to...
Add support for ```instance_permission_set```
`goreleaser/goreleaser-action` takes [~30min to build 8 packages](https://github.com/iterative/terraform-provider-iterative/actions/workflows/release.yml) on GHA atm. Advantages of using a cloud instance to build instead: - doubles up as a form of e2e testing - provision...
Revert NVIDIA/nvidia-docker#568 once https://github.com/NVIDIA/nvidia-container-toolkit/issues/257 is resolved
Can we use APIs to retrieve an estimated hourly price for `machine` on each provider to print to the screen on `apply`? atm it only says `{machine} on {cloud}`. Would...
One should be an alias of the other and should be resolved at the very beginning to follow up with the same identifier along the chain, of not is enforcing...
machine and, by extension, runner follow–up to #196 * [ ] Port https://github.com/iterative/terraform-provider-iterative/pull/384 to `iterative_machine` * [ ] Deprecate in–house machine images from https://github.com/iterative/terraform-provider-iterative/pull/409 * [ ] Make sure that...
For additional context see: * https://github.com/iterative/terraform-provider-iterative/pull/550 * https://github.com/iterative/terraform-provider-iterative/issues/501 * https://github.com/iterative/terraform-provider-iterative/issues/236
Once https://github.com/docker/cli/pull/2934 gets released, the changes introduced with #61 won't be required anymore and should be reverted.
Goal: recover deep learning jobs, minimize data sync for reusable machines (#209) Cloud data sync - all data syncs through a cloud directly (S3, etc). This scenario does not include...