terraform-google-kubernetes-engine icon indicating copy to clipboard operation
terraform-google-kubernetes-engine copied to clipboard

Possible flaky CI with remove_default_node_pool = true

Open apeabody opened this issue 3 years ago • 4 comments

TL;DR

Possible flaky CI with remove_default_node_pool = true

Error: Error deleting default node pool: googleapi: Error 400: Operation operation-{} is currently creating a node pool for cluster node-pool-cluster-{}. Please wait and try again once it is done.

Expected behavior

No response

Observed behavior

No response

Terraform Configuration

converge node-pool-local

Terraform Version

1.1.8

Additional information

No response

apeabody avatar Apr 15 '22 15:04 apeabody

Upstream provider issue https://github.com/hashicorp/terraform-provider-google/issues/10366

apeabody avatar Apr 15 '22 18:04 apeabody

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 7 days

github-actions[bot] avatar Jul 23 '22 23:07 github-actions[bot]

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 7 days

github-actions[bot] avatar Sep 30 '22 23:09 github-actions[bot]

Hello everyone,

Saw this issue and wanted to share my input on this and ask for any suggestions if there is a way to resolve the error even temporarily.

At the moment I am able to reproduce the 400 Error 7/10 times while provisioning a GKE cluster via Terraform with the option remove_default_node_pool = true.

The Terraform module I am running creates 4 resources:

google_compute_network google_compute_subnetwork google_container_cluster google_container_node_pool

Each time it errors out on google_container_cluster, around 6min into the resource creation.

If I set remove_default_node_pool = false. the module completes every time since it takes just a bit under 6 min to create the cluster.

I am sharing the last few lines of terraform apply debug output including the received error message: https://gist.githubusercontent.com/jacknif3/062dce1ae3d23e01d1295f5470e0091f/raw/264bf02ad59959ce50d081f116a1c36ed4028c51/gistfile1.txt

Please let me know if you need to see how I'm creating the resources or anything else, I'll be glad to try anything to resolve this issue.

Additional info:

Terraform version: 1.3.4 Google provider version: 4.42.1

jacknif3 avatar Nov 07 '22 15:11 jacknif3