terraform-provider-kubernetes kubernetes namespace already created fails

kubernetes namespace already created fails

Open katlimruiz opened this issue 2 years ago • 36 comments

Terraform Version, Provider Version and Kubernetes Version

Terraform version:
Terraform v0.15.5
on windows_amd64
+ provider registry.terraform.io/cloudflare/cloudflare v2.21.0
+ provider registry.terraform.io/hashicorp/azuread v1.5.1
+ provider registry.terraform.io/hashicorp/azurerm v2.62.1
+ provider registry.terraform.io/hashicorp/helm v2.2.0
+ provider registry.terraform.io/hashicorp/kubernetes v2.3.2
+ provider registry.terraform.io/hashicorp/local v2.1.0
+ provider registry.terraform.io/terraform-providers/azuredevops v0.1.5

Affected Resource(s)

kubernetes_namespace

Terraform Configuration Files

resource "kubernetes_namespace" "kubnss" { for_each = toset(var.namespaces) metadata { name = each.key } }

Expected Behavior

If the namespace is already created, it should just omit the statement and move on to the next one

Actual Behavior

Error: namespaces "xxxx-web-xxxxx" already exists │ │ with module.production.module.kubernetes_web.kubernetes_namespace.kubnss["myns"], │ on m/kubernetes/main.tf line 69, in resource "kubernetes_namespace" "kubnss": │ 69: resource "kubernetes_namespace" "kubnss" {

Community Note

Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
If you are interested in working on this issue or have submitted a pull request, please leave a comment

Sep 14 '21 16:09 katlimruiz

Good day @katlimruiz,

I do not think this is a bug.

You asked terraform to create a resource and it failed because it was already created. The common practice is to import that already created resource into terraform state so that terraform can manage it.

Thanks!

Oct 08 '21 16:10 curtbushko

However, Terraform whole idea is to be declarative, if the object exist and no diff is made, then it doesn't make sense to recreate it (hence no error). Every other object in TF works like this, why would a k8s namespace differ from it?. Even the k8s cluster work like that. It is this specific object that is not.

Oct 08 '21 19:10 katlimruiz

Are you positive that all terraform resources are ignored if they already exist?

I ask this because there have been dozens of times where we have broken the terraform state and I was forced to import buckets, databases, kubernetes services, etc before an apply worked correctly again...

Oct 08 '21 19:10 curtbushko

if they are in the state and they are already there, then yes, all terraform scripts work like that otherwise this would just never work.

This is my experience. When you already have a whole platform in place, and you want to use terraform, then yes, it is a pain in the b*tt because it tells you to import a lot of resources, and importing them is very slow and problematic.

When you create a platform from scratch in terraform, then things go much much easier. There are some objects like Kubernetes cluster that do not communicate their full creation to the cloud provider therefore (even documentation says) you have to separate the creation from the apply of more kub resources, otherwise it would not work.

The only times I've seen the state broken is when 1) you use git to store the state 2) you made changes to the cloud manually and therefore discrepancies occur.

Oct 08 '21 20:10 katlimruiz

I definitely agree, getting everything that you already created into terraform state is a big pain in the b*tt! I would really like to see an --auto-import or --ignore-exists flag added to terraform.

One tool that I haven't tried yet but looks interesting is terraformer. It might help with those missing resources.

I've had two cases of major terraform state corruption:

A bad version of the google cloud provider being released for one day. They changed a timestamp integer size for the BigQuery resource and then reverted it. I had to manually remove the timestamp field from the state to get it working and after that I always lock down my terraform provider versions. :)
Before kubernetes_manifest existed we used a provider called k8sraw that allows you to apply raw yaml files to kubernetes. It has been useful for Istio resources. It has bugs that sometimes corrupt the state :(

Oct 08 '21 20:10 curtbushko

@katlimruiz This is not a bug with the kubernetes provider. If you think terraform should magically do terraform import when an object doesn't exist in the state, that would be a feature request on terraform itself.

Oct 12 '21 11:10 jbg

I'm not saying that.

The namespace was created with tf and it was on the state.

On a second run it tried to recreate it which is wrong.

Sent from my iPhone12

On 12 Oct 2021, at 06:29, Jasper @.***> wrote:

@katlimruiz This is not a bug with the kubernetes provider. If you think terraform should magically do terraform import when an object doesn't exist in the state, that would be a feature request on terraform itself.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android.

Oct 12 '21 13:10 katlimruiz

Did you confirm that the object was in the state after the first apply? Did the first apply complete successfully?

Oct 12 '21 14:10 jbg

I'm seeing the same issue.

Dec 03 '21 17:12 astove

The same here, any progress?

Jan 19 '22 11:01 chengjianhua

an option should be there to bypass the namespace creation if found already existent. the behaviour of failing on already-exist namespace renders then entire terraform template non-declarative.

Jan 20 '22 17:01 digihunch

Same thing happened to me just now.

Jan 23 '22 11:01 AxelJoly

Same here. Basically we have to comment out namespace.tf file after first apply, otherwise it's failing.

Jan 25 '22 14:01 pauleikis

Does your first apply fail? If the first apply is successful then the namespace is added to the state and creation of the resource shouldn't be attempted by tf the next time. On the other hand if something prevents the resource from being added to the state in the first apply then it will indeed try to create it again on the next run.

Jan 25 '22 14:01 jbg

it appears that if the resource (e.g. namespace) was created by terraform provider, then it remains declarative. Since it is registered in the tf state, attempting to create it again doesn't fail. However, if the resource had been created outside of terraform, then the kubernetes provider will fail on it.

Jan 26 '22 02:01 digihunch

That is normal, and is just how Terraform works. If you create a resource outside of Terraform, then you need to import it into the state before you try to manage it with Terraform.

Jan 26 '22 03:01 jbg

@jbg that's not true, helm_release for example, doesn't work the same way. If I installed some chart bypass terraform with helm install and then I added it to terraform with no changes, terraform apply will do the job, no error, and this is how things must be

and this is not magic, it is programming and there is a definition for "declarative" manner, whether you like it or not

Jul 21 '22 08:07 mossad-zika

That's not "how things must be". Read up on the design of terraform and why it uses state. The helm_release behaviour you describe is not how most terraform resources work. If you don't want it to be that way, then what you really want is a different tool.

Jul 21 '22 08:07 jbg

@jbg you are literally the only one who thinks this open ticket shouldn't raise a fix. And since the ticket is not closed apparently you are not in charge of this provider anyhow, so it would be nice if you will stop posting here about "magic" and other nonsense without even providing any links to prove your words

Jul 21 '22 09:07 mossad-zika

Just trying to help you understand the TF model and why this won't be "fixed". Feel free to keep waiting though! All the best!

Jul 21 '22 09:07 jbg

@jbg I don't need to wait for anything, because there is a properly working Kubernetes "kubectl" Provider

Jul 21 '22 11:07 mossad-zika

That's an unfortunate side-effect of kubectl provider calling kubectl apply internally which doesn't differentiate between create and modify. It's not consistent with TF's provider contract (which is followed by the vast majority of providers; try creating a resource that already exists with any of the major TF providers and you'll see), and causes problems if people didn't intend to overwrite existing objects with the same kind/namespace/name. The author of the kubectl provider considered fixing this behaviour in https://github.com/gavinbunney/terraform-provider-kubectl/issues/73 (by changing to use kubectl create when TF asks the provider to create a resource); it seems they didn't get around to it but hopefully it will be fixed in future.

TF's view of the world is the state. If the object doesn't exist in the state, the provider is asked to create it, and that operation is supposed to fail if the object already exists in the "real world". This is how almost every provider works, and it's the reason why the import operation exists. I'm curious what you think terraform import is for, if you think that the incorrect behaviour of the kubectl provider is "properly working".

Jul 21 '22 14:07 jbg

I came across this issue as well, applied the following workaround. IMO this is not a bug and I also don't think a feature is justified for something that can be worked around.

data "kubernetes_all_namespaces" "allns" {}

resource "kubernetes_namespace" "this" {
  for_each = toset([ for k in var.namespaces : k if !contains(keys(data.kubernetes_all_namespaces.allns), k) ])
  metadata {
    name = each.key
  }
  depends_on = [data.kubernetes_all_namespaces.allns] # potentially more if you want to refresh list of NS
}

Feb 17 '23 09:02 oferchen

@oferchen I don't agree with you, Terraform is a declarative language & it is supposed to ignore a resource that already exists in a state that is wanted. So an error because a resource exists is against Terraform.

Mar 09 '23 16:03 d3vpasha

It all comes down to the behaviour one would like. For instance:

kubectl create ns foo
kubectl create ns foo # <---- error


kubectl apply -f ns.yaml
kubectl apply -f ns.yaml # <---- no error

So it comes down to if you want the create or apply behaviour but it appears that this provider only support create. I was hoping to use the cleaner language features of terraform to deploy our applications however, these limitations do not make it a good alternative to tools like helm or kustomize.

There doesn't seem to be clean way in terraform to deploy your app to say develop and then again to staging without the staging apply destroying the develop app first.

Apr 01 '23 21:04 nsainaney

It all comes down to the behaviour one would like. For instance:
kubectl create ns foo
kubectl create ns foo # <---- error

kubectl apply -f ns.yaml
kubectl apply -f ns.yaml # <---- no error
So it comes down to if you want the create or apply behaviour but it appears that this provider only support create. I was hoping to use the cleaner language features of terraform to deploy our applications however, these limitations do not make it a good alternative to tools like helm or kustomize.

The big difference between ad-hoc applying of yaml to your cluster and Terraform is Terraform's state. Learning about it will explain why it works the way it does, and why all correctly-written providers work the same way as this one. Try creating a S3 bucket that already exists with the AWS provider, for example.

Start here to learn about state in Terraform: https://developer.hashicorp.com/terraform/language/state

If a resource is added to your TF config and it does not exist in the state, it will be created. If you desire to work with an existing object, you simply need to import it first. It's not a limitation, since it doesn't limit you, it merely forces you to be explicit -- not a bad thing when you're managing production infra.

https://developer.hashicorp.com/terraform/cli/import

There doesn't seem to be clean way in terraform to deploy your app to say develop and then again to staging without the staging apply destroying the develop app first.

Of course there is, this is a common use case. If you want to duplicate your whole config, look into workspaces. If you want to duplicate only part of it, define that part in a module and then use the module twice in your config, with a variable for namespace and whatever else needs to vary.

Apr 01 '23 23:04 jbg

There doesn't seem to be clean way in terraform to deploy your app to say develop and then again to staging without the staging apply destroying the develop app first.

Of course there is, this is a common use case. If you want to duplicate your whole config, look into workspaces. If you want to duplicate only part of it, define that part in a module and then use the module twice in your config, with a variable for namespace and whatever else needs to vary.

We provision the GKE cluster in the root module and deploy our app using a child module. We'd like to use the exact same module for all pre-production environments. Changing any variable in the child module (e.g. changing the namespace from develop to staging) triggers a destroy of the other environment (by design). We also tried to see if we could change the state storage based on variables e.g:

terraform {
 backend "gcs" {
   bucket  = "my-app"
   prefix  = var.app_environment == "develop" ? "terraform/develop-app-state" : "terraform/staging-app-state"
 }
}

works but we'd like to do this for any feature branch so the app environment is not deterministic. However, variables are not allowed in setting up the state so that seems to block us from reusing the same module to provision different kubernetes environments within namespaces.

I haven't explored workspaces so will check that out now.

Apr 02 '23 02:04 nsainaney

@jbg thank you for the RTFM tip. Just tried out workspaces and that was exactly what I was missing.

We have our terraform broken down into modules so for anyone trying this out, be sure to add a -chdir. What worked for me was:

terraform -chdir=gke init
terraform -chdir=app init
terraform -chdir=gke apply ....                                    # Sets up the GKE cluster
terraform -chdir=app workspace new <branch>
terraform -chdir=app apply ....                                    # Deploys the application for preview

Apr 02 '23 02:04 nsainaney

We provision the GKE cluster in the root module and deploy our app using a child module. We'd like to use the exact same module for all pre-production environments. Changing any variable in the child module (e.g. changing the namespace from develop to staging) triggers a destroy of the other environment (by design).

If you want to keep develop around and add staging alongside it, you wouldn't change the namespace from develop to staging on your existing module block (which means develop no longer exists in your config and will be removed upon apply).

Instead, you would add a second module block (referencing the same source) with the different variable. Both module instances can use the same provider(s).

Or, as you've found, you can use workspaces to have multiple separate states in the same backend. This is more appropriate when you want your whole configuration to be duplicated N times.

Apr 02 '23 03:04 jbg

What about leaving this resource and behavior as is, later renaming it to kubernetes_namespace_create while adding a kubernetes_namespace_apply with the desired adoption behavior so that the user can take advantage of both.

I don't think there is currently a workaround to check if a namespace exists before creating one with just the kubernetes provider?

Jun 19 '23 19:06 ashtonian

terraform-provider-kubernetes terraform-provider-kubernetes copied to clipboard

kubernetes namespace already created fails

Terraform Version, Provider Version and Kubernetes Version

Affected Resource(s)

Terraform Configuration Files

Expected Behavior

Actual Behavior

Community Note

terraform-provider-kubernetes
terraform-provider-kubernetes copied to clipboard