terraform-oci-oke icon indicating copy to clipboard operation
terraform-oci-oke copied to clipboard

Cluster Autoscaler installation fail

Open shunsukew opened this issue 1 year ago • 2 comments

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

Terraform Version and Provider Version

Terraform v1.6.6
on darwin_arm64
+ provider registry.terraform.io/hashicorp/cloudinit v2.3.3
+ provider registry.terraform.io/hashicorp/helm v2.12.1
+ provider registry.terraform.io/hashicorp/http v3.4.2
+ provider registry.terraform.io/hashicorp/null v3.2.2
+ provider registry.terraform.io/hashicorp/oci v5.33.0
+ provider registry.terraform.io/hashicorp/random v3.6.0
+ provider registry.terraform.io/hashicorp/time v0.10.0
+ provider registry.terraform.io/oracle/oci v5.30.0

Your version of Terraform is out of date! The latest version
is 1.8.2. You can update by downloading from https://www.terraform.io/downloads.html

Affected Resource(s)

module.{module_name}.module.extensions[0].null_resource.cluster_autoscaler[0]

Terraform Configuration Files

  // Autoscaler
  // https://oracle-terraform-modules.github.io/terraform-oci-oke/guide/extensions_cluster_autoscaler.html
  cluster_autoscaler_install           = true
  cluster_autoscaler_namespace         = "kube-system"
  cluster_autoscaler_helm_version      = "9.36.0"
  cluster_autoscaler_helm_values       = {}
  cluster_autoscaler_helm_values_files = []

  worker_pools = {
    oke-vm-autoscaler = {
      size = 1, // Do not scale out. Scale out below worker pool such as oke-vm-optimized.
      description      = "Node pool with cluster autoscaler scheduling allowed",
      allow_autoscaler = true,
      shape = "VM.Standard3.Flex"
      ocpus = 1,
      memory = 4,
      boot_volume_size: 50,
    },
    ...
}

Expected Behavior

Autoscaler is properly installed.

Actual Behavior

Installation fails due to remote-exec error. error message is here. Error message says, It looks like you are trying to use a client-go credential plugin that is not installed. which should be installed on the node.

Steps to Reproduce

Create OKE cluster with version 5.1.5, and set autoscaler config.

shunsukew avatar May 03 '24 02:05 shunsukew

hi, Thanks for logging this issue. In order for autoscaler installation to work, kubectl needs to be working too and it depends on oci-cli for authentication. The logs show that oci cli was not installed yet.

Can you please check on the operator host and confirm if that's the case?

hyder avatar May 06 '24 08:05 hyder

Hello, Yes, oci command is available on the operator host.

ubuntu@o-dev-japaneast-oke:~$ oci --version
3.40.1

Also, kubectl is available on the operator host. However, oci-cli was installed manually because it was not installed by default when provisioning by this oke module.

shunsukew avatar May 08 '24 04:05 shunsukew