cf-ops-automation

Deletion of a deployment transiently fails the associated deployment pipeline

Open gberche-orange opened this issue 7 years ago • 1 comments

When a deployment X is deleted from the paas-secret repo, the associated pipeline triggers and fails in execute-deploy-script (because the deleted paas-secret files are missing), turning red. Only once update-pipeline has run does the corresponding X bosh resource (e.g. deploy-cassandra-guid) get removed from the generated pipeline.

Solutions:

  • live with it for now: ignore the polluting red pipeline (and future red alerts)
  • COAB performs async delete
    • step 1: delete enable-deployment.yml, which triggers update-pipeline & removal of the deployment
    • step 2: finish paas-secret clean up (remaining files)
  • modify delete support in COA
    • enable-deployment.yml supports a new flag
      • style: declarative vs imperative
        • declarative
          • desired_state=enabled|disabled
          • expires_at=date
        • imperative
          • action=delete
          • action=recover
        • optional async paas-secret clean up
          • in delete pipeline
          • in concourse execute deploy.sh (in each deployment)
      • impacts of async COA paas-secret clean up:
        • mixed ownership of files in paas-secret
          • currently service_instances owned by paas-secret
      • impact of lack of clean up of paas-secret files
        • possible collision on service-id recycling by CC API
        • cognitive load on ops team
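
A rough sketch of two of the options above, in Python for illustration only. It assumes a deployment is just a directory in a paas-secret checkout, that `wait_for_pipeline_update` is a hypothetical hook standing in for "update-pipeline has run and dropped the deployment", and that a missing `desired_state` means enabled. None of these names exist in COA or COAB today.

```python
from datetime import datetime
from pathlib import Path
import shutil

ENABLE_FILE = "enable-deployment.yml"


def two_step_delete(deployment_dir: Path, wait_for_pipeline_update) -> None:
    """Async delete as in the COAB option: marker first, remaining files later."""
    marker = deployment_dir / ENABLE_FILE
    if marker.exists():
        # Step 1: removing the marker is what should trigger update-pipeline.
        marker.unlink()
    # Hypothetical hook: return once the pipeline no longer has the deployment.
    wait_for_pipeline_update()
    if deployment_dir.exists():
        # Step 2: clean up the remaining paas-secret files.
        shutil.rmtree(deployment_dir)


def is_deployment_active(flags: dict, now: datetime) -> bool:
    """Declarative option: desired_state=enabled|disabled plus optional expires_at."""
    # Assumption: a missing desired_state is treated as enabled.
    if flags.get("desired_state", "enabled") != "enabled":
        return False
    expires_at = flags.get("expires_at")
    # Assumption: expires_at is an ISO 8601 string without a timezone offset.
    if expires_at is not None and now >= datetime.fromisoformat(expires_at):
        return False
    return True
```

The point of the ordering in `two_step_delete` is that execute-deploy-script never runs against a half-deleted directory: by the time the remaining files disappear, the pipeline no longer contains the deployment.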

Possible related stories

  • undelete
  • dev deployment expiration/leases
  • usage-based billing (emitted by the deployment).

gberche-orange, Feb 07 '18 10:02

Maybe related to #146

o-orand, Jun 28 '18 13:06