chaos-operator
chaos-operator copied to clipboard
Wrong default value of probe success percentage for experiments with `Awaited` status. Value is 100 instead of 0
BUG REPORT
What happened:
While running a chaos predefined workflow from litmus portal UI for sock-shop application, I deleted all the chaos engines and all workflows from the litmus namespace, resulting in the workflow failing, but somehow I got a resilience score as a non zero value, upon debugging it seems the experiment with awaited status had a probe success percentage of 100
What you expected to happen:
Ideally, as per the docs probe success percentage of awaited experiment result should have been 0 unless a concrete experiment verdict is available i.e either pass or fail.
How to reproduce it (as minimally and precisely as possible):
Run a workflow from the litmus portal and delete the chaos engines and argo workflow created on the agent while it runs after the experiment job is created.
Anything else we need to know?:
API version and code reference: https://github.com/litmuschaos/litmus/blob/master/litmus-portal/cluster-agents/subscriber/pkg/cluster/events/util.go
Here is the screenshot of the debugging.
data:image/s3,"s3://crabby-images/b6eaf/b6eafa9e6c6d4d2be2b4281409504f8674fcee87" alt="Screenshot 2021-03-16 at 1 06 49 PM"
https://github.com/litmuschaos/litmus/pull/2558 This PR adds a check on the portal's frontend to filter the value but it will be helpful if it's handled at the infra level too.
Another bug due to this issue:
data:image/s3,"s3://crabby-images/1a5cd/1a5cdb784ad72576d2fb792ed8f2c27047743c8c" alt="Screenshot 2021-03-30 at 12 56 59 PM"
This has already been fixed, closing the issue. Thanks for raising the issue!