[Feature][framework] CI/CD Domain Layer Entities Design & Development
Search before asking
- [X] I had searched in the issues and found no similar feature requirement.
Description
Where we at
The following status is how we support the CI/CD data analysis :
- We collect Gitlab Pipeline data into Tool Layer as
_tool_gitlab_pipeline - We collect Jenkins data into both Tool Layer and Domain Layer as
_tool_jenkins_jobs_tool_jenkins_buildsjobsandbuilds
In another word, we modeled the CI/CD Domain Layer Tables based on Jenkins, and the Gitlab Pipeline data never get into Domain Layer because they are quite different in terms of modeling. This is also true for other kinds of CI/CD systems.
Why we need it
However, CI/CD plays an important role during the R&D process, metrics like Build Success Rate, etc (@Startrekzky please add more examples) depend on it.
What is the problem
CI/CD modeling varies in different platforms, there is no de facto modeling. It is hard to design a sensible model for CI/CD Domain Layer Data.
How should we approach
I propose that we address the problem by the following step:
- Set up a couple of designated charts depending on CI/CD data that we would like to have @Startrekzky
- Assign a Veteran Developer to investigate the modeling of the most popular platforms, such as Github/Gitlab/Jenkins/ArgoCD, and propose a CI/CD Domain Layer Table design
- The design shall be evaluated by the PPMC members in 3 workdays while all Committers could share their opinions
- A Data-Oriented PoC (a set of tables with data from different platforms) should be presented for the Analyst (@Startrekzky @leglar @hezyin) to evaluate
- And then we implement
Use case
- We may create Standard CI/CD charts based on Unified Domain Layer Tables
- Users may exploit the Unified CI/CD Domain Layer Tables to create customized charts
Related issues
No response
Are you willing to submit a PR?
- [ ] Yes I am willing to submit a PR!
Code of Conduct
- [X] I agree to follow this project's Code of Conduct
CI/CD Metrics:
- No. of Builds (Similar to Deployment Frequency- D in DORA, which can be interpreted as the total No. of builds that meet a specific condition)
- Build success rate %
- Build result distribution %
- Build duration
-
Mean Lead Time for Changes (Depending on the definition of a
change, it may be a “build” that meets specific conditions.) -
Mean Time to Recovery (MTTR) (Depending on the definition of
recovery, it may start when a “bug” is created, and end when it's fixed.) -
Change Failure Rate (Depending on the definition of a
changeandfailure, it may be the failing percentage of “build” that meets specific conditions.)
Analysis Dimensions
- time: e.g. measure the average build duration over the calendar date
- job: e.g. compare the average build duration under different jobs
- repo (code base): e.g. compare the average build duration triggered by different repos
I created three issues for separate CI/CD tools. We will investigate those tools first, and after a new discussion, we will implement the collector and extractor of the plugins.
- https://github.com/apache/incubator-devlake/issues/2666
- https://github.com/apache/incubator-devlake/issues/2637
- https://github.com/apache/incubator-devlake/issues/2583
- https://github.com/apache/incubator-devlake/issues/2584
- https://github.com/apache/incubator-devlake/issues/2585
- https://github.com/apache/incubator-devlake/issues/2596
- https://github.com/apache/incubator-devlake/issues/2604
According to our discussion from mailing list, Metrics to be considered:
- No. of Builds (Similar to Deployment Frequencyhttps://linearb.io/blog/dora-engineering-metrics/#deployment-frequency- D in DORA, which can be interpreted as the total No. of builds that meet a specific condition)
- Build success rate %
- Build result distribution %
- Build duration in hour -- should this be minute? @Startrekzky
- Mean Lead Time for Changeshttps://linearb.io/blog/dora-engineering-metrics/#mean-lead-time-for-changes (Depending on the definition of a change, it may be a “build” that meets specific conditions. measure it beginning from merging the code to the main branch.)
- Mean Time to Recovery (MTTR)https://linearb.io/blog/dora-engineering-metrics/#mean-time-to-recovery-mttr (Depending on the definition of recovery, it may start when a “bug” is created, and end when it's fixed.)
- Change Failure Ratehttps://linearb.io/blog/dora-engineering-metrics/#change-failure-rate (Depending on the definition of a change and failure, it may be the failing percentage of “build” that meets specific conditions. Normally, this metric is calculated by counting the number of times a deployment results in a failure and dividing by the number of total deployments to get an average number. A lower average is better.)
Analysis Dimensions to be considered:
- time: e.g. measure the average build duration over the calendar date
- job: e.g. compare the average build duration under different jobs
- repo (code base): e.g. compare the average build duration triggered by different repos
In addition, the naming of tables should avoid confusion for users of any CI/CD tools, such as Jenkins, GitLabCI, GitHub Actions or CircleCI. For example, job means different entity in GitLabCI and Jenkins, thus we should consider not to use it in the domain layer.
We also need to consider Azure DevOps/CircleCI/Travis CI/AWS CodePipeline
This ticket can be closed for now