reliability-engineering topic

List reliability-engineering repositories

litmus

4.3k
Stars
680
Forks
Watchers

Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd....

OpenShift-Guide

139
Stars
31
Forks
Watchers

OpenShift Guide. Learn about the Red Hat OpenShift Container Platform, Data Science, Code Ready Containers, Podman, Buildah, and Kubernetes.

SurPyval

47
Stars
5
Forks
Watchers

A Python package for survival analysis. The most flexible survival analysis package available. SurPyval can work with arbitrary combinations of observed, censored, and truncated data. SurPyval can als...

sre-checklist

2.2k
Stars
245
Forks
17
Watchers

A checklist of anyone practicing Site Reliability Engineering

deep_cox_mixtures

29
Stars
7
Forks
Watchers

Code for the paper "Deep Cox Mixtures for Survival Regression", Machine Learning for Healthcare Conference 2021

stable-systems-checklist

50
Stars
9
Forks
Watchers

An opinionated list of attributes and policies that need to be met in order to establish a stable software system.

k6-docs

80
Stars
204
Forks
Watchers

The k6 documentation website.

sreworkbook-templates-md

37
Stars
16
Forks
Watchers

A collection templates ported from the SRE Workbook