reliability-engineering topic
awesome-sre
A curated list of Site Reliability and Production Engineering resources.
scram
Probabilistic Risk Analysis Tool (fault tree analysis, event tree analysis, etc.)
chaostoolkit
Chaos Engineering Toolkit & Orchestration for Developers
chaos-lambda
Serverless chaos monkey for AWS (runs on AWS Lambda) ☁️ 💥
awesome-sre-tools
A curated list of Site Reliability and Production Engineering tools.
Mission-Critical
This repository provides a design methodology and approach for building highly reliable applications on Microsoft Azure for mission-critical workloads.
aws-well-architected-labs
Hands-on labs and code to help you learn, measure, and build using architectural best practices.
reliability
Reliability engineering toolkit for Python - https://reliability.readthedocs.io/en/latest/
chaostoolkit-lib
The Chaos Toolkit core library
paas-cf
GOV.UK PaaS - Cloud Foundry