SRE topic

Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.

List SRE repositories

gecho

7
Stars
0
Forks
Watchers

Gecho - a HTTP request echo debugging service

command-line-cheat-sheet

49
Stars
22
Forks
Watchers

📝 A place to quickly lookup commands (bash, vim, git, AWS, Docker, Terraform, Ansible, kubectl)

awesome-sre

11.6k
Stars
1.5k
Forks
Watchers

A curated list of Site Reliability and Production Engineering resources.

DevOps-README.md

450
Stars
27
Forks
Watchers

What to Read to Learn More About DevOps

tutorials

3.5k
Stars
2.6k
Forks
90
Watchers

DevOps Tutorials

The principles that help to deploy safely to the production environment. If you like it:

devops-exercises

64.2k
Stars
14.1k
Forks
1.2k
Watchers

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

howtheysre

9.0k
Stars
762
Forks
Watchers

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)

chaostoolkit

1.8k
Stars
182
Forks
Watchers

Chaos Engineering Toolkit & Orchestration for Developers

chaos-ssm-documents

265
Stars
73
Forks
Watchers

Collection of AWS SSM Documents to perform Chaos Engineering experiments