SRE topic

Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.

List SRE repositories

learning

1.7k Stars · 87 Forks

Learning Shell, Python, Golang, System, Network

rundeck

5.4k Stars · 899 Forks

Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts

SREWorks

1.7k Stars · 385 Forks · 33 Watchers

Cloud Native DataOps & AIOps Platform

cloudprober

1.4k Stars · 152 Forks

[Moved to cloudprober/cloudprober] Active monitoring software to detect failures before your customers do.

sre-interview-prep-guide

6.7k Stars · 1.7k Forks · 194 Watchers

Site Reliability Engineer Interview Preparation Guide

squzy

478 Stars · 24 Forks

Squzy is a high-performance open-source monitoring, incident, and alert system written in Golang with Bazel and love. Welcome to free SRE.

howtheyaws

668 Stars · 89 Forks

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)

gossh

169 Stars · 44 Forks

🚀🚀 A high-performance, high-concurrency SSH tool written in Go. It is 10 times faster than Ansible. If you need much more performance and better ease of use, you will love it.

school-of-sre

7.7k Stars · 698 Forks

At LinkedIn, we use this curriculum to onboard our entry-level talent into the SRE role.