sre-university icon indicating copy to clipboard operation
sre-university copied to clipboard

A complete study plan to become a Site Reliability Engineer.

SRE University

Site Reliability Engineering

  • (Book) Site Reliability Engineering - https://landing.google.com/sre/book/index.html
  • (Book) Site Reliability Workbook - https://landing.google.com/sre/workbook/toc/
  • (Book) Building Secure and Reliable Systems - https://landing.google.com/sre/resources/foundationsandprinciples/srs-book/
  • (Course) Intro to DevOps - https://www.udacity.com/course/intro-to-devops--ud611
  • (Course) Google Cloud Platform for Systems Operations - https://www.coursera.org/specializations/gcp-sysops
  • (Course) Measuring and Managing Reliability - https://www.coursera.org/learn/site-reliability-engineering-slos

Operating Systems

  • (Course) Introduction to Operating Systems - https://www.udacity.com/course/introduction-to-operating-systems--ud923
  • (Course) Advanced Operating Systems - https://www.udacity.com/course/advanced-operating-systems--ud189

Automation

  • (Tutorial) Ansible - https://www.digitalocean.com/community/tutorials/configuration-management-101-writing-ansible-playbooks
  • (Course) Terraform - https://www.udemy.com/course/learn-devops-infrastructure-automation-with-terraform/

Distributed Systems

  • (Tutorial) Introduction to Distributed Systems Design - http://www.hpcs.cs.tsukuba.ac.jp/~tatebe/lecture/h23/dsys/dsd-tutorial.html

Networking

  • (Book) Understanding Linux Network Internals - http://shop.oreilly.com/product/9780596002558.do

Programming Languages

Python

  • (Book) Learn Python 3 The Hard Way - https://learnpythonthehardway.org/python3/
  • (Course) Developing Scalable Apps in Python - https://www.udacity.com/course/developing-scalable-apps-in-python--ud858

Go

  • (Book) The Go Programming Language - https://www.amazon.com/Programming-Language-Addison-Wesley-Professional-Computing/dp/0134190440
  • (Webinar) Go Language for Ops and Site Reliability Engineering - https://www.youtube.com/watch?v=Q_H4hrUez80
  • (Hands On) https://gopherlabs.kubedaily.com/

Production Web App

  • (Tutorial) Building for Production: Web Applications - https://www.digitalocean.com/community/tutorial_series/building-for-production-web-applications
  • (Book) Production Ready Microservices - https://www.amazon.com/gp/product/1491965975/

Monitoring and Logging

  • (Course) Monitoring and Alerting with Prometheus - https://www.udemy.com/course/monitoring-and-alerting-with-prometheus/
  • (Book) Prometheus UP and Running - https://www.amazon.com/Prometheus-Infrastructure-Application-Performance-Monitoring/dp/1492034142

Continuous Integration | Continuous Delivery

  • (Course) Continuous Deliver Better Software - https://www.udemy.com/course/learn-devops-continuously-deliver-better-software/

Containers

  • (Course) Docker for Devops - https://www.udemy.com/course/docker-tutorial-for-devops-run-docker-containers/

Web Servers

Nginx

  • (Course) Nginx Fundamentals - https://www.udemy.com/course/nginx-fundamentals/

Cluster Management

Kubernetes

  • (Tutorial) Kubernetes Bootcamp - https://kubernetes.io/docs/tutorials/kubernetes-basics/
  • (Course) Scalable Microservices with Kubernetes - https://www.udacity.com/course/scalable-microservices-with-kubernetes--ud615
  • (Tutorial) Kubernetes Tutorial for Beginners - https://spacelift.io/blog/kubernetes-tutorial

Cloud

Amazon AWS

  • (Tutorial) Amazon AWS - https://aws.amazon.com/getting-started/tutorials/

Post-Mortem

  • Post-Mortem Template - https://sre.google/sre-book/example-postmortem/

Websites

  • http://highscalability.com
  • https://sreweekly.com

DevOps | SRE Roadmap

  • DevOps Roadmap - https://roadmap.sh/devops

Further Reading

Image Credits: https://github.com/kamranahmedse/developer-roadmap