15m-ops-break
- The risk matrix (impact/frequency) and the long tail - Failure domains - Blast radius/bulkheading - Preconditions/limits of automation - Runbooks - Monitoring/observability
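A minimal sketch of the impact/frequency matrix from the outline above. The 1 to 5 scales, the `Risk` class, the thresholds in `long_tail`, and the sample risks are all hypothetical choices for illustration, not something the outline prescribes.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Risk:
    name: str
    impact: int      # hypothetical scale: 1 (minor) .. 5 (catastrophic)
    frequency: int   # hypothetical scale: 1 (rare) .. 5 (constant)

    @property
    def score(self) -> int:
        # Classic risk-matrix cell value: impact multiplied by frequency.
        return self.impact * self.frequency


def rank(risks: List[Risk]) -> List[Risk]:
    """Order risks by matrix score, highest first."""
    return sorted(risks, key=lambda r: r.score, reverse=True)


def long_tail(risks: List[Risk], max_frequency: int = 1, min_impact: int = 4) -> List[Risk]:
    """Pick out rare but severe risks, which a plain score ranking tends to bury."""
    return [r for r in risks if r.frequency <= max_frequency and r.impact >= min_impact]


# Made-up sample data.
RISKS = [
    Risk("disk fills on log host", impact=2, frequency=4),
    Risk("region-wide outage", impact=5, frequency=1),
    Risk("bad deploy", impact=3, frequency=3),
]

if __name__ == "__main__":
    for r in rank(RISKS):
        print(r.score, r.name)
    print("long tail:", [r.name for r in long_tail(RISKS)])
```

With this sample data the rare, severe risk ranks last by score, which is one way to see why the long tail deserves a separate look.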
Git
Intro to Git - Merkle trees - Content addressable storage - Blobs, commits, heads - Repo, work tree, staging area - Merges vs rebase
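As a rough illustration of content-addressable storage, the sketch below computes the ID Git assigns to a blob: the SHA-1 of a `blob <size>\0` header followed by the raw content. It assumes the default SHA-1 object format (not a SHA-256 repository); the function name `git_blob_id` is made up for this example.

```python
import hashlib


def git_blob_id(content: bytes) -> str:
    """Return the object ID Git would give a blob with this content.

    Assumes the SHA-1 object format. The ID depends only on the bytes,
    so identical content always maps to the same object.
    """
    header = b"blob %d\0" % len(content)  # object type, size, NUL separator
    return hashlib.sha1(header + content).hexdigest()


if __name__ == "__main__":
    print(git_blob_id(b"test content\n"))
```

The result should match `echo 'test content' | git hash-object --stdin` in a SHA-1 repository. Trees and commits are hashed the same way with their own headers, which is what makes the commit graph a Merkle structure.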
Compare 3 ways of getting to the internet with packet capture and OSI diagrams - Router - packets not touched only moved - NAT - Layer 3/4 address rewrite -...
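To make the router versus NAT contrast concrete, here is a toy model. It is a sketch only: packets are reduced to (address, port) pairs, the `Nat` class and `route` function are hypothetical names, and real NAT also tracks protocol, timeouts, and connection state.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple

Endpoint = Tuple[str, int]          # (IP address, port)
Packet = Tuple[Endpoint, Endpoint]  # (source, destination)


def route(src: Endpoint, dst: Endpoint) -> Packet:
    """A plain router forwards the packet with addresses untouched."""
    return src, dst


@dataclass
class Nat:
    public_ip: str
    next_port: int = 40000  # arbitrary starting port for this sketch
    out_map: Dict[Endpoint, int] = field(default_factory=dict)
    in_map: Dict[int, Endpoint] = field(default_factory=dict)

    def outbound(self, src: Endpoint, dst: Endpoint) -> Packet:
        """Rewrite the private source (layer 3 address, layer 4 port)."""
        port = self.out_map.get(src)
        if port is None:
            port = self.next_port
            self.next_port += 1
            self.out_map[src] = port
            self.in_map[port] = src
        return (self.public_ip, port), dst

    def inbound(self, src: Endpoint, dst: Endpoint) -> Optional[Packet]:
        """Rewrite the destination back, or drop if no mapping exists."""
        if dst[0] != self.public_ip or dst[1] not in self.in_map:
            return None
        return src, self.in_map[dst[1]]


if __name__ == "__main__":
    nat = Nat("203.0.113.7")
    print(nat.outbound(("10.0.0.5", 5555), ("198.51.100.1", 443)))
```

In a packet capture, the difference shows up as the source address and port changing between the inside and outside interfaces for NAT, and staying identical for a router.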
Tools
Why tools - Build your tools - Get your tools - Combine your tools - Master your tools (only a few, choose carefully)
- TLS handshake, latency - SNI headers, encrypted SNI - Certificates and validation (client and server) - TLS record sizes and TCP MSS - Certificate chain and bundles
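The handshake, SNI, and validation items above can be poked at with Python's standard `ssl` module. This is a sketch under assumptions: `handshake_info` and `make_context` are hypothetical helper names, the timing covers only the TLS handshake (not DNS or the TCP connect), and the host is whatever you pass in.

```python
import socket
import ssl
import time
from typing import Any, Dict


def make_context() -> ssl.SSLContext:
    """Default client context: verifies the chain and checks the hostname."""
    return ssl.create_default_context()


def handshake_info(host: str, port: int = 443, timeout: float = 5.0) -> Dict[str, Any]:
    """Connect, time the TLS handshake, and report basic certificate fields."""
    ctx = make_context()
    with socket.create_connection((host, port), timeout=timeout) as raw:
        start = time.perf_counter()
        # server_hostname is sent as SNI and is also used for hostname validation.
        with ctx.wrap_socket(raw, server_hostname=host) as tls:
            elapsed = time.perf_counter() - start
            cert = tls.getpeercert()
            return {
                "handshake_seconds": elapsed,
                "version": tls.version(),
                "cipher": tls.cipher(),
                "subject": cert.get("subject"),
                "not_after": cert.get("notAfter"),
            }


if __name__ == "__main__":
    import sys
    print(handshake_info(sys.argv[1]))
```

Run it with a hostname as the only argument. A validation failure (expired certificate, wrong name, incomplete chain) surfaces as `ssl.SSLCertVerificationError` rather than a result.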
Closely related to resilience engineering, IMO - Misconceptions - How these can be calculated in complex systems (especially ones that depend on public clouds)
- What is a grid and why you need it - K8s, Nomad, ECS, etc - Responsibilities of a grid - Scheduling - Process management - Resource management - Auxiliary...
- What is init - What is systemd - `systemctl` - units, services, targets - socket activation - dependencies - `systemd-analyze`
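For the socket activation item, the sketch below shows the receiving side of the protocol: systemd passes already-open sockets starting at file descriptor 3 and announces them through the `LISTEN_PID` and `LISTEN_FDS` environment variables. The helper names are hypothetical, and this ignores `LISTEN_FDNAMES`; in practice a library binding for `sd_listen_fds` may be preferable.

```python
import os
import socket
from typing import List, Mapping, Optional

SD_LISTEN_FDS_START = 3  # first descriptor passed by systemd


def listen_fds(environ: Optional[Mapping[str, str]] = None,
               pid: Optional[int] = None) -> List[int]:
    """Return the file descriptors handed over via socket activation."""
    environ = os.environ if environ is None else environ
    pid = os.getpid() if pid is None else pid
    # The variables only apply to the process they were set for.
    if environ.get("LISTEN_PID") != str(pid):
        return []
    try:
        count = int(environ.get("LISTEN_FDS", "0"))
    except ValueError:
        return []
    return list(range(SD_LISTEN_FDS_START, SD_LISTEN_FDS_START + max(count, 0)))


def sockets_from_systemd() -> List[socket.socket]:
    """Wrap the inherited descriptors as socket objects (Python 3.7+)."""
    return [socket.socket(fileno=fd) for fd in listen_fds()]


if __name__ == "__main__":
    print("inherited descriptors:", listen_fds())
```

Started outside systemd, this prints an empty list; started from a service paired with a `.socket` unit, it should list one descriptor per listening socket.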