All roadmaps
Roadmap

Site Reliability Engineer (SRE)

From Linux internals to running planet-scale systems, the complete path to a job-ready Site Reliability Engineer.

20stages133topics~84hours

Curated from the best, MDN · Kubernetes · AWS · OWASP · Google SRE & more

SREs command $180K-$350K+ at FAANG. The role is expanding as every company realizes they need production excellence, not just more features.

The complete path, 12 of 133 topics have lessons here; the other 121 are marked learn anywhere. We won't pretend we cover everything.

01
Stage 1 / 20 · 6 topics · 1 lessons

Foundations: What SRE Is

Understand the discipline, its origins, and how SRE differs from and relates to DevOps and traditional ops.

02
Stage 2 / 20 · 8 topics · 0 lessons

Linux & Operating System Internals

The OS is the substrate of everything SRE. Master processes, memory, I/O, and the kernel boundary.

03
Stage 3 / 20 · 9 topics · 0 lessons

Networking Fundamentals

Distributed systems are networked systems. Know the stack from cables to TLS to load balancers.

04
Stage 4 / 20 · 7 topics · 0 lessons

Programming & Automation

SREs write software. Build solid coding skills plus the scripting that automates operations away.

05
Stage 5 / 20 · 7 topics · 0 lessons

Distributed Systems Theory

The mental models behind why large systems fail in surprising ways.

06
Stage 6 / 20 · 8 topics · 1 lessons

Cloud Platforms

Modern SRE runs on the cloud. Know the core service categories and at least one provider deeply.

07
Stage 7 / 20 · 4 topics · 0 lessons

Containers

Packaging and isolating workloads, the foundation under orchestration.

08
Stage 8 / 20 · 9 topics · 0 lessons

Kubernetes & Orchestration

The dominant orchestration platform. Operating it well is central to most SRE roles.

09
Stage 9 / 20 · 6 topics · 2 lessons

Infrastructure as Code & Config Mgmt

Declarative, version-controlled infrastructure is non-negotiable at scale.

10
Stage 10 / 20 · 7 topics · 3 lessons

CI/CD & Release Engineering

Safe, fast, repeatable delivery is how reliability ships to production.

11
Stage 11 / 20 · 8 topics · 0 lessons

Observability & Monitoring

You cannot operate what you cannot see. The instrumentation core of SRE.

12
Stage 12 / 20 · 6 topics · 0 lessons

SLIs, SLOs & Reliability Engineering

The quantitative heart of SRE: defining, measuring, and budgeting reliability.

13
Stage 13 / 20 · 8 topics · 0 lessons

Incident Management & On-Call

When things break, this is the SRE's defining moment. Respond, mitigate, and learn.

14
Stage 14 / 20 · 6 topics · 0 lessons

Capacity, Performance & Scalability

Ensuring systems have the headroom to serve load, and finding bottlenecks when they don't.

15
Stage 15 / 20 · 6 topics · 2 lessons

Resilience & Chaos Engineering

Designing systems that degrade gracefully and proving it deliberately.

16
Stage 16 / 20 · 6 topics · 0 lessons

Databases & Stateful Systems

Stateful services are the hardest to operate reliably, and where outages hurt most.

17
Stage 17 / 20 · 6 topics · 2 lessons

Security & Compliance

Reliability includes security. SREs own the operational side of keeping systems safe.

18
Stage 18 / 20 · 5 topics · 1 lessons

Platform Engineering & Service Mesh

Advanced infrastructure SREs increasingly build internal platforms and run meshes.

19
Stage 19 / 20 · 5 topics · 0 lessons

Automation, AIOps & Modern Practice

Where SRE is heading: heavy automation, ML-assisted ops, and running AI systems.

20
Stage 20 / 20 · 6 topics · 0 lessons

Career, Interviews & Soft Skills

Landing and thriving in an SRE role takes more than technical depth.

You're job-ready.

Clear every stage, earn the certificate, and walk into interviews prepared. The complete path, nothing hidden, no gaps.

Destination reached