Discussion about this post

Donovan Baarda

It's also related to scale. The Google-style approach to SRE makes the amount of work needed to run a reliable service scale with service complexity, not service size. It's 10x as many different binaries that hurts, not 10x as many servers. Old-school manual system administration scales with service size, so it's not so much the 10x as many different binaries that hurts; it's the 10x as many servers to run them on.

Companies in their early stages have high service complexity relative to scale, so burning dedicated headcount on automating how the service is run takes more effort than just running it manually. At some point you reach a scale-versus-complexity tipping point where SRE makes more sense.
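
As a rough illustration of that tipping point, here is a minimal sketch of a toy cost model. Everything in it is an assumption made for illustration: the function names, the linear cost shapes, and the per-server and per-binary figures are hypothetical, not measurements from any real team.

```python
import math


def manual_ops_cost(servers: int, cost_per_server: float = 1.0) -> float:
    """Toy model: manual sysadmin effort grows with the number of servers."""
    return servers * cost_per_server


def sre_cost(binaries: int, cost_per_binary: float = 20.0) -> float:
    """Toy model: SRE-style effort grows with the number of distinct binaries."""
    return binaries * cost_per_binary


def tipping_point_servers(
    binaries: int,
    cost_per_server: float = 1.0,
    cost_per_binary: float = 20.0,
) -> int:
    """Smallest server count at which the SRE model is strictly cheaper."""
    threshold = binaries * cost_per_binary / cost_per_server
    return math.floor(threshold) + 1


if __name__ == "__main__":
    # Hypothetical early-stage company: 5 binaries on 10 servers.
    print(manual_ops_cost(10), sre_cost(5))
    # Same 5 binaries, but the fleet has grown to 1000 servers.
    print(manual_ops_cost(1000), sre_cost(5))
    print(tipping_point_servers(5))
```

With these made-up numbers, running 5 binaries manually is cheaper on a small fleet, and the SRE approach only wins once the fleet passes the computed server count. Real costs are unlikely to be this linear, so treat the crossover as a shape to reason about, not a figure to plan with.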

Grzegorz Wierzowiecki

Here's a 3-minute audio version of "Trouble starting an SRE team?" from Wednesday Wisdom, converted using the recast app.

https://app.letsrecast.ai/r/a5b0aae3-bbc9-48d2-9b36-39596f5ae587
