- The Many Shapes of Site Reliability Engineering: https://medium.com/slalom-build/the-many-shapes-of-site-reliability-engineering-468359866517
- Google SRE: Site Reliability Engineering at a Global Scale: https://thenewstack.io/google-sre-site-reliability-engineering-at-a-global-scale/
- Will circuit breakers solve my problems?: https://brooker.co.za/blog/2022/02/16/circuit-breakers.html
- rfc1912 - Common DNS Operational and Configuration Errors: https://datatracker.ietf.org/doc/html/rfc1912#section-2.4
- EmacsNiftyTricks: https://www.emacswiki.org/emacs/EmacsNiftyTricks
- If you’re not using SSH certificates you’re doing SSH wrong: https://smallstep.com/blog/use-ssh-certificates/
- OpenSLO is a service level objective (SLO) language that declaratively defines reliability and performance targets using a simple YAML specification: https://openslo.com/
- Autometrics is an observability micro-framework built for developers. It makes it easy to instrument any function with the most useful metrics: request rate, error rate, and latency: