Reliable Insights

A blog on monitoring, scale and operational Sanity

December 11, 2015

It’s overloaded? Try harder!

Failed requests are a fact of life: network weirdness and machine failures are inevitable. It can be tempting to simply retry the request when this happens, but doing so may cause more harm than good.
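
One way to see how retries can hurt is to count the worst-case load they generate. The sketch below is an illustration, not code from the post: it assumes a hypothetical chain of services in which every layer independently makes up to a fixed number of attempts against the layer beneath it, and the function name and parameters are invented for the example.

```python
def worst_case_attempts(attempts_per_layer: int, layers: int) -> int:
    """Worst-case requests reaching the deepest backend for one user request.

    Assumes (hypothetically) that each of `layers` services makes up to
    `attempts_per_layer` tries against the service below it, and that
    every try fails, so the attempts multiply at each layer.
    """
    if attempts_per_layer < 1 or layers < 0:
        raise ValueError("attempts_per_layer must be >= 1 and layers >= 0")
    return attempts_per_layer ** layers


if __name__ == "__main__":
    # With 3 attempts per layer, the load on the deepest backend grows
    # geometrically with the depth of the call chain.
    for depth in (1, 2, 3):
        print(depth, worst_case_attempts(3, depth))
```

Under these assumptions, a backend that is already overloaded receives many times its normal traffic at exactly the moment it can least handle it.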

Published by Brian Brazil in Posts

Tags: best practices, reliability, rpc

September 28, 2015

Healthchecking is Not Transitive

Systems such as Consul perform healthchecking of local services and expose this information to other machines within the cluster. Does this mean that the service will work when you try to talk to it?

Published by Brian Brazil in Posts

Tags: best practices, consul, healthchecking, reliability, rpc
