Usually alert thresholds are hardcoded in the alert. In more sophisticated setups, it would be useful for it to be parameterised based on another time series.
A blog on monitoring, scale and operational Sanity
Usually alert thresholds are hardcoded in the alert. In more sophisticated setups, it would be useful for it to be parameterised based on another time series.
While eager consumers flock to retailers both online and in-store for big savings during the infamous day after Thanksgiving known as Black Friday, one must wonder the cost incurred by companies accommodating both the crowds swarming their store floors and the increase in traffic from online shoppers.
In this blogpost we used Prometheus and the Blackbox exporter to observe the increase in latency experienced by some of the top online retailers in the US and UK.
Prometheus 2.1 added an API endpoint to take snapshots, let's see how to use it.
Every now and then someone asks what metric type a increasing timestamp should be. Let's take a look.
Prometheus 2.x has a different data format from 1.x, so how do you access your old data from 2.x?
At what point should you consider an alert resolved?
With the upcoming release of Prometheus 2.0 comes a new format for writing recording and alerting rules.
Switching to Prometheus for monitoring but still have older Nagios checks you're not quite ready to let go of just yet? Now you can monitor and alert on Nagios checks with the nrpe_exporter!