Reliable Insights

A blog on monitoring, scale and operational Sanity

November 19, 2018

Unit testing rules with Prometheus

As of 2.5.0, promtool lets you unit test your recording rules.

Read more

November 12, 2018

New Features in Prometheus 2.5.0

Prometheus 2.5.0 is now out, following on from 2.4.0 back in September with many fixes and improvements.

Read more

November 5, 2018

Probing DNS servers with the Blackbox exporter

Among the Blackbox exporter's probe types is DNS.

Read more

October 29, 2018

How many metrics should an application return?

While each application is different, a rough idea of how many metrics there should be would be useful.

Read more

October 22, 2018

Debugging Blackbox exporter failures

Ever wanted more information about Blackbox probe failures?

Read more

October 15, 2018

Graph top N time series in Grafana

As of Grafana 5.3.0 there's a feature that allows correct graphing of the top N series over a duration.

Read more

October 8, 2018

Checking for HTTP 200s with the Blackbox Exporter

It's easy to check if HTTP and HTTPS endpoints are working with the Blackbox Exporter.

Read more

October 1, 2018

What is a job label for?

The job label is one of the labels your targets will always have. So how can you use it?

Read more

September 24, 2018

Alerting on approaching open file limits

In a previous post we looked at dealing with reaching the open file limit. How about alerting before it happens?

Read more

September 17, 2018

New Features in Prometheus 2.4.0

Prometheus 2.4.0 is now out, following on from 2.3.0 back in June with many fixes and improvements.

Read more
