Reliable Insights

A blog on monitoring, scale and operational Sanity

June 17, 2019

Idempotent Cron Jobs are Operable Cron Jobs

Having to reconstruct how far a failed cron job had gotten and what exact parameters it was run with can be error prone and time consuming. There is a better way.

Read more

June 10, 2019

Monitoring memcached with Prometheus

As with many applications, there's an exporter for memcached.

Read more

June 3, 2019

Finding churning targets in Prometheus with scrape_series_added

Prometheus 2.10 has a new metric to make finding churn easier.

Read more

May 27, 2019

New Features in Prometheus 2.10.0

Prometheus 2.10.0 is now out, following on from 2.9.0 with many fixes and improvements.

Read more

May 20, 2019

Analyse a metric by kernel version

Using PromQL you can combine metrics for analysis.

Read more

May 13, 2019

Be discerning in what dashboards you share with users

There's no way that sharing metrics with your users or customers can go wrong. Right?

Read more

April 29, 2019

Avoid the Wall of Graphs

Data is not the same as information.

Read more

April 22, 2019

Using snmpbulkwalk to debug snmp_exporter issues

Many problems with the snmp_exporter turn out to actually be issues elsewhere, but how can you tell?

Read more

April 15, 2019

New Features in Prometheus 2.9.0

Prometheus 2.9.0 is now out, following on from 2.8.0 with many fixes and improvements.

Read more

twitter
youtube
linkedin

Blog   |   Training   |   Book   |   Privacy