Reliable Insights

A blog on monitoring, scale and operational Sanity

April 8, 2016

How does a Prometheus Counter work?

There are four standard types of metric in Prometheus instrumentation: Gauge, Counter, Summary and Histogram. Today we'll have a look at the principles around Counters, and how Prometheus differs from other monitoring systems.

Read more

March 17, 2016

New Features in Prometheus 0.17.0

Prometheus 0.17.0 was recently released with 394 commits since 0.16.2, let's see what the big changes are.

Read more

December 30, 2015

Understanding Machine CPU usage

High CPU load is a common cause of issues. Let's look at how to dig into it with Prometheus and the Node exporter.

Read more

December 16, 2015

Which are my biggest metrics?

As your Prometheus usage grows and starts to get loaded, it'd be useful to know which metrics are using the most resources so that you can re-evaluate their utility.

Read more

October 30, 2015

How to have labels for machine roles

It's a best practice with Prometheus that target labels should be constant over a target's entire lifetime. On the other hand it's useful to aggregate metrics across all the machines that are currently Apache servers. How can we do that?

Read more

October 17, 2015

Irate graphs are better graphs

Prometheus 0.16.1 was just released, and with it brings my addition of the irate function. This offers more responsive graphs and higher resolution dashboards.

Read more

October 11, 2015

New Features in Prometheus 0.16.0

Prometheus 0.16.0 has been released with a whopping 312 commits and 89 changes by 18 contributors since 0.15.1. That's a lot to swallow, so let's take a look at main changes and improvements.

Read more

September 16, 2015

Dropping metrics at scrape time with Prometheus

It's easy to get carried away by the power of labels with Prometheus. In the extreme this can overload your Prometheus server, such as if you create a time series for each of hundreds of thousands of users. Thankfully there's a way to deal with this without having to turn off monitoring or deploy a new version of your code.

Read more

August 26, 2015

Conway’s Life in Prometheus

Some monitoring systems are very limited in what calculations you can do with them. Prometheus is not such a system, and today I'm happy to say that half a year after it publicly launched, Prometheus is Turing Complete.

Read more

twitter
youtube
linkedin

Blog   |   Training   |   Book   |   Privacy