promql – Page 4 – Robust Perception | Prometheus Monitoring Experts

July 16, 2018

Absent Alerting for Jobs

Alerting on numbers being too big or small is easy with Prometheus. But what if the numbers go missing?

Published by Brian Brazil in Posts

Tags: alerting, prometheus, promql

May 28, 2018

Extracting raw samples from Prometheus

Sometimes you want the raw samples inside Prometheus for analysis or debugging. How do you get that?

Published by Brian Brazil in Posts

Tags: prometheus, promql

April 30, 2018

Identifying expensive alerting rules

Since Prometheus 2.1 there is a feature to view alerting rule evaluation times in the rules UI. In this blogpost we'll see an example of how this can be used to identify an expensive rule expression.

Published by Conor Broderick in Posts

Tags: alerting, prometheus, promql

April 23, 2018

Why can count(x > 5) not return 0?

When using the count aggregation operator you may have noticed that it sometimes returns nothing rather than 0. Why is this?

Published by Brian Brazil in Posts

Tags: prometheus, promql

March 19, 2018

Alerting on crash loops with Prometheus

If your applications are restarting regularly, whether due to segfaults or OOMs, it'd be nice to know.

Published by Brian Brazil in Posts

Tags: alerting, prometheus, promql

February 12, 2018

Alerting on gauges in Prometheus 2.0

One of the major changes introduced in Prometheus 2.0 was that of staleness handling. Previously for instant vectors, Prometheus would return a point up to 5 minutes in the past which caused a number of different issues.

Published by Conor Broderick in Posts

Tags: alerting, prometheus, promql

February 5, 2018

What percentage of time is my service down for?

Have you ever wondered what percentage of time a given service or application spends up or down?

Published by Conor Broderick in Posts

Tags: blackbox_exporter, prometheus, promql

January 1, 2018

Rule groups for hierarchical aggregation

Prometheus 2.0 brought with it rule groups, making hierarchical aggregation easier than ever.

Published by Brian Brazil in Posts

Tags: prometheus, promql

December 11, 2017

Why are Prometheus histograms cumulative?

Have you ever wondered why the buckets in histograms are not just counters of events that fall into each bucket?

Published by Brian Brazil in Posts

Tags: design, prometheus, promql, relabelling

December 4, 2017

Using time series as alert thresholds

Usually alert thresholds are hardcoded in the alert. In more sophisticated setups, it would be useful for it to be parameterised based on another time series.

Published by Brian Brazil in Posts

Tags: alerting, prometheus, promql

Reliable Insights