prometheus – Page 8 – Robust Perception | Prometheus Monitoring Experts

July 29, 2019

Step and query_range

Graphs from Prometheus use the query_range endpoint, and there's a non-trivial amount of confusion that it's more magic than it actually is.

Published by Brian Brazil in Posts

Tags: grafana, prometheus, promql

July 22, 2019

How should pipelines be monitored?

For online serving systems it's fairly well known that you should look for request rate, errors and duration. What about offline processing pipelines though?

Published by Brian Brazil in Posts

Tags: alerting, best practices, prometheus, promql

July 15, 2019

New Features in Prometheus 2.11.0

Prometheus 2.11.0 is now out, following on from 2.10.0 with many fixes and improvements.

Published by Brian Brazil in Posts

Tags: prometheus, releases

July 8, 2019

Switching between Prometheus servers in Grafana using data source variables

Having to maintain dashboards for every Prometheus server you have would be a bit annoying. Thankfully Grafana has a feature for this.

Published by Brian Brazil in Posts

Tags: grafana, prometheus

July 1, 2019

Why can’t I use the nodename of a machine as the instance label?

The machine knows its own name, couldn't Prometheus use it?

Published by Brian Brazil in Posts

Tags: design, prometheus, service discovery

June 24, 2019

How much disk space do Prometheus blocks use?

Memory for ingestion is just one part of the resources Prometheus uses, let's look at disk blocks.

Published by Brian Brazil in Posts

Tags: prometheus, tsdb

June 17, 2019

Idempotent Cron Jobs are Operable Cron Jobs

Having to reconstruct how far a failed cron job had gotten and what exact parameters it was run with can be error prone and time consuming. There is a better way.

Published by Brian Brazil in Posts

Tags: alerting, best practices, prometheus, reliability

June 10, 2019