What does and doesn't make a good grouping key?
A blog on monitoring, scale and operational sanity
We recently looked at creating silences from the command line. What about from programs?
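As a rough sketch of what this could look like from a program, the snippet below builds a request for Alertmanager's v2 silences endpoint using only the Python standard library. The Alertmanager address, the matcher, the author and the comment are placeholder assumptions, and the request is only constructed here, not sent.

```python
import json
from datetime import datetime, timedelta, timezone
from urllib.request import Request

# Assumed address of a local Alertmanager; adjust for your setup.
ALERTMANAGER_URL = "http://localhost:9093"


def build_silence_request(matchers, hours, author, comment, now=None):
    """Build (but do not send) a POST request that would create a silence."""
    start = now or datetime.now(timezone.utc)
    end = start + timedelta(hours=hours)
    payload = {
        # One equality matcher per label name/value pair.
        "matchers": [
            {"name": name, "value": value, "isRegex": False}
            for name, value in matchers.items()
        ],
        "startsAt": start.isoformat(),
        "endsAt": end.isoformat(),
        "createdBy": author,
        "comment": comment,
    }
    return Request(
        ALERTMANAGER_URL + "/api/v2/silences",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical example: silence one instance for two hours.
request = build_silence_request(
    {"instance": "db1:9100"}, hours=2, author="example", comment="Planned maintenance"
)
# Sending it would be: urllib.request.urlopen(request)
```

Keeping construction separate from sending makes the payload easy to inspect or unit test before it touches a real Alertmanager.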
It's common to want reports from Prometheus, such as how many requests failed over an entire month.
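One way such a report might be requested is a single instant query using increase() over the whole period. The sketch below only builds the URL for Prometheus's /api/v1/query endpoint; the server address, the metric selector and the fixed 30-day window are illustrative assumptions, not values from the post.

```python
from urllib.parse import urlencode

# Assumed address of a local Prometheus; adjust for your setup.
PROMETHEUS_URL = "http://localhost:9090"


def report_query_url(selector, days, end_timestamp):
    """Return an instant-query URL summing the increase over the last `days` days."""
    query = "sum(increase(%s[%dd]))" % (selector, days)
    # `time` pins the evaluation to the end of the reporting period.
    params = urlencode({"query": query, "time": end_timestamp})
    return PROMETHEUS_URL + "/api/v1/query?" + params


# Hypothetical selector for failed requests; 1706745600 is 2024-02-01 00:00:00 UTC.
url = report_query_url('http_requests_total{code=~"5.."}', 30, 1706745600)
```

Real months are not always 30 days long, so a production report would compute the range from the calendar rather than hard-coding it.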
The Python client can be used to check if a given metrics output is valid OpenMetrics format.
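A minimal sketch of such a check, assuming the prometheus_client package is installed: feed the text through the client's OpenMetrics parser and treat a ValueError as invalid input. The helper name is_valid_openmetrics and the sample payloads are made up for illustration.

```python
from prometheus_client.openmetrics.parser import text_string_to_metric_families


def is_valid_openmetrics(text):
    """Return (ok, error_message) after parsing the whole exposition."""
    try:
        # The parser is a generator, so it must be consumed to surface errors.
        for _family in text_string_to_metric_families(text):
            pass
    except ValueError as err:
        return False, str(err)
    return True, ""


valid = "# TYPE http_requests counter\nhttp_requests_total 3\n# EOF\n"
# OpenMetrics requires a trailing "# EOF" line, so this one should be rejected.
invalid = "# TYPE http_requests counter\nhttp_requests_total 3\n"

valid_ok, _ = is_valid_openmetrics(valid)
invalid_ok, invalid_error = is_valid_openmetrics(invalid)
```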
The standard way to use metrics in Prometheus is to declare them at file level, before using them. Why?
If you've determined a metric should be tested, how do you go about that?
Prometheus client libraries don't just export metrics in our format, they can parse that format too.
The Prometheus client library guidelines recommend having a Child be returned via labels(). Why?
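To make the question concrete, here is a small sketch using the Python client (assuming prometheus_client is installed). The metric name and label are invented for the example; a private registry is used so the snippet does not touch the default one.

```python
from prometheus_client import CollectorRegistry, Counter

registry = CollectorRegistry()
REQUESTS = Counter(
    "example_requests", "Requests handled.", ["method"], registry=registry
)

# labels() returns the child for this label value. Holding on to it
# avoids repeating the label lookup on every increment.
get_requests = REQUESTS.labels("GET")
get_requests.inc()
get_requests.inc(2)

# Asking again with the same label value hands back the same child.
same_child = REQUESTS.labels("GET") is get_requests

value = registry.get_sample_value("example_requests_total", {"method": "GET"})
```

Because the child is an ordinary object, it can be cached in a variable or attribute wherever the label values are known in advance.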
A common question around Prometheus client libraries is how much RAM they'll use on a busy process. There tends to be disbelief when we say it's the same as an inactive server. Let's look deeper.
The default JSON output isn't always what you want when querying Prometheus. Let's see how to get CSV files out instead.
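One possible approach, sketched with only the standard library, is to flatten the JSON from an instant query into rows with one column per label. The function name vector_to_csv and the sample response are illustrative assumptions; only the instant vector result type is handled.

```python
import csv
import io
import json


def vector_to_csv(response_text):
    """Convert an instant-vector query response (JSON text) to CSV text."""
    results = json.loads(response_text)["data"]["result"]
    # Use the union of all label names so every row has the same columns.
    label_names = sorted({name for r in results for name in r["metric"]})
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(label_names + ["timestamp", "value"])
    for r in results:
        timestamp, value = r["value"]
        row = [r["metric"].get(name, "") for name in label_names]
        writer.writerow(row + [timestamp, value])
    return out.getvalue()


# Hand-written sample shaped like a /api/v1/query response.
sample = json.dumps({
    "status": "success",
    "data": {
        "resultType": "vector",
        "result": [
            {"metric": {"__name__": "up", "job": "node"}, "value": [1700000000, "1"]},
            {"metric": {"__name__": "up", "job": "prometheus"}, "value": [1700000000, "1"]},
        ],
    },
})
csv_text = vector_to_csv(sample)
```

Series that lack a given label get an empty cell, which keeps the columns aligned when different series carry different label sets.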