Reliable Insights

A blog on monitoring, scale and operational Sanity

March 17, 2016

New Features in Prometheus 0.17.0

Prometheus 0.17.0 was recently released with 394 commits since 0.16.2. Let's see what the big changes are.

Read more

March 9, 2016

Life of a Label

Prometheus labels allow you to model your application deployment in the manner best suited to your organisation. As directly supporting every potential configuration would be impossible, we offer relabelling to give you the flexibility to configure things how you'd like.

Read more

March 1, 2016

Using PagerDuty with the Alertmanager

The new Alertmanager has integrations with a variety of popular notification mechanisms, one of which is PagerDuty. Let's see how easy it is to hook it in.

Read more

February 26, 2016

What Alertmanager 0.1.0 means for you

The rewrite of the Alertmanager had its first release earlier this week after a beta period. While it still provides the core functionality of deduplicating alerts, silencing, inhibiting and sending notifications, there are several major changes from the previous version.

Read more

February 19, 2016

Using JSON file service discovery with Prometheus

Prometheus offers a number of ways to find the targets to scrape: DNS, EC2, Consul, Kubernetes, Zookeeper and Marathon. But what if you aren't using one of those?

Read more

February 9, 2016

I’ve got 99 Failure Modes, Yours is Just One

When running a production system there's an endless stream of issues that have the potential to cause you significant hassle. How should you deal with this?

Read more

February 3, 2016

SNMP Monitoring with Prometheus

Prometheus isn't limited to monitoring just machines and applications; it can provide insight into any system you can get metrics out of. That includes network devices, so let's look at how to monitor SNMP.

Read more

January 28, 2016

Exporting to Graphite with the Prometheus Java Client

I previously looked at how the Prometheus Python client can output to Graphite. You can do the same with the Java client, letting you instrument once and integrate with non-Prometheus monitoring systems.

Read more

January 20, 2016

Little Things Matter

As part of designing and building Prometheus, hundreds of technical decisions have to be made. Every one of them is important in building a sustainable, consistent ecosystem. Today, let's look at one small decision that was made by the Prometheus developers in Consul service discovery.

Read more

January 13, 2016

Writing a Jenkins exporter in Python

I previously talked about writing JSON exporters in Python for Prometheus. Since then, the API for custom collectors in the Python client has been improved. So let's see how easy it is to create a new exporter!

Read more
