I’m currently investigating metrics solutions for monitoring Kubernetes and the services that run on top of it. At present, we’re using Telegraf to gather both host-level metrics and metrics exposed at HTTP endpoints. Translating this setup to a Kubernetes environment is proving challenging, however.
There are a couple of Helm charts for deploying Telegraf, one as a DaemonSet (https://github.com/influxdata/tick-charts/tree/master/telegraf-ds) and one as a Deployment (https://github.com/influxdata/tick-charts/tree/master/telegraf-s). The DaemonSet approach would work for gathering host and container metrics, but I’m wondering whether there’s also a simple way to gather metrics exposed at HTTP endpoints on k8s pods.
I’ve also been looking into Prometheus (and the Prometheus Operator more specifically), but it feels quite heavyweight for what I want to do. That said, I do like its approach of annotating services to denote whether or not the collector should scrape metrics from associated pods’ endpoints.
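To make the annotation idea concrete, here’s a rough sketch of the selection logic I have in mind. It assumes the commonly used `prometheus.io/scrape`, `prometheus.io/port`, and `prometheus.io/path` annotation keys, and the pod dictionaries are a simplified, hypothetical shape rather than real Kubernetes API objects. The default port of 80 is also just my assumption.

```python
# Annotation keys follow the widely used prometheus.io/* convention.
# This is a convention from example configs, not something Kubernetes enforces.
SCRAPE_KEY = "prometheus.io/scrape"
PORT_KEY = "prometheus.io/port"
PATH_KEY = "prometheus.io/path"


def scrape_targets(pods):
    """Return metrics URLs for pods that opted in via annotations.

    Each pod is a plain dict with optional "ip" and "annotations" keys
    (a hypothetical, simplified shape used only for illustration).
    """
    targets = []
    for pod in pods:
        annotations = pod.get("annotations", {})
        # Only scrape pods that explicitly opt in.
        if annotations.get(SCRAPE_KEY) != "true":
            continue
        ip = pod.get("ip")
        if not ip:
            continue  # skip pods that have no IP assigned yet
        port = annotations.get(PORT_KEY, "80")  # assumed default port
        path = annotations.get(PATH_KEY, "/metrics")
        targets.append(f"http://{ip}:{port}{path}")
    return targets


# Example input: one pod opts in, the other carries no annotations.
pods = [
    {
        "name": "api",
        "ip": "10.0.0.5",
        "annotations": {SCRAPE_KEY: "true", PORT_KEY: "8080"},
    },
    {"name": "db", "ip": "10.0.0.6", "annotations": {}},
]

print(scrape_targets(pods))
```

Ideally the collector would do something like this for me, so that a pod only needs the right annotations to be picked up.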
Any guidance on this matter would be much appreciated!