After my update from telegraf 1.24.4 to 1.25, the agent was no longer able to send metrics to our graphite instance (running on docker).
The message being show in our logs is simply: 2023-01-03T14:28:30Z E! [agent] Error writing to outputs.graphite: could not write to any Graphite server in cluster
Hi @jpowers, my setup is very straightforward. I’m currently simply running telegraf 1.24.4, on an AWS EC2 instance, running Amazon Linux 2. Graphite is running on the same instance, in a docker container.
6ff757d94a56 graphiteapp/graphite-statsd:1.1.10-1 "/entrypoint" 13 days ago Up 13 days 0.0.0.0:2003-2004->2003-2004/tcp, :::2003-2004->2003-2004/tcp, 2013-2014/tcp, 127.0.0.1:2023-2024->2023-2024/tcp, 0.0.0.0:8080->8080/tcp, :::8080->8080/tcp, 8125/tcp, 127.0.0.1:8126->8126/tcp, 0.0.0.0:8125->8125/udp, :::8125->8125/udp, 0.0.0.0:8800->80/tcp, :::8800->80/tcp, 0.0.0.0:8443->443/tcp, :::8443->443/tcp graphite.service
The relevant part of my telegraf config
[[outputs.graphite]]
## TCP endpoint for your graphite instance.
## If multiple endpoints are configured, output will be load balanced.
## Only one of the endpoints will be written to with each iteration.
servers = ["192.168.0.108:2003"] # dash on VPC
## Prefix metrics name
prefix = "telegraf"
## Graphite output template
## see https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_OUTPUT.md
template = "host.tags.measurement.field"
## Enable Graphite tags support
#graphite_tag_support = true
## timeout in seconds for the write connection to graphite
timeout = 2
As stated before, this works with telegraf 1.24.4. As soon as I upgrade to 1.25, the telegraf agent starts complaining.
No, it did never work on 1.25.x. I moved from 1.24 to 1.25 and it stopped working.
OK, this is weird. I just cleared my versionlock, upgraded telegraf to 1.25.0.1 again … and it now starts working as if nothing happened. I will need to check my other machines as well, before crying victory.