Telegraf - scale out SNMP collectors?

jarush · November 14, 2018, 11:08pm

I’ve been using telegraf in our lab to collect SNMP data from some Nexus switches, UCS Fabric interconnects, and MDS SAN switches. Worked great. Deployed to production and the queries that were taking less than a second are now taking 45+ seconds. The production nexus 7ks have hundreds of interfaces and I’m worried the large list of switches i’m providing to telegraf is being processed serially. I’m just pulling the interface stats via the IF-NET MIB, nothing fancy. I’m running telegraf in a k8s container deployed via helm.

Before I visit other SNMP collectors or try and create my own helm chart for telegraf that blows out new single instance containers for each snmp server being monitored, I figured I’d ask here to see what folks are doing at scale. Is there a magical parallel setting I’m missing or a magical autoscale set for k8s that I’m ignorant of?

Topic		Replies	Views
Telegraf collecting snmp data from selected network interfaces Telegraf telegraf , performance , snmp , docker	6	2271	December 1, 2023
Best deployment strategy for production, 1x Telegraf per NE or per NE hardware type? Telegraf performance , kafka , snmp	5	73	December 18, 2024
Best way to scale Telegraf Telegraf telegraf	3	3451	October 4, 2018
Telegraf Slow to restart Telegraf telegraf	5	1631	June 22, 2019
Telegraf not collecting SNMP data telegraf	0	977	February 19, 2019

Telegraf - scale out SNMP collectors?

Related topics