I am looking to monitor 30+ servers with Collectd, InfluxDB and Grafana.
I have Collectd set up on all servers, reporting metrics to an InfluxDB instance on a separate server. The Interval setting in Collectd is 1, so every server reports once per second. This worked fine for a while, but as I increased the number of servers reporting, InfluxDB slowed down to the point where Grafana takes 10+ seconds to pull stats for a single server. My guess is that the slowdown is caused by all those servers reporting metrics every second.
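To put a rough number on that guess, here is a back-of-the-envelope sketch of the write rate InfluxDB would see at different Collectd intervals. The figure of 100 metrics per server is purely an assumption for illustration (I have not measured it), and `points_per_second` is just a hypothetical helper name.

```python
def points_per_second(servers: int, metrics_per_server: int, interval_s: float) -> float:
    """Estimate how many data points per second arrive at InfluxDB."""
    return servers * metrics_per_server / interval_s


SERVERS = 30               # from the setup described above (30+ servers)
METRICS_PER_SERVER = 100   # ASSUMPTION: depends entirely on which Collectd plugins are enabled

# Compare the current 1-second interval with two coarser, hypothetical intervals.
for interval in (1, 10, 60):
    rate = points_per_second(SERVERS, METRICS_PER_SERVER, interval)
    print(f"Interval {interval:>2}s: about {rate:.0f} points/s")
```

If the real metric count is anywhere near that, the incoming write rate scales linearly with the number of servers and inversely with the interval, which would fit the slowdown appearing as more servers were added.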
- Collectd Client Config: http://termbin.com/5xs8
- InfluxDB Config: http://termbin.com/hl66
Does my InfluxDB configuration look okay? Are there any changes I can make so that it handles all the incoming metrics better?