Currently we have some infrastructure where we have lots of telegraf instances collecting data and sending to one for monitoring. Now, there are cases when telegraf can be down(especially for windows machines) and I would like to monitor this as well. So, from monitoring server I want to have some kind of ping to all the machines and see if telegraf is running. Now from the plugins I see it seems to me that I should use:
socket listener for the nodes and on monitoring node use
net_response to check if running, is this solution OK?
Edit: Or I can use
http_listener plugin and basically from monitoring write some data. If I have data into DB it’s OK if not then smth is wrong.