Hi, this is a cross-post from community.grafana.com
“I want to learn about the alert mechanism and how is the best way to set them. I know that I can not use alerts if I use variables. Yet, when you have 500+ node to monitor let’s say simply cpu usage 90% for 1 min, how do you set alerts? Of course there are many other metrics. Should I create different dashboards for alerts separated completely from the main monitoring dashboards?”
It’s not Influx related directly but as far as I see this community is more active so I take my chances.
Also, I don’t have go with grafana. I would like to hear your experiences with TICK stack. Should I try to implement kapacitor? Alerting is crucial since dashboards does not scale.
I would really like to know how you manage alerts and incidents in your environments.