Managing Kapacitor alerts at scale

michaelperzel · March 19, 2018, 9:23pm

We are working on productionizing how we create Kapacitor tasks, templates, etc. We are planning to use the load directory service (see "Load directory service" in the Kapacitor 1.4 documentation) plus Ansible. One issue we expect to run into is how to prevent the tasks from ballooning into a lot of nearly identical files.

For example, we have dozens of queues we want to monitor. We have a generic TICKscript that pulls out the queue depth; it primarily takes the queue names as its input. But we have different thresholds for each queue. So our initial solution is to render a task file per queue that contains the queue name and threshold. The problem is that this results in a lot of files, especially when you multiply across all the different kinds of alerts we want to generate.

Does anyone have experience with a better way to manage Kapacitor alerts?

katy · March 27, 2018, 3:48pm

The best way to manage Kapacitor alerts at scale is with sideload().

michaelperzel · March 27, 2018, 4:14pm

Thanks for the response. If I follow the documentation correctly, I could have a single task that takes a sideloaded file containing a YAML list of queue names and thresholds, rather than a task per queue.

Do you know of any examples using a sideload? I did some Google searching and couldn’t find one.

Thanks.

katy · March 27, 2018, 4:35pm

There’s a start on an example in this description.

michaelperzel · March 28, 2018, 2:31pm

Sure, I’ve reviewed that example but was hoping for something more in depth. Some questions: What is the format of the files at ‘file:///path/to/dir’? Is it just key: value? What is replaced by {{.host}}?

I’m having trouble visualizing how this works beyond the concept of loading fields/tags from a file.

Thanks.

katy · March 28, 2018, 4:37pm

The format of the files is key-value data, either JSON or YAML. The {{.host}} marker is replaced by the value of the point's host tag. This allows for creating specific overrides per host, or per whatever other tags you might need.
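To help visualize the lookup, here is a minimal Python sketch of the behaviour as I understand it from the sideload documentation: each path in the order list has its {{.tag}} markers filled in from the point's tags, the paths are searched in order, and the first value found for a key wins; otherwise the default is used. This is only a model, not Kapacitor's code. The file names, keys, and the functions `expand` and `sideload_lookup` are hypothetical, and the files are held in a dict rather than read from disk.

```python
import re

# Hypothetical in-memory stand-in for the flat key-value files
# that would live under the sideload source directory.
FILES = {
    "hosts/web01.yml": {"cpu_threshold": 95},
    "default.yml": {"cpu_threshold": 80, "mem_threshold": 90},
}


def expand(path_template, tags):
    """Replace {{.tag}} markers with the point's tag values."""
    return re.sub(
        r"\{\{\s*\.(\w+)\s*\}\}",
        lambda m: tags.get(m.group(1), ""),
        path_template,
    )


def sideload_lookup(order, tags, key, default, files=FILES):
    """Return the first value found for key, walking the paths in order."""
    for template in order:
        data = files.get(expand(template, tags), {})
        if key in data:
            return data[key]
    return default


order = ["hosts/{{.host}}.yml", "default.yml"]
print(sideload_lookup(order, {"host": "web01"}, "cpu_threshold", 0))   # 95 (host override)
print(sideload_lookup(order, {"host": "db01"}, "cpu_threshold", 0))    # 80 (falls through to default.yml)
print(sideload_lookup(order, {"host": "db01"}, "disk_threshold", 50))  # 50 (key absent, default used)
```

In this model a host only needs its own file when it differs from the shared defaults, which is what keeps the number of files down.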

Dois1111 · March 28, 2018, 6:03pm

really appreciating this info katy. i’m new here so such kind of information is surely valuable for me. was wondering if i might ask you other questions as well? thanks again!

katy · March 28, 2018, 6:06pm

Of course. We’re here to help!

michaelperzel · April 2, 2018, 7:51pm

Is it possible to reference a field within a YAML/JSON file, rather than have one file per queue as in our case? For example, have a file called threshold.yml with the format:
QUEUE_1:
  threshold: 1000
QUEUE_2:
  threshold: 2000

I did get a file per queue to work, with the following.
TICKscript:
dbrp "telegraf"."autogen"
var data = stream
    |from()
        .measurement('infra_nix_mq_queue')
        .groupBy('queues')
    |window()
        .period(5m)
        .every(10s)
    |sideload()
        .source('file:/app/uid/kapacitor/sideload')
        .order('{{.queues}}.yml')
        .field('threshold', 9999)
    |alert()
        .crit(lambda: int("depth") > "threshold")
        .exec('/usr/bin/python', '/tmp/zenoss_event.py')

Each YAML file is named after a queue and has the format:
threshold: 1000
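Since the per-queue files are flat, one possible workaround is to keep the queue-to-threshold mapping in a single place and generate the files from it, for instance as a deployment step. The following is a rough Python sketch under that assumption; the `THRESHOLDS` mapping and the `render_sideload_files` function are hypothetical names, and the output directory would be whatever the sideload source points at.

```python
import tempfile
from pathlib import Path

# Hypothetical single source of truth: queue name -> threshold.
THRESHOLDS = {"QUEUE_1": 1000, "QUEUE_2": 2000}


def render_sideload_files(thresholds, out_dir):
    """Write one flat '<queue>.yml' file per queue containing 'threshold: <n>'."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for queue, threshold in sorted(thresholds.items()):
        path = out / f"{queue}.yml"
        path.write_text(f"threshold: {int(threshold)}\n")
        written.append(path)
    return written


if __name__ == "__main__":
    # Demonstrate against a throwaway directory instead of a real sideload path.
    with tempfile.TemporaryDirectory() as tmp:
        for p in render_sideload_files(THRESHOLDS, tmp):
            print(p.name, "->", p.read_text().strip())
```

This still produces one file per queue on disk, but the files become build output rather than something maintained by hand.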

michaelperzel · April 2, 2018, 8:42pm

This looks related.

Going to try it out.

michaelperzel · April 3, 2018, 3:09pm

Unfortunately, he solved the problem of escaping his partition names, but only requested the ability to use something other than a flat file.

cxcv · April 3, 2018, 4:17pm

I think it’s a hack; it works, but having a hierarchical tree within the .yml files would be superior.
I’ve looked at the sideload source code and found no other way around it.

Cheers
Benjamin
