Impossible to aggregate net measurement in telegraf database

vladpov · July 23, 2020, 7:30am

Hey everyone!!

We use telegraf for storing metrics in our InfluxDB. As we’re running out of disk space we’re using RPs and aggregating data and store them to different RPs. We ran into problem with aggregation of measurement called net which has around 94 fields which I can see in Grafana Dashboard. The problem is that it is even impossible to select one line of measurement with limited select. About counting the records I would even speak.

When I try to run query for aggregation with mean function that would mean data every 1m the machine is swiftly running out of RAM. We even tried to add RAM memory for 30GB which is insane and it still is running to 29 and then use 2GB swap so machine drops.

If anyone ran into the similar problem please offer a hand.

Thanks in advance!!

Anaisdg · July 24, 2020, 8:42pm

Hello @vladpov,
Have you considered using flux? You can perform aggregations easily across all fields in a measurement like so:

 from(bucket: "mybucket")
  |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
  |> filter(fn: (r) => r["_measurement"] == "mymeasurement")
  |> mean(column: "_value")

vladpov · July 29, 2020, 7:06am

Hello @Anaisdg,

first of all thank you for your reply! The solution you offer sounds great If I understand it correctly example of code you sent would take the measurement and run mean function on specified column and time range.

If you could clarify couple questions that occur in my mind Is necessary to define time range? Can I run similar code on every record in the column? How and where is output stored? I mean if I run it on every column, is it possible to store mean output to one measurement column after column?

Thank you again for your help! Cheers

Anaisdg · July 29, 2020, 3:26pm

Hello @vladpov,
You’re welcome!
Yes, you need to define a time range, however you can query all the data in your bucket if your bucket doesn’t have too much data.
The output is stored in memory, but you can write the data to a new bucket with the to() and you can do this on a schedule with a task.

Does this help answer your questions?

Thank you! :))

vladpov · July 30, 2020, 6:14am

Great! Thank you @Anaisdg, I’ll give it a try!

Speaking of scheduling, I ran only to these options of scheduling a task. Is anyhow possible to schedule it with kind a if statement? Like if task is done successfully continue to the next column…

Best regards.
VP

system · July 30, 2020, 7:14am

This topic was automatically closed 60 minutes after the last reply. New replies are no longer allowed.

Anaisdg · July 30, 2020, 7:44pm

You can create multiple tasks and write the outputs to a new measurement or bucket. Then you can write a new task to operate on that output. Does that help?

Topic		Replies	Views
Is there a way a task could iterate over measurements and aggregate data into another bucket? InfluxDB 2 flux	4	2346	March 7, 2021
How to simply sum these values using a Telegraf aggregator? influxdb , telegraf	4	29	June 27, 2025
Select Into problems Telegraf chronograf	9	1983	March 1, 2019
How to turn influxQL sum() mean() into flux? InfluxDB 2 influxql , query , flux	4	581	October 14, 2022
Telegraf: route aggregations to different buckets Telegraf telegraf	3	616	December 23, 2024

Impossible to aggregate net measurement in telegraf database

Related topics