Using pushdown functions in InfluxDB 1.8

annariley · November 25, 2022, 10:32am

Hello,

I am trying to run large queries with window and aggregating (window and limit) in Flux with InfluxDB v1.8. To my understanding, when put after range() in my query, these functions are not working as pushdown functions, and when I query my database flux first pulls all values in the specified time range. This is causing my query to be extremely slow. Is there anyway to not pull all values from the database before aggregating in InfluxDB1.8? I am already using range, filter, and group. Any help to optimize would be appreciated.

Thank you!

table1 = from (bucket:"bucket")
|>range(start: v.timeRangeStart, stop: v.timeRangeStop)
|>filter(fn: (r) => r._measurement == "m1" and r.node == "${Node}" and r.mode == "mode1" and r.path == "path1")
|>filter(fn: (r) => r["_field"] == "field")
|> group(columns: ["$__interval", "mode", "device"])
|> drop(columns: ["_start", "_stop", "_field", "_measurement", "column", "node", "path", "dof"])
|>window(every: v.windowPeriod)
|>limit(n:1, offset:0)
|> window(every: inf)

table2 = from (bucket:"bucket")
|>range(start: v.timeRangeStart, stop: v.timeRangeStop)
|>filter(fn: (r) => r._measurement == "m2" and r.node == "${Node}" and (r.path == "path2" or r.path == "path3") and r["_field"] == "mean")
|> drop(columns: ["_start", "_stop", "_field", "_measurement", "node","path", "pos"])
|>window(every: v.windowPeriod)
|>limit(n:1, offset:0)
|> window(every: inf)

join(
  tables: {table1:table1, table2:table2},
  on: ["_time"]
)|> rename(columns: { _time: "_time_joined"})

Pooh · November 25, 2022, 10:45am

I suggest you show us the exact query you are currently using so that we have
some idea of what you are asking us to help to optimise.

Antony.

Jay_Clifford · November 25, 2022, 11:35am

Hi @annariley,
To confirm there really isn’t a notion of pushdown functions within 1.X for Flux. This was introduced in 2.X of InfluxDB. As @Pooh said we can at least try to simply the query for you.

annariley · November 25, 2022, 12:08pm

thanks- added to the post

Jay_Clifford · November 25, 2022, 12:27pm

Hi @annariley,
so it looks like you want to use pivot?

It seems you have made a lot of your fields generic so it is hard to tell if pivot will work? The join is your particular issue here. The limit is also interesting in one case you limit to 5 rows per window and 1 in the second table. Could use an aggregate window instead for the first table?

annariley · November 25, 2022, 1:03pm

thanks @Jay_Clifford I’ll add some clarification to the vague titles. Would aggregate window be more efficient than the decontructed version? I want to limit them to the same number per window but the actual number is trivial. I don’t think pivot would work because I still want the values to be vertical but I want the values in both tables to relate to a single time column. (The goal is to plot the values with table1_value as the x axis and table2_value as the y axis.)

Jay_Clifford · November 25, 2022, 1:22pm

Ah sadly pivoting is not possible across measurements. Pivoting would retain your values to be verticle they would just be indicated as two columns against the same timestamp. I personally would do the following:

table1 = from (bucket:"bucket")
|>range(start: v.timeRangeStart, stop: v.timeRangeStop)
|>filter(fn: (r) => r._measurement == "m1" and r.node == "${Node}" and r.mode == "mode1" and r.path == "path1")
|>filter(fn: (r) => r["_field"] == "field")
|> aggregateWindow(every: v.windowPeriod , fn: first)

table2 = from (bucket:"bucket")
|>range(start: v.timeRangeStart, stop: v.timeRangeStop)
|>filter(fn: (r) => r._measurement == "m2" and r.node == "${Node}" and (r.path == "path2" or r.path == "path3") and r["_field"] == "mean")
|> aggregateWindow(every: v.windowPeriod , fn: first)

join(
  tables: {table1:table1, table2:table2},
  on: ["_time"]
)|> rename(columns: { _time: "_time_joined"})

annariley · November 25, 2022, 2:11pm

Thanks, I’ll consider this. Is there a reason you dropped the group() and drop() steps?

Topic		Replies	Views
Custom Aggregate Window function that preserves timestamp from Selector InfluxDB 2 flux	1	344	June 13, 2023
InfluxQL query to Flux Fluxlang influxql , flux	1	518	April 17, 2023
Help for InfluxQL to Flux InfluxDB 2	7	701	April 23, 2023
Slow window function Fluxlang	3	644	May 23, 2022
Does InfluxQL (3.0) support something like aggregateWindow? query	1	333	October 3, 2023

Using pushdown functions in InfluxDB 1.8

Related topics