Flux performance compared to similar influxQL query

franck102 · November 8, 2022, 6:44pm

Am I missing something obvious when translating the InfluxQL query below to Flux? From Grafana, the InfluxQL query executes in 191 ms, while the Flux one takes about 38 s.
Both run over the same time range: the last 2 weeks, with one record every 30 s (so somewhere around 2.5M records).

SELECT
  max(chargeur_velo) AS chargeur_velo,
  max(plaque) AS plaque,
  max(cuisine) AS cuisine,
  max(lave_vaisselle) AS lave_vaisselle,
  max(livebox) AS livebox
FROM ${db}.."power"
WHERE $timeFilter
GROUP BY time($watts_interval) fill(linear)

into

fields = ["chargeur_velo", "plaque", "cuisine", "lave_vaisselle", "livebox"]
from(bucket: "iota-oneweek")
  |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
  |> filter(fn: (r) => r["_measurement"] == "power")
  |> filter(fn: (r) => contains(value: r._field, set: fields))
  |> aggregateWindow(every: 1d, fn: mean, createEmpty: false)

grant1 · November 8, 2022, 10:26pm

Do you have any tags in the data that Flux is querying? I found this webinar from InfluxDB about Schema Design for IoT to be extremely helpful.

My impression is that this data should be re-organized as follows:

Tag: Device ( chargeur_velo | plaque | cuisine | lave_vaisselle | livebox )
Field: Power
Measurement would be something other than Power, for example, “Maison”
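Under the schema proposed above, a query could select devices by tag value rather than by field name. The following is only a sketch: it assumes the data has already been rewritten with a "Maison" measurement, a "Device" tag, and a single "Power" field, none of which exist in the original bucket.

```flux
// Sketch only: assumes the reorganized schema suggested above
// (measurement "Maison", tag "Device", field "Power").
from(bucket: "iota-oneweek")
  |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
  |> filter(fn: (r) => r._measurement == "Maison" and r._field == "Power")
  // Select devices by tag value instead of by field name.
  |> filter(fn: (r) => r.Device == "plaque" or r.Device == "cuisine")
  |> aggregateWindow(every: 1d, fn: mean, createEmpty: false)
```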

franck102 · November 9, 2022, 5:36am

Hi @grant1, I will look into your suggestion, but I think I found the main culprit: using the contains() function in the second filter seems to prevent the pushdown.
If I simply use a static “or” expression, performance is an order of magnitude better.

Franck
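The exact rewritten query is not shown in the thread, but the static “or” version described above would presumably look something like this sketch, which keeps the rest of the original query unchanged:

```flux
// Sketch of the static "or" filter; the rest of the query is as originally posted.
from(bucket: "iota-oneweek")
  |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
  |> filter(fn: (r) => r._measurement == "power")
  // Explicit comparisons replace contains(value: r._field, set: fields).
  |> filter(fn: (r) =>
      r._field == "chargeur_velo" or
      r._field == "plaque" or
      r._field == "cuisine" or
      r._field == "lave_vaisselle" or
      r._field == "livebox"
  )
  |> aggregateWindow(every: 1d, fn: mean, createEmpty: false)
```

The trade-off is that the field list is no longer a reusable variable and has to be spelled out in the predicate.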

grant1 · November 9, 2022, 11:27am

OK, thanks for the additional info. I had never seen the contains() function, so was not aware of how it was used.

MzazM · November 9, 2022, 1:13pm

Yes, it is the contains(). It is known to have a large impact on performance. If you are using Grafana, you can bypass it with careful use of variables and a regex. Read the full discussion here
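As a rough illustration of the regex approach mentioned above (the linked discussion may do it differently), the field filter could be written as a regular expression match. The Grafana variable named "fields" in the comment is hypothetical.

```flux
// Sketch only: a regex match on the field name in place of contains().
from(bucket: "iota-oneweek")
  |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
  |> filter(fn: (r) => r._measurement == "power")
  // Anchored alternation, so only these exact field names match.
  // With a hypothetical multi-value Grafana variable named "fields",
  // the pattern could instead be written as /^${fields:regex}$/.
  |> filter(fn: (r) => r._field =~ /^(chargeur_velo|plaque|cuisine|lave_vaisselle|livebox)$/)
  |> aggregateWindow(every: 1d, fn: mean, createEmpty: false)
```

Whether this performs as well as the static “or” expression is worth measuring on your own data.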

max0x7ba · December 9, 2022, 3:58am

There is a bug report for poor contains performance.
