Get the mean of 4 days per 15 minutes

mario_balti · May 24, 2023, 1:58pm

 I have this query

import "date"
days = date.truncate(t: -28d, unit: 1d) from(bucket: "my_data")
|> range(start: days)
|> filter(fn: (r) => r["_measurement"] == "my_portal")
|> filter(fn: (r) => r["_field"] == "EDC")
|> filter(fn: (r) => date.weekDay(t: r._time) == 2)

this will return the data of the last 4 Tuesdays
Each day contains data per 15 minutes
Now I want to combine these 4 days into one day which contains the average of these 4 Tuesdays per 15 minutes
My device sends data every 15 minutes

I want to get the average of these 4 Tuesday for each 15 minutes
for example
Tuesday 2/5/2023 06:30 value is 0.20
Tuesday 9/5/2023 06:30 value is 0.10
Tuesday 16/5/2023 06:30 value is 0.40
Tuesday 23/5/2023 06:30 value is 0.20
So I will get the average of these 4 days at 06:30 = 0.225
And then at 6:45 and 07:00 and so on
What should I add to the query in order to do that?
ps: I am using node js client to query the data

fercasjr · May 24, 2023, 3:16pm

group per day, then use mean function:
add this to your query after the last filter

  |> window(every: 1d)  //groups per day, you may need to use some kind of offset depending on the "time zone" you are working with.
  |> mean(column: "_value") //once you have 1 table per day mean function will return the mean on that sub group
  |> group() //this removes all groups, to combine all previous results on one single table.

one second tough, the previous code will work on the average per day but if think is not what you described, if you want the average of each given day at 15 min you may need to do 2 groups: first group per day second group times in that case. instead of window, extract the time and day from the _time using the date library, on a new column and group by time and date.

mario_balti · May 25, 2023, 6:47am

I don’t want to get the average per day
My final result should be the mean of the 15 minutes of these 4 days
So one single day with 96 rows each represent the average of that time (10:15 represent the average of the last 4 days at 10:15)

mario_balti · May 25, 2023, 7:16am

This shows what I really want

fercasjr · May 25, 2023, 12:39pm

This is the way then:

BRB with the code

fercasjr · May 25, 2023, 1:37pm

import "strings"
import "date"

from(bucket: "Bucketname")
  |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
  |> filter(fn: (r) => r["_field"] == "fieldname")
  |> filter(fn: (r) => r["tag"] == "tag1" or r["tag"] == "tag2" or r["tag"] == "tag3")
  |> aggregateWindow(every: 15m, fn: last, createEmpty: false)

  
  |> filter(fn: (r) => date.weekDay(t: r._time) == 2)
  //|> map(fn: (r) => ({ r with Time: string(v: date.hour(t: r._time)) + ":" +  string(v: date.minute(t: r._time)) + ":" +  string(v: date.second(t: r._time)), })) 
  |> map(fn: (r) => ({ r with  Time: strings.substring(v: string(v: r._time), start: 11, end:19)})) 
  |> group(columns: ["tag", "Time"], mode:"by") 
  |> mean(column: "_value")    
  |> yield(name: "last")

mario_balti · May 26, 2023, 2:33pm

Thank you man
This works I think but there is only a small problem
This offset the time 15 minutes when showing the average
For example in the pic you see it shows the mean of 10:30 but it is actually the mean of 10:45
Why is that and how to fix it?

mario_balti · May 26, 2023, 2:54pm

I gues this was because using “last” method
|> aggregateWindow(every: 15m, fn: last, createEmpty: false)
Commenting this out solved the problem

fercasjr · May 26, 2023, 3:50pm

yes, that part actually depends on your data scheme, I let that line because on the bucket I used to test, I have data every 2s so I use an aggregate window of 15 min and mean, which means it will take 15 minutes worth of data and give me the mean and then it stores that value at the last timestamp.

but if you have on data point each 15 minutes that entire line is not necessary.

Topic		Replies	Views
InfluxDB 2 - get specific mean values InfluxDB 2 influxdb , query	2	719	April 20, 2022
Write back results from query (e.g. every hour) Fluxlang	6	976	February 25, 2020
Calculate mean of values across multiple tables? Fluxlang influxdb , time-series , query , flux	2	1391	July 2, 2020
Group by every hour of the day Fluxlang	4	6987	March 14, 2022
I want to retrieve data from a specific period of time InfluxDB 2	4	656	December 13, 2023

Get the mean of 4 days per 15 minutes

Related topics