How to get a cumulative sum of a product from different series?

Hi all,

I am using Grafana 10.4.1 as part of Home Assistant on a Raspberry Pi 4, together with InfluxDB 1.8.10.

I want a graph showing the cumulative sum of my savings from my solar panels. In my eyes that is easy as long as the kWh cost remains constant: I have a metric with the sum of the self-used kWh, and this can simply be multiplied by some constant kWh cost.

But now I have had two kWh cost changes in quick succession, and I want that cumulative sum to remain correct. So I introduced a new metric holding the kWh cost, to move away from the constant.

But when I integrate this into an InfluxQL query (I want to use this query from Grafana later), I get no results and no error, even though all the series do have data!

SELECT
    CUMULATIVE_SUM( SUM(saving) )
FROM
(
    SELECT
        self_used_kwh * cost_kwh AS saving
    FROM
    (
        SELECT max("value") - min("value") AS self_used_kwh FROM ha_db.autogen."kWh" WHERE ("friendly_name"::tag = 'Solar_selbst_verbraucht_Summe')
    ),
    (
        SELECT mean("value") AS cost_kwh FROM ha_db.autogen."EUR" WHERE ("friendly_name"::tag = 'Strom_Preis_pro_kWh')
    )
)
GROUP BY time(1d)
FILL(previous)

Any ideas on this? Thx.

@robertortel I think what you’re trying to do would actually require a join to align associated values into rows. Unfortunately, InfluxQL doesn’t support joins. I know this query is possible in Flux. Are you open to using Flux for this?

Thx scott,

I am open to using Flux, but it will only help me if Grafana can issue such a query as well. I want to bring the result into a Grafana dashboard with other graphs.

Do you know whether and how I can issue a flux query from grafana?

Yes, Grafana can do Flux queries. You just need a separate InfluxDB data source configured to use Flux. Here’s how to set it up: Use Grafana with InfluxDB v1.8 | InfluxDB OSS v1 Documentation

Here’s what the query would look like if you were to hard code the time range (1 month) and window interval (1 day):

self_used_kwh =
    from(bucket: "ha_db/autogen")
        |> range(start: -1mo)
        |> filter(fn: (r) => r._measurement == "kWh")
        |> filter(fn: (r) => r.friendly_name == "Solar_selbst_verbraucht_Summe")
        |> aggregateWindow(every: 1d, fn: spread)
        |> set(key: "_field", value: "self_used_kwh")

cost_kwh =
    from(bucket: "ha_db/autogen")
        |> range(start: -1mo)
        |> filter(fn: (r) => r._measurement == "EUR")
        |> filter(fn: (r) => r.friendly_name == "Strom_Preis_pro_kWh")
        |> aggregateWindow(every: 1d, fn: mean)
        |> set(key: "_field", value: "cost_kwh")

union(tables: [self_used_kwh, cost_kwh])
    |> pivot(rowKey: ["_time"], columnKey: ["_field"], valueColumn: "_value")
    |> map(fn: (r) => ({r with _field: "saving", _value: r.self_used_kwh * r.cost_kwh}))
    |> cumulativeSum()
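In case the pipeline is hard to follow: union puts both tables into one stream, pivot aligns the two fields into one row per timestamp, map computes the product, and cumulativeSum keeps a running total. Here is a rough Python sketch of that same idea, with made-up daily numbers (illustrative only, not Home Assistant data):

```python
# Illustrative stand-in for union |> pivot |> map |> cumulativeSum.
# Two per-day series keyed by date (made-up example values).
self_used_kwh = {"2024-04-01": 5.0, "2024-04-02": 3.0, "2024-04-03": 4.0}
cost_kwh      = {"2024-04-01": 0.30, "2024-04-02": 0.30, "2024-04-03": 0.35}

# "pivot": align both fields into one row per timestamp.
rows = [
    {"time": t, "self_used_kwh": self_used_kwh[t], "cost_kwh": cost_kwh[t]}
    for t in sorted(self_used_kwh)
]

# "map" + "cumulativeSum": per-day saving, then a running total.
total = 0.0
for row in rows:
    row["saving"] = row["self_used_kwh"] * row["cost_kwh"]
    total += row["saving"]
    row["cumulative_saving"] = round(total, 2)

print([r["cumulative_saving"] for r in rows])
```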

To make the time range and window interval configurable in Grafana:

self_used_kwh =
    from(bucket: "ha_db/autogen")
        |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
        |> filter(fn: (r) => r._measurement == "kWh")
        |> filter(fn: (r) => r.friendly_name == "Solar_selbst_verbraucht_Summe")
        |> aggregateWindow(every: v.windowPeriod, fn: spread)
        |> set(key: "_field", value: "self_used_kwh")

cost_kwh =
    from(bucket: "ha_db/autogen")
        |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
        |> filter(fn: (r) => r._measurement == "EUR")
        |> filter(fn: (r) => r.friendly_name == "Strom_Preis_pro_kWh")
        |> aggregateWindow(every: v.windowPeriod, fn: mean)
        |> set(key: "_field", value: "cost_kwh")

union(tables: [self_used_kwh, cost_kwh])
    |> pivot(rowKey: ["_time"], columnKey: ["_field"], valueColumn: "_value")
    |> map(fn: (r) => ({r with _field: "saving", _value: r.self_used_kwh * r.cost_kwh}))
    |> cumulativeSum()

Hi Scott,

I tested that query in chronograf first, but it runs into an error. And the very same error is returned from grafana as well:

[sse.dataQueryError] failed to execute query [C]: 500 Internal Server Error: {"error":"panic: runtime error: invalid memory address or nil pointer dereference"}

Are there any more details in your logs?

At first, thank you for your time in translating the query!

No … my influx logs (which I can only access from the home assistant web gui) do not show any more details:

[...]
time="2024-04-04T19:12:43+02:00" level=info msg="Response: OK" component=server method=POST remote_addr="127.0.0.1:37852" response_time=11.380008ms status=200
time="2024-04-04T19:12:45+02:00" level=info msg="Response: Internal Server Error" component=server method=POST remote_addr="127.0.0.1:37866" response_time=1.774893427s status=500
time="2024-04-04T19:12:51+02:00" level=info msg="Response: OK" component=server method=GET remote_addr="127.0.0.1:60698" response_time="151.72µs" status=200
[...]


@scott I can adjust the loglevel of the influx running from home assistant. Any suggestions? I will retry the query then.

The “trace” log level will give you the most detail, but I don’t know how much it will help with the 500. Another thing to check is to make sure Flux is enabled in your InfluxDB 1.8 instance.

I could check the docker container of influxdb and indeed, flux is enabled:

root@a0d7b954-influxdb:/etc/influxdb# cat influxdb.conf | grep flux-ena
  flux-enabled = true
root@a0d7b954-influxdb:/etc/influxdb#

Log forwarding from InfluxDB to Home Assistant might be buggy … I can’t get anything in debug or trace mode. I will retry later.

However, I found this reported as an issue against this specific InfluxDB add-on for Home Assistant:

where the same error is referenced, tracked as an issue in InfluxDB itself:

Interesting. I’m thinking this is probably related to an issue with the specific version of Flux packaged with InfluxDB 1.8.10. Unfortunately, it’s an old version of Flux and can’t be upgraded due to some dependency changes in Flux itself. The only way to get a newer version of Flux would be to upgrade to the latest version of InfluxDB v2, but I know this isn’t an option for many.

As posted in the first bug report, this might be related only to queries using "fn: max". I can’t see such a parameter in your Flux query, which suggests it might not be that simple. But sometimes query rewrites make a difference … any suggestions, maybe?

And no … I can’t upgrade. I am dependent on the package provided by GitHub - hassio-addons/addon-influxdb: InfluxDB - Home Assistant Community Add-ons, and there is no other InfluxDB add-on available for Home Assistant as far as I know.

There is a Home Assistant community add-on for InfluxDB v2: Home Assistant Add-on: InfluxDB v2 - Home Assistant OS - Home Assistant Community

Very nice & thank you @scott :-).

And there is even some documentation on how to migrate from my influx 1.x to this influx 2.x. Well … more work to do, but at least this is something :-).

Yesterday I already checked that on my phone and read somewhere (but can’t find it now) that InfluxDB 2.x no longer supports InfluxQL, and that I would have to rebuild all my Grafana queries in Flux. Is that really true?

No, that’s not true. That was the case when v2 was first released, but InfluxQL support was added back later on. It does require a little bit of setup, since InfluxQL queries need a database and retention policy, which were combined and replaced with “buckets” in v2. You have to map database/retention-policy combinations to v2 buckets to be able to use InfluxQL. Here’s some more information: https://docs.influxdata.com/influxdb/v2/query-data/influxql/
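As a sketch of what that mapping looks like with the influx CLI (the bucket ID below is a placeholder; substitute the ID of the bucket that holds your migrated data, which you can find with influx bucket list):

```shell
# Map the v1 database/retention-policy pair ha_db/autogen to a v2 bucket
# so existing InfluxQL queries keep working. <bucket-id> is a placeholder.
influx v1 dbrp create \
  --db ha_db \
  --rp autogen \
  --bucket-id <bucket-id> \
  --default
```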

Hey @scott

Ok, so I now have an InfluxDB v2 instance and have migrated my InfluxDB v1 data to it. Home Assistant is still writing into InfluxDB v1; that switch will happen later.

But now I can test with that data in influx v2.

I had to adjust your query to make it run:

self_used_kwh =
    from(bucket: "data")
        |> range(start: -1mo)
        |> filter(fn: (r) => r._measurement == "kWh")
        |> filter(fn: (r) => r.friendly_name == "Solar_selbst_verbraucht_Summe")
        |> filter(fn: (r) => r["_field"] == "value")
        |> aggregateWindow(every: v.windowPeriod, fn: spread)
        |> set(key: "_field", value: "self_used_kwh")

cost_kwh =
    from(bucket: "data")
        |> range(start: -1mo)
        |> filter(fn: (r) => r._measurement == "EUR")
        |> filter(fn: (r) => r.friendly_name == "Strom_Preis_pro_kWh")
        |> filter(fn: (r) => r["_field"] == "value")
        |> aggregateWindow(every: v.windowPeriod, fn: mean)
        |> set(key: "_field", value: "cost_kwh")

union(tables: [self_used_kwh, cost_kwh])
    |> pivot(rowKey: ["_time"], columnKey: ["_field"], valueColumn: "_value")
    |> map(fn: (r) => ({r with _field: "saving", _value: r.self_used_kwh * r.cost_kwh}))
    |> cumulativeSum()

I had to add those two filter(fn: (r) => r["_field"] == "value") lines, and I adjusted the range to test it from the InfluxDB v2 Data Explorer.

But now it only returns a flat line with all values being zero. I can only suspect this is because the cost_kwh table has very few values: it only gets a new point when my kWh cost changes, which I entered manually once in June 2023 (when I started with Home Assistant) and now twice within April 2024. So only 3 values in 10 months. Any ideas on that?

I also noticed that Flux became deprecated with InfluxDB v3, where SQL and InfluxQL are the focus. Does this have any effect on InfluxDB v2 and its InfluxQL capabilities? Maybe such a cumulative sum is now possible with InfluxQL?

Best regards and thank you.

I’d have to see the actual query results to really give any proper guidance. I’m guessing that the visualization is just connecting the disparate points and there are actually large gaps in time with no data. Is that the case?
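To illustrate what I mean (a plain Python sketch of the idea, not actual Flux): when the cost series only has a point on the days the price changes, pairing values strictly by timestamp leaves most rows without a cost, so the product is empty. Carrying the last known price forward into the gaps, the fill-previous idea, gives every day a usable cost:

```python
# Sparse cost series: only written when the price actually changes.
days = ["d1", "d2", "d3", "d4", "d5"]
self_used_kwh = {"d1": 2.0, "d2": 3.0, "d3": 1.0, "d4": 4.0, "d5": 2.0}
cost_kwh = {"d1": 0.30, "d4": 0.35}  # no points on d2, d3, d5

# Naive pairing: a missing cost means no saving for that day.
naive = [self_used_kwh[d] * cost_kwh[d] if d in cost_kwh else None for d in days]

# Forward-fill: carry the last known price into the gaps, then multiply.
filled, last = {}, None
for d in days:
    last = cost_kwh.get(d, last)
    filled[d] = last

savings = [round(self_used_kwh[d] * filled[d], 2) for d in days]

# Running total of the per-day savings.
cumulative, total = [], 0.0
for s in savings:
    total += s
    cumulative.append(round(total, 2))

print(naive)
print(savings)
print(cumulative)
```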

This has no effect on InfluxDB v2.

Cumulative sum is supported in InfluxQL with all versions of InfluxDB:

Well … I got confused here, as you told me to move to InfluxDB v2 to make that possible. But that was due to the join nature of my query, right, and not because of the cumulative sum itself?

I would like to show you some data here, but the result shows up as 3 tables when selecting raw data, and I am only allowed to put one image in a post here :man_shrugging:. Is there a way to export such a result from the Data Explorer to a CSV to upload it here as one file?

Thx.

Ok, whatever I said earlier about those 3 tables … it seems I was wrong. It is indeed 2 tables that I get as a result.
I selected 7 days (02.04. 00:00 until 09.04. 00:00) of old data here, which are part of the period I migrated from my InfluxDB v1. You can see the result below.

Stuff to note:

  • that weird first line, which looks like it belongs to a different table
  • the single cost_kwh value appearing only on 07.04., although no cost change happened on that day. A Home Assistant (HA) restart on that day may have caused it. The current mechanism only writes new data into that series on a real cost change (or on an HA restart), so the series is expected to be empty on most days and should take its last real value, no matter how many days in the past that last point lies

Thx for your help.