Nested query performance InfluxDB 1.8

rvdheij · June 8, 2020, 9:14am

I gather data from multiple agents into the same InfluxDB to provide an enterprise wide summary. The metrics I have are detailed hypervisor CPU usage gathered by a guest. For a total overview, I want to add up the per-CPU metrics and get a query like this:

select sum(usage) as usage from zvm_lpar_sytcup
where time > now() - 5m group by time(1m),seqnr,cputype

This runs fine and completes in 60 ms or so.
Now the challenge is that when two guests happen to run on the same hypervisor, we get the data from that hypervisor twice, and my sum() doubles the amount. So I came up with this:

select max(*) from
    (select sum(usage) as usage from zvm_lpar_sytcup
         where time > now() - 5m group by time(1m),seqnr,cputype,lpar)
    group by time(1m),seqnr,cputype

This gives the correct results, but the query takes 550 ms to run. I tried to add the where clause to the outer select as well, but that did not help much.
My Grafana dashboard gets the data for 3 or 6 hours, which takes 1.2 seconds with just a few systems feeding data. Now I can do a continuous query for that last one, which makes the dashboard quick again, but it does burn 1% of CPU non-stop (even when nobody is looking). I’m concerned about usage when we have a few dozen systems feeding data.

rvdheij · June 8, 2020, 10:12am

I now made two separate continuous queries; one for the inner select (that takes 60 ms) and one for the outer select (using the result from the first CQ) and that takes 2 ms. I suppose because I defined them in this order, that the inner runs before the outer, so it actually works… But since the various system clocks are not necessarily in sync, I run with ‘resample every 1m for 10m’ anyway, so would catch the data later.

But it seems we do have an issue with the nested query when that takes an order of magnitude longer.

Anaisdg · June 8, 2020, 8:21pm

Hello. @rvdheij,
I’m confused. Have you resolved your problem? Is there anything else I can help with? Thank you.

rvdheij · June 8, 2020, 8:32pm

Hi @Anaisdg
I have bypassed the issues by breaking up that nested query. I would like to understand why it was so expensive or what I could have done to avoid that.
-Rob

Anaisdg · June 8, 2020, 10:12pm

Hello @rvdheij,
It’s hard to know exactly without some more information, but I notice that your outer query isn’t bound with a time range, which could be the problem. Try adding a where clause to the outside query.
Thanks

system · June 9, 2020, 7:31am

This topic was automatically closed 60 minutes after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Nested queries performing horribly - performance bug? Store influxdb	4	1432	June 22, 2017
Graphing the sum of several counters Dashboards	5	6618	March 16, 2017
Select from multiple measurements influxql	5	13941	February 26, 2018
Poor subquery performance for aggregations influxdb , query , performance	1	928	March 23, 2021
Long query time (30sec+) on LANCache Dashboard InfluxDB 2	5	116	August 24, 2024

Nested query performance InfluxDB 1.8

Related topics