Backfilling a Continuous Query?

dcadwallader · August 9, 2017, 10:49pm

We have one measurement that stores fine-grained event data.

From this, we create several continuous queries that roll up the data in specific ways.

Let’s say that later we realize we want to have a new continuous query, but we wish we had created it at the beginning! Is it possible to “backfill” that continuous query so it runs not just on data going forward, but also fills in historical data as well?

sbains · August 9, 2017, 11:00pm

Yeah, you can run the same SELECT that the CQ uses, with a time range added to the WHERE clause.

e.g.

CQ
CREATE CONTINUOUS QUERY "1h_event_count"
ON "db_name"
BEGIN
SELECT sum("count") AS "count"
INTO "2_years"."events"
FROM "6_months"."events"
GROUP BY time(1h)
END;

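Backfill the measurement by running the query manually with an explicit time range (InfluxQL expects time literals in single quotes):

SELECT sum("count") AS "count"
INTO "2_years"."events"
FROM "6_months"."events"
WHERE time > '<start time>' AND time < '<end time>'
GROUP BY time(1h)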

JeremySTX · August 11, 2017, 1:28am

Just be careful if your data is sparse. The SELECT INTO will iterate over every possible time interval between the specified start and end times, which could be a lot of wasted queries if there is no data for an extended time period.

For example, if your devices only take readings during the day and are quiescent overnight, the SELECT INTO will generate a lot of pointless data searches looking for overnight data.
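If the quiet periods are known in advance, one possible way around this is to backfill only the active hours of each day. The sketch below is a rough illustration, not a tested recipe: the 06:00 to 18:00 UTC active window and the helper names `daytime_windows` and `where_clause` are assumptions made up for the example. It only builds the WHERE clauses to append to the backfill SELECT.

```python
from datetime import datetime, timedelta, timezone

FMT = "%Y-%m-%dT%H:%M:%SZ"  # RFC3339 timestamps; datetimes are assumed to be UTC


def daytime_windows(first_day, num_days, start_hour=6, end_hour=18):
    """Return (start, end) pairs covering only the assumed active hours.

    first_day must be midnight UTC of the first day to backfill.
    The 6..18 default is a made-up example, not a value from the thread.
    """
    if not 0 <= start_hour < end_hour <= 24:
        raise ValueError("need 0 <= start_hour < end_hour <= 24")
    windows = []
    for offset in range(num_days):
        midnight = first_day + timedelta(days=offset)
        windows.append((midnight + timedelta(hours=start_hour),
                        midnight + timedelta(hours=end_hour)))
    return windows


def where_clause(start, end):
    """Build a half-open time filter so adjacent windows never overlap."""
    return "WHERE time >= '{}' AND time < '{}'".format(
        start.strftime(FMT), end.strftime(FMT))


if __name__ == "__main__":
    first = datetime(2017, 8, 1, tzinfo=timezone.utc)
    for start, end in daytime_windows(first, 3):
        print(where_clause(start, end))
```

Keeping the window edges on whole hours matters here, because the rollup groups by time(1h) and a window that starts mid-hour would only see part of that bucket.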

dcadwallader · September 26, 2017, 2:11pm

@sbains Thanks for the tip! If I’m backfilling months of data, is this query going to potentially cause InfluxDB to choke? My understanding about CQs is that they perform well because they are only looking at a small slice of time at once. With a very wide time range in the “where” clause, would this cause a huge mega-query that could cause Influx to crash or slow down for many hours?

sbains · September 26, 2017, 6:02pm

Yeah, you will need to split the query into smaller time frames so that no single query is large enough to cause performance issues.
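One way to do the splitting is to generate one statement per time window with a small script and run them one at a time. The sketch below is only an illustration under a few assumptions: the database, retention policy and measurement names are copied from the example earlier in the thread, the one-day step is an arbitrary choice, and `backfill_queries` is a hypothetical helper name. It prints the InfluxQL statements rather than executing them, so they can be reviewed or piped into whatever client you use.

```python
from datetime import datetime, timedelta, timezone

# Names taken from the example earlier in the thread; adjust to your schema.
QUERY_TEMPLATE = (
    'SELECT sum("count") AS "count" '
    'INTO "2_years"."events" '
    'FROM "6_months"."events" '
    "WHERE time >= '{start}' AND time < '{end}' "
    "GROUP BY time(1h)"
)

FMT = "%Y-%m-%dT%H:%M:%SZ"  # RFC3339 timestamps; datetimes are assumed to be UTC


def backfill_queries(start, end, step=timedelta(days=1)):
    """Yield one statement per window of length `step` covering [start, end)."""
    if step <= timedelta(0):
        raise ValueError("step must be positive")
    cursor = start
    while cursor < end:
        # The last window is clipped so it never runs past `end`.
        window_end = min(cursor + step, end)
        yield QUERY_TEMPLATE.format(start=cursor.strftime(FMT),
                                    end=window_end.strftime(FMT))
        cursor = window_end


if __name__ == "__main__":
    start = datetime(2017, 1, 1, tzinfo=timezone.utc)
    end = datetime(2017, 1, 4, tzinfo=timezone.utc)
    for query in backfill_queries(start, end):
        print(query)
```

The windows are half-open (>= start, < end) so that consecutive statements do not overlap. The start time and step should fall on whole hours; otherwise a one-hour bucket would be split across two statements, and the later write would replace the earlier partial sum for that timestamp.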
