Possible bug - data not deleted well after retention period has passed

Mikolaj_G · September 29, 2021, 4:21pm

I have the following three buckets:

Bucket redis_stats_1h receives data from another service and I have downsampling tasks that pass data from redis_stats_1h to redis_stats_2h and from redis_stats_2h to redis_stats_4h. The problem is that data in each bucket but is kept WAY LONGER than retention period + shard duration. In data explorer I can see the following results:

for query:

from(bucket: "redis_stats_1h")
  |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
  |> filter(fn: (r) => r["_measurement"] == "redis_cmdstats")
  |> filter(fn: (r) => r["_field"] == "calls")
  |> filter(fn: (r) => r["clusterName"] == "<cluster_name>")
  |> filter(fn: (r) => r["cmdstat_name"] == "info")
  |> aggregateWindow(every: v.windowPeriod, fn: mean, createEmpty: true)
  |> yield(name: "mean")

I can see data from more than 24 hours, while there should be only data from the last 2 hours (retention period + shard group duration). The same thing happens in buckets redis_stats_2h and redis_stats_4h. Here there is a plot for bucket redis_stats_2h that was generated using the same query:

The total disk size of bucket redis_stats_1h is 147M.

Note that I created this buckets from scratch, that is I did not modify retention/shard duration periods.

I could not find any information in influx docs on why this could be happening: is this a bug, or am I doing something wrong?

I use influx 2.0.8.

MzazM · September 30, 2021, 12:36pm

if you type influx bucket list, what is the retention policy length associated to your 1h, 2h and 4h buckets?

Mikolaj_G · September 30, 2021, 2:49pm

I probably should have clarified that in my original post, but the first image is the result of command: “influx bucket list”. The retention policy is 1h for redis_stats_1h, 2h for redis_stats_2h and 4h for redis_stats_4h. All these buckets have shard group duration of 1h, as can be seen in my first screenshot.

Topic		Replies	Views
Influx Data Retention issue	0	209	December 1, 2023
Influxdb v2 old data with new retention policy InfluxDB 2 telegraf , schema , query , retention-policy	1	1539	June 28, 2021
[InfluxDB v2.7.1] Why the bucket retention period do not working? InfluxDB 2 influxdb	5	958	March 31, 2024
InfluxDB 2.0 - Data Loss InfluxDB 2 influxdb	1	525	July 30, 2020
Bucket set to "forever" for retention, but data deletes after 5 years InfluxDB 2	1	502	March 3, 2023

Possible bug - data not deleted well after retention period has passed

Related topics