What is the recommended practice for "protecting" some data from a retention policy?

fluffynukeit · August 1, 2018, 9:25pm

I have high rate data, and over time I want to eliminate old data and instead store lower rate aggregate data. Exact intended use case for CQs and retention policies. Great.

However, some data I want to keep forever no matter their age. Specifically, I want to keep the previous X minutes of full rate data prior to a fault. Fault event records are also stored in InfluxDB.

What’s the best way to achieve this in InfluxDB 1.6? Is it possible to automate with CQs or should I plan to deliberately execute a custom query to achieve this (say, after each fault is detected)? If so, what would such a query look like?

Thanks.

Anaisdg · August 1, 2018, 10:05pm

Hello,

Would using Kapacitor and event handlers to identify a fault and then backfill fuller resolution data with a CQ help you? (You could also use Kapacitor to create a TICK script with threshold parameters for fault metrics)

fluffynukeit · August 1, 2018, 10:36pm

If I understand correctly, I don’t think Kapacitor will help me here. This is on an embedded system. The fault conditions are identified by dedicated hardware circuits, so I don’t need to do any kind of streaming processing to find the fault; I just look for the right hardware signal and log it into InfluxDB. Kapacitor could do this but it’s a much more heavyweight solution compared to adding a line or two of Python.

And I believe the link about CQs that you list shows that backfilling cannot be achieved with a CQ but must be done using a regular INTO query. The most basic solution I can think of is to manually execute an INTO query to copy data to special, separate database used for saving fault data every time a fault condition occurs. I didn’t know if this was best practice or not. Again, Kapacitor could do this but executing it in Python isn’t difficult.

Anaisdg · August 2, 2018, 4:04pm

Hello,

Yes, you’re correct. Using the INTO query is the best way to alternate between downsampling data and retaining full resolution data. Using Kapacitor becomes really handy when you want to detect fault events with specific metric conditions and perform various aggregate functions. There is no best practice your case. Go with your personal preference.

Topic		Replies	Views
Is there a way ton keep key date values in influxdb, bypassing retention system? Store	3	711	February 26, 2018
Downsampling data in-situ influxdb	1	1128	June 19, 2018
Backfilling a Continuous Query? Store influxdb	4	7319	September 26, 2017
Continuous Query to straight copy from one retention to a different retention policy?	2	817	December 10, 2020
Kapacitor data aggregation does not save aggregated data for all series influxdb , kapacitor	8	1850	April 3, 2017

What is the recommended practice for "protecting" some data from a retention policy?

Related topics