Best practices for duplicating and restoring data from InfluxDB OSS for later analysis

Hi Everyone,

I have a question regarding best practices for backing up InfluxDB data on an edge computer running InfluxDB OSS.

To provide some background, we have an edge computer running InfluxDB OSS (on Ubuntu) at a remote site with limited Internet connectivity that is capturing data from BLE beacons. Due to the amount of data we will be capturing on site and the storage limits of the edge computer's internal SSD, we will need to set up the bucket in InfluxDB to delete data older than 7 days, and use a NAS connected to the edge computer to back up the data incrementally so that we can store all data for the duration of the project (estimated to be up to 10 TB over a six-week period).

Can anyone please advise on best practices for backing up the data to the NAS such that it can be restored for later analysis? Looking at this guide on the InfluxDB file system layout, our plan was to incrementally back up everything in the Data, WAL, and Metastore directories to the NAS. Will this be sufficient for us to copy these onto another machine running InfluxDB OSS later on and do analysis on the full six weeks of data? Is there anything else we need to do or be mindful of?

Many thanks,

Michael.

In general, you should use the backup/restore commands rather than copying the raw data directories.
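A rough sketch of what that could look like for InfluxDB OSS 2.x, assuming the NAS is mounted at `/mnt/nas` and using hypothetical bucket names and an `INFLUX_TOKEN` environment variable (adjust paths, names, and auth to your setup):

```shell
#!/bin/sh
# On the edge computer: back up one bucket to the NAS mount.
# Run this periodically (e.g. via cron) before the 7-day retention
# policy deletes the older data.
influx backup /mnt/nas/influx-backups/week1 \
  --bucket ble-beacons \
  --token "$INFLUX_TOKEN"

# Later, on the analysis machine: restore the backup under a new
# bucket name so successive restores do not collide.
influx restore /mnt/nas/influx-backups/week1 \
  --bucket ble-beacons \
  --new-bucket ble-beacons-week1 \
  --token "$INFLUX_TOKEN"
```

Using `--new-bucket` on restore is what lets you bring in each weekly backup side by side on the analysis machine.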

However, it is not possible to simply restore new data into an existing bucket. Instead, you could restore each backup into a separate bucket and then use a Flux query to merge the data from those buckets into a single target bucket.
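The merge step could be sketched with a Flux query run through the CLI, assuming hypothetical bucket and org names; you would repeat it once per restored weekly bucket:

```shell
# Copy everything from one restored bucket into a single analysis
# bucket. range(start: 0) means "from the Unix epoch", i.e. all data.
influx query --org my-org --token "$INFLUX_TOKEN" '
from(bucket: "ble-beacons-week1")
  |> range(start: 0)
  |> to(bucket: "ble-beacons-all")
'
```

Make sure the target bucket exists (with an infinite retention period) before running the query, and be aware that writing 10 TB this way will take a while, so you may want to merge in smaller time ranges.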