Memory consumption increases during data import

influxdb
#1

Hello community,
we have written a piece of software that is supposed to migrate time-series data into InfluxDB.
The dataset we want to migrate contains about 2 billion entries in total, spread across approx. 130k series.
What our migration software basically does against InfluxDB is query some not (yet) existing series (getting empty results) and then send the data series by series. So once a series has been imported, it is not touched again during this phase.
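The write pattern looks roughly like the following minimal sketch (shown with the Python influxdb client for illustration only; host, database, measurement names and batch size are placeholders, not our actual code):

```python
from influxdb import InfluxDBClient

# Placeholder connection details for illustration.
client = InfluxDBClient(host="localhost", port=8086, database="migration")

def migrate_series(measurement, tags, rows):
    """Write one source series into InfluxDB.

    rows is assumed to be an iterable of (timestamp, value) pairs.
    """
    # Query the (not yet existing) target series first; for a new series
    # this simply returns an empty result set.
    where = " AND ".join('"{0}" = \'{1}\''.format(k, v) for k, v in tags.items())
    client.query('SELECT * FROM "{0}" WHERE {1} LIMIT 1'.format(measurement, where))

    points = [
        {
            "measurement": measurement,
            "tags": tags,
            "time": ts,
            "fields": {"value": value},
        }
        for ts, value in rows
    ]
    # The batch size is illustrative; smaller batches keep the per-request
    # payload and client-side memory footprint bounded.
    client.write_points(points, batch_size=5000)
```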

While this is running, we observe constantly increasing memory consumption of the influxd process, as shown here:

My questions are the following:

  • Why does memory usage keep increasing?
  • What can we do to influence this, either by changing settings in influxdb.conf or by changing the behaviour of our migration code?
  • What memory consumption would you expect when importing a dataset of the size mentioned above?

Thanks for your help and comments,
Lars

#2

I would suggest looking at

  1. DB schema
  2. The TSI index option (index-version = "tsi1" in the [data] section of influxdb.conf)

https://docs.influxdata.com/influxdb/v1.4/administration/differences/#tsi-release
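To get a feel for how heavy the schema actually is, you can ask InfluxDB for the cardinality directly. A minimal sketch, assuming InfluxDB 1.4 or later and the Python influxdb client; the database name and the "host" tag key are placeholders:

```python
from influxdb import InfluxDBClient

client = InfluxDBClient(host="localhost", port=8086, database="mydb")

# Estimated total number of series in the database.
series = client.query("SHOW SERIES CARDINALITY")
print(list(series.get_points()))

# Per-tag cardinality helps to spot the tag keys that blow up the index.
tag_values = client.query('SHOW TAG VALUES CARDINALITY WITH KEY = "host"')
print(list(tag_values.get_points()))
```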

#3

In my experience, 130k series may already be heavy. When you have tags with an unbounded (or very large) value range, the in-memory index grows quickly. You may have to change such tags into fields and reduce cardinality.
IIRC my database did not fit into 3 GB of memory when I had 40k series. But as the documentation says, memory usage grows exponentially with cardinality, so your mileage may vary a lot.
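For illustration, a minimal sketch of such a tag-to-field change, again assuming the Python influxdb client; "requests", "host" and "request_id" are made-up names standing in for a high-cardinality value:

```python
from influxdb import InfluxDBClient

client = InfluxDBClient(host="localhost", port=8086, database="mydb")

# High-cardinality variant: every distinct request_id creates a new series
# and therefore a new entry in the in-memory index.
client.write_points([{
    "measurement": "requests",
    "tags": {"host": "web01", "request_id": "7f3a9c"},
    "fields": {"duration_ms": 12.4},
}])

# Lower-cardinality variant: request_id is stored as a field, so the series
# count is driven only by the remaining tags (here: host).
client.write_points([{
    "measurement": "requests",
    "tags": {"host": "web01"},
    "fields": {"duration_ms": 12.4, "request_id": "7f3a9c"},
}])
```

The trade-off is that fields are not indexed, so queries that filter or group on request_id have to scan the data instead of using the index.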