Full compaction process

#1

Hello! I saw some strange behavior from InfluxDB last night. The main DB is about 640 GB, and the partition it lives on is about 788 GB. When Influx started a full compaction of the database group, CPU usage went mad, Heap in Use went mad, and after a while the disk filled up and Influx just stopped working. I am using InfluxDB 1.2.1.

My question: are there any basic recommendations about free disk space? How much do I need for a 640 GB DB to get through the compaction process successfully?

Marijus

#2

Hi Marijus,

Full group compactions create temporary duplicates of the TSM files inside the shard being processed.

In the worst case scenario, if all of your shards have been modified in the last cycle and are picked up for full compaction at the same time, you would need 2x the space used at rest.

Typically, you need to have at least twice the size of your current hot shards in reserve for compactions to run.
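As a rough way to estimate that reserve, here is a minimal Python sketch. It assumes the InfluxDB 1.x on-disk layout of `<data_dir>/<db>/<rp>/<shard_id>/` and simply applies the worst case above (every shard duplicated at once); the paths and the headroom function are illustrative, not an official tool:

```python
import os
from collections import defaultdict

def tsm_bytes_per_shard(data_dir):
    """Sum the on-disk size of .tsm files under each shard directory.
    Assumes the 1.x layout: <data_dir>/<db>/<rp>/<shard_id>/*.tsm."""
    sizes = defaultdict(int)
    for root, _dirs, files in os.walk(data_dir):
        for name in files:
            if name.endswith(".tsm"):
                shard = os.path.relpath(root, data_dir)
                sizes[shard] += os.path.getsize(os.path.join(root, name))
    return dict(sizes)

def worst_case_headroom(sizes):
    """Worst case from the discussion above: every shard is fully
    compacted at the same time and each temporarily duplicates its TSM
    files, so reserve free space equal to the at-rest TSM size
    (2x total footprint while the compaction runs)."""
    return sum(sizes.values())
```

For example, `worst_case_headroom(tsm_bytes_per_shard("/var/lib/influxdb/data"))` (path assumed, adjust to your config) gives the free bytes to keep available in the worst case; in practice only the shards that were recently written need the duplicate space.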

#3

Hello Kostas,

Thank You for the reply.

Looks like I know what caused the problem. I was rewriting/changing some historical data in my InfluxDB. As I understand it, that data had already been compacted, and since those shards received a bunch of new data, they had to go through the full compaction process again?

By the way, is there a way to determine the current hot shards?

Marijus
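(For later readers: `SHOW SHARDS` in the 1.x influx CLI lists each shard's time range, which is one way to answer this. As a filesystem-level alternative, here is a hedged Python sketch that treats a shard as "hot" if any of its TSM/WAL files was modified recently; the one-hour window and the assumption that WAL files live under the same tree are illustrative only — in a default 1.x install the WAL sits in a separate directory.)

```python
import os, time

def hot_shards(data_dir, window_seconds=3600):
    """Heuristic: flag a shard directory as 'hot' if any .tsm or .wal
    file under it was modified within the window, i.e. it is still
    receiving writes and may be picked up by the next full compaction."""
    now = time.time()
    hot = set()
    for root, _dirs, files in os.walk(data_dir):
        for name in files:
            if name.endswith((".tsm", ".wal")):
                path = os.path.join(root, name)
                if now - os.path.getmtime(path) <= window_seconds:
                    hot.add(os.path.relpath(root, data_dir))
    return sorted(hot)
```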