Replication stream buffer size

LC_BTS · April 18, 2024, 7:38am

Hello,

I’m using the replication stream feature to keep one bucket in sync with another remote one, no complain here, works very well !

I would like to get some technical information about the way the buffer is stored on disk:

is the data encoded? Compressed? Is it raw binary data?

The goal is to get an estimate of the disk space my queries would take if the remote is down for a certain amount of time. My write queries are always the same in terms of number of fields, measurement and tags. I can of course make an experiment and measure the size after a while but I’d like technical, precise information if possible

Thank you very much,

Anaisdg · April 19, 2024, 3:56pm

Hello @LC_BTS,
Welcome to the community! Thanks for your question. Out of curiosity what are you doing with InfluxDB?
I dont know. I’m asking around.
As far as estimating disk space of buffer, there isn’t a tool for this. I assume you’ll have to run some test where you make the destination unavailable and monitor disk usage. I’m sorry I can’t be more helpful.

Anaisdg · April 22, 2024, 4:43pm

@LC_BTS
Here’s what I gathered:

IIRC the EDR implementation is very similar to hinted handoff in Enterprise (and I assume storage format is similar). I think the Enterprise docs might have some guidance on sizing the hinted handoff queue you could pull from.
Configure InfluxDB Enterprise data nodes | InfluxDB Enterprise Documentation
InfluxDB Enterprise features | InfluxDB Enterprise Documentation

LC_BTS · April 23, 2024, 12:42pm

Hello, thanks for your answer, I’m managing a device that may loose internet connection for a few hours/days and I’d like to know if the replication stream mechanism can handle the downtime, given our current rate of measurements.

LC_BTS · April 23, 2024, 12:45pm

Hello, thanks for the links!

I also took a peak at the source code directly and found more information, the buffer is compressed in gzip using a go library, meaning I should be able to make more accurate tests using the same code.

Anaisdg · April 23, 2024, 7:20pm

@LC_BTS Thanks for sharing!

Topic		Replies	Views
Compression during replication? Or on remote buckets? InfluxDB 2	0	269	November 16, 2022
InfluxDB 2.2 replication InfluxDB 2	7	2048	June 22, 2022
InfluxDB subscriber disk size differs from "main" InfluxDB 1	3	484	July 21, 2021
Replication Stream throttled? InfluxDB 2 influxdb , performance	2	552	December 14, 2023
Error with replication InfluxDB 2	0	607	February 13, 2023

Replication stream buffer size

Related topics