Simulate and ingest high frequency live stream data in Influx for stress test

luca15f · January 18, 2019, 11:14am

Hi all,

I would like to simulate sensor data (i.e. float numerical data coming from machines, at very high frequency, say 10kHz), and send it to a server, where InfluxDB runs.

I have tried, by now, a simple multi-threaded python script, but i cannot reach that 10k values-er-second flow.

What else can I do to reach my goal?

rawkode · January 18, 2019, 1:32pm

Hey @luca15f,

You may have better luck with our stress testing tool, available on GitHub.

Let us know how you get on

luca15f · January 25, 2019, 10:56am

Hi @rawkode ,

I gave a look (and then extensively used) the mentioned stressing tool. It actually works as expected: I was able to simulate a very high frequency (peak frequency: 120kHz).

Activable flags for the insert cmd are very useful; the only thing that drove me crazy is that I could not specify how many concurrent writers to use for a test. For example (with -f flag always activated):

influx-stress insert … -b 5000 … led to the creation of 40 concurrent writers;
influx-stress insert … -b 10000 … (default batch) led to the creation of 20 concurrent writers;
influx-stress insert … -b 20000 … led to the creation of 10 concurrent writers;
…
influx-stress insert … -b 200000 … led to the creation of 1 (non-)concurrent writer.

It seems that #writers gets lowered by an half everytime i double the size of the used batch. In fact, all above tests had, more or less, the same result in terms of PPS inserted.

I am really interested in varying batch size while fixing concurrent writers number.

How can I achieve this goal?

P.S. I have noticed that, 49 times out of 50, if I specify a time limit (with the -r flag), the script actually crashes when time runs out, giving no (fundamental) information on number of inserted points and PPS (the results I were referring to above were measured everytime counting all points in the database, making me waste a lot of time). Is it a known issue, or can it depend on my machine?

Thanks, Luca

Topic		Replies	Views
Timestamp with stress tool influxdb	8	2319	June 23, 2020
Increasing InfluxDB insertion rate via Influx-Python lib	6	4509	January 2, 2019
Inserting data timing	5	1290	May 26, 2017
Telegraf ( statsD ) input not writting to InfluxDB Telegraf	0	706	April 5, 2017
What is the best way to insert 100k messages per second Telegraf	2	807	November 13, 2018

Simulate and ingest high frequency live stream data in Influx for stress test

Related topics