Performance tuning to get DataFrame (335280, 21) in one second

yuyongliao · June 14, 2020, 7:52am

Dear ALL,

I’m new to InfluxDB. Right now, we have 10 years of data. We need to get one-day data (size: 200MB, 335280 rows, 21 columns) in less than one second. Right now, we are doing POC on the InfluxDB standalone. However, it can’t meet our targets yet. Would you please suggest the performance tuning advice/guides for us to speed up our query? We need nearly one minute to get one-day data at this moment.

Keep safe!

Best regards,
Yuyong

Pooh · June 14, 2020, 8:53am

How about showing us the query you are using?

Then we might have a starting point to suggest what could be improved.

Also, which version of InfluxDB are you using, what operating system and
version is it installed on, and what sort of hardware is it on?

Antony.

yuyongliao · June 14, 2020, 12:56pm

Thanks, Antony!

Here is the python code I used.

client = DataFrameClient(host=bar_env.influxdbHost, port=bar_env.influxdbPort)
query = "select * from bardata.autogen.bardata where transactionDate = '2009-01-05';"
barDB = client.query(query)

InfluxDB version is 1.8.

The database is running on AWS Linux (4.14.177-139.254.amzn2.x86_64), 8CPU & 64G RAM, network bandwidth up to 25Gb

Topic		Replies	Views
Query times for "load all the data" queries Store influxdb , schema , influxql	10	4814	May 4, 2017
Duration request influx sometimes takes a lot of time	3	436	September 30, 2020
InfluxDB Query Time Spikes - Seeking Insights InfluxDB 2	0	204	September 4, 2023
Using python client, query on a specific bucket take 75.x seconds InfluxDB 2 flux	2	672	October 23, 2022
Only 1k rows/second query speed? Dashboards influxdb , query	0	498	April 4, 2020

Performance tuning to get DataFrame (335280, 21) in one second

Related topics