Duration request influx sometimes takes a lot of time

spyfox · September 8, 2020, 2:04pm

hello, I have a problem that I would like to understand, sometimes an influx reading request takes a lot of time.
I developed this little code to explain my problem, but I have this problem in productions with my real data

db_name = 'testBase'
inf_client = DataFrameClient(database=db_name)
inf_client.drop_database(db_name)
inf_client.create_database(db_name)
df = pd.DataFrame(data=list(range(100)),
                  index=pd.date_range(start='2014-11-16',
                                      periods=100, freq='H'), columns=['0'])
inf_client.write_points(df, db_name)
duration_data = {}
for i in range(0, 50):
    start_time = time.time()
    data = inf_client.query("select * from testBase  where time>='2014-11-16 00:00:00' and time<='2014-11-16 10:00:00'")
    duration = time.time() - start_time
    duration_data[i] = duration

data_df = pd.DataFrame(duration_data.items(), columns=["index", "duration"])
del data_df['index']
data_df.plot()

philjb · September 9, 2020, 8:24pm

Hi @spyfox - welcome to the community! It is hard to give a specific response without much more detail, but in general, InfluxDB needs to load memory mapped files. It takes time for the OS to pull these in, but once resident accessing the same data again is faster. There’s no promise the OS keeps the data resident either because of other demands. It is odd that iteration ~4 is the slow one; how consistent is your graph if you run it 100s of times? I assume the y-axis units are seconds? ~50ms for the slowest example? While this example query is interesting, I think we would be better served trying to optimize your production query if you’re willing to share it.

spyfox · September 10, 2020, 11:59am

thanks for your answer, yes, the y axis is second and x axis is iteration number
i try with 200 iteration and i have this:

i have the same situation

philjb · September 30, 2020, 7:26pm

Sorry- I meant if you run ~25 iterations quickly and then wait say 60 minutes and run 25 iterations of the query again, how consistent is it that the 4th query in particular is the slowest one? Not 200 iterations quickly.

Topic		Replies	Views
Using python client, query on a specific bucket take 75.x seconds InfluxDB 2 flux	2	672	October 23, 2022
InfluxDB Query Time Spikes - Seeking Insights InfluxDB 2	0	204	September 4, 2023
Python pandas influxdb 2.0 influxdb	2	993	May 6, 2020
Performance tuning to get DataFrame (335280, 21) in one second Welcome & Getting Started query	2	478	June 14, 2020
Query times for "load all the data" queries Store influxdb , schema , influxql	10	4814	May 4, 2017

Duration request influx sometimes takes a lot of time

Related topics