Understanding the role of timestamp precision

stefanobaghino · May 9, 2018, 9:05am

Hi all,

I’m trying to get a better understanding on the role of timestamp precision and how the storage engine takes advantage of this property, in order to better understand how to address certain concerns with my current project.

From what I got from the documentation, the HTTP API, unlike the CLI interface, allows you to explicitly specify a precision, for which it “recommend[s] using the least precise precision possible as this can result in significant improvements in compression.”

Why the verb “can”? What differentiates whether providing the precision improves the storage utilization efficiency? My intuition suggests me that it’s just a matter of using the same precision for a given measurement, but the documentation doesn’t state this precisely that and I haven’t read the code (yet).
Is precision considered only when it is passed explicitly, or is InfluxDB somehow able to infer that a given timestamp, despite being provided in nanoseconds, has actually a minute-level coarseness? The latter seems unlikely, intuitively it looks to me that it would be difficult to handle edge cases.
What happens if values are written with different precision levels to the same measurement? My understanding from the docs is that ultimately all information about write-time precision is lost, but I’d like to understand if and how this affects storage efficiency and read-time performance.

Thanks in advance!

katy · May 25, 2018, 11:28pm

Hey @stefanobaghino!

If you haven’t read this doc on the compression of the timestamps, it might help with this point. The “can” is because it can be very difficult to make promises about improvements in compression given the number of variables in the process. You’re correct in that different precisions would change the compression rates.
Right now, InfluxDB defaults to nanosecond precision. There’s an open GH issue regarding the default. InfluxDB doesn’t infer anything about the time currently, but that is one of the issues being discussed in that issue.
When values are written to the same measurement with different time precisions, nothing good happens. InfluxDB will add zeroes to the less precise time until it matches the specified (or default) precision, which increases the likelihood that there will be some collision with other points. It doesn’t so much affect the storage efficiency, but it might skew your data by writing over duplicate points.

I hope that helps!

cuxcrider · November 6, 2019, 7:05am

Hi @katy and @stefanobaghino,

I found this thread and it applies to a question I asked in a separate thread, so I thought I would give a bump here because I’m really confused about what is happening. Thanks!

Topic		Replies	Views
Why does using the coarsest possible timestamp precision when writing data to InfluxDB maximize performance? InfluxDB 1	2	1153	January 13, 2022
Are there any problems having multiple time precision within the same measurement? Store influxdb , time-series	3	1537	November 8, 2019
Different timestamp precision influxdb , time	0	494	April 20, 2023
Strange behavior http with -precision ms and ns Store influxdb	2	1668	October 2, 2020
Time precision in telegraf output or influxdb	5	2600	September 26, 2019

Understanding the role of timestamp precision

Related topics