Reduce cardinality by split tags to buckets

bentt6 · September 19, 2021, 8:26pm

Hi,

I’m planning a system that will use influxdb, I think I understand the concepts of tags/measurements/fields/buckets.

I have a tag that its value can be one of a few option(for example “Continents”), is it more efficient(memory/speed) to split it to different buckets ?

In a query I will always request information about specific continent.
the schema is the same and the RP is the same.

Thanks,
Ben

Anaisdg · September 20, 2021, 7:16pm

Hello @bentt6,
buckets, measurements, and tags are all indexed. Yes storing the information from a specific continent together could be more efficient from a query perspective. However, I would consider storing the information in separate measurements instead of separate buckets. There isn’t an performance advantage for storing in different measurements vs buckets.
Why are you considering storing the data in different buckets instead of measurements?

I would say that if you need to manage the authorization of the data from different continents differently, then I would then i would split the data into different buckets (with different tokens scoped to each bucket).
Otherwise I would put all my data in one bucket under different measurements.

Topic		Replies	Views
Is there performance penaly for having multipe measurements instead of one measurement with multiple tags	6	3700	April 16, 2019
Schema design: how may tags InfluxDB 2 influxdb , schema , query , flux	5	2775	February 23, 2021
Splitting data across measurements or introducing tags influxdb , influxdb-cloud-2-0	0	612	November 1, 2022
Noobie questions Welcome & Getting Started influxdb , cardinality	2	606	November 30, 2022
Space use when dealing with tags Store influxdb	2	1095	April 19, 2019

Reduce cardinality by split tags to buckets

Related topics