Is it more efficient to store metadata about points (unit of measure, variable) in a relational DB instead of repeating it for every point?

cemanuel · February 13, 2019, 10:35pm

I have a system with a bunch of sensors on it. Originally I was using MySQL to organize the data using the following schema:

The important tables for this discussion are:

System
Tag (caution: this is my own use of “tag” and shouldn’t be confused with the InfluxDB definition)
Tag usage
Unit of measure (volt, celsius, etc.)
Variable (temperature, pressure, etc.)
Data point

A system must be modeled with one or more tags. A tag must be the source of a tag usage. A tag usage must be expressed in terms of a unit of measure, as well as a variable. A data point must come from a tag usage.

For example, consider a temperature sensor on the system. The tag “T1” would model that sensor in the database, and two tag usages could be 1=“actual temperature” and 2=“temperature setpoint”, with 1 and 2 being primary keys.

Therefore, data being inserted into the datapoint table would look like:
tag_usage=1,value=“64”,timestamp=“2019-02-13 14:00:02”
tag_usage=2,value=“65”,timestamp=“2019-02-13 14:00:02”
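
To make the relationships above concrete, here is a minimal sketch of that layout. It uses Python's built-in sqlite3 in place of MySQL, and every table name, column name, the system name, and the choice of "celsius" as the unit are assumptions for illustration, not the original schema.

```python
import sqlite3

# In-memory SQLite stands in for MySQL; all names below are assumed.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE system (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE unit_of_measure (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE variable (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE tag (
    id INTEGER PRIMARY KEY,
    system_id INTEGER NOT NULL REFERENCES system(id),
    name TEXT NOT NULL
);
CREATE TABLE tag_usage (
    id INTEGER PRIMARY KEY,
    tag_id INTEGER NOT NULL REFERENCES tag(id),
    unit_id INTEGER NOT NULL REFERENCES unit_of_measure(id),
    variable_id INTEGER NOT NULL REFERENCES variable(id),
    description TEXT NOT NULL
);
CREATE TABLE data_point (
    tag_usage_id INTEGER NOT NULL REFERENCES tag_usage(id),
    value REAL NOT NULL,
    timestamp TEXT NOT NULL
);
""")

# Metadata is written once: one system, one tag, two tag usages.
conn.execute("INSERT INTO system (id, name) VALUES (1, 'demo system')")
conn.execute("INSERT INTO unit_of_measure (id, name) VALUES (1, 'celsius')")
conn.execute("INSERT INTO variable (id, name) VALUES (1, 'temperature')")
conn.execute("INSERT INTO tag (id, system_id, name) VALUES (1, 1, 'T1')")
conn.executemany(
    "INSERT INTO tag_usage (id, tag_id, unit_id, variable_id, description) "
    "VALUES (?, 1, 1, 1, ?)",
    [(1, "actual temperature"), (2, "temperature setpoint")],
)

# Each data point carries only the tag usage key, the value, and the time.
conn.executemany(
    "INSERT INTO data_point (tag_usage_id, value, timestamp) VALUES (?, ?, ?)",
    [(1, 64.0, "2019-02-13 14:00:02"), (2, 65.0, "2019-02-13 14:00:02")],
)

# Joining back through tag_usage recovers the metadata for every point.
rows = conn.execute("""
    SELECT tag.name, tag_usage.description, variable.name,
           unit_of_measure.name, data_point.value, data_point.timestamp
    FROM data_point
    JOIN tag_usage ON tag_usage.id = data_point.tag_usage_id
    JOIN tag ON tag.id = tag_usage.tag_id
    JOIN variable ON variable.id = tag_usage.variable_id
    JOIN unit_of_measure ON unit_of_measure.id = tag_usage.unit_id
    ORDER BY data_point.tag_usage_id
""").fetchall()
```

In this layout the unit and variable are reachable only through a join on the tag usage, which is the normalization the question is concerned with.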

I’d like to use InfluxDB to store the time series data (i.e. the datapoint table), as that is what it is built to do. The documentation seems to recommend storing a tag such as “variable=temperature” with every data point that represents a temperature measurement, and likewise for other metadata. That seems to violate database normalization, and I would expect the metadata to be stored elsewhere.
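
For comparison, here is a rough sketch of what one such point might look like when the metadata travels with it as InfluxDB tags. It builds a line protocol string by hand instead of using a client library. The measurement name "sensor", the tag keys, the unit "celsius", and reading the timestamp as UTC are all assumptions.

```python
from datetime import datetime, timezone


def escape_tag(text):
    """Escape commas, equals signs, and spaces in tag keys and values."""
    return text.replace(",", "\\,").replace("=", "\\=").replace(" ", "\\ ")


def to_line(measurement, tags, value, when):
    """Format one point as: measurement,tag_set field_set timestamp_ns.

    The measurement name is used as-is, so it must not contain
    commas or spaces in this simplified sketch.
    """
    tag_part = ",".join(
        f"{escape_tag(key)}={escape_tag(val)}" for key, val in sorted(tags.items())
    )
    timestamp_ns = int(when.timestamp()) * 10**9
    return f"{measurement},{tag_part} value={float(value)} {timestamp_ns}"


# Hypothetical tag set mirroring the relational metadata for usage 1.
tags = {
    "tag": "T1",
    "usage": "actual temperature",
    "variable": "temperature",
    "unit": "celsius",
}
when = datetime(2019, 2, 13, 14, 0, 2, tzinfo=timezone.utc)
line = to_line("sensor", tags, 64, when)
```

Every line written this way repeats the full tag set, which is the apparent redundancy the question is about.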

Related questions:
Schema for Plants, Devices and Signals
Schema Considerations - 1 tag key instead of multiple field keys
Using Influx to monitor environmental data
Schema advice for commercial solar monitoring

EDIT 1: Add related questions.
EDIT 2: Change voltage to temperature in last paragraph.
EDIT 3: Add another related question.
EDIT 4: Add another related question.

cemanuel · February 19, 2019, 6:53pm

Tag keys and values are stored only once per series, so sending them with every data point does not store redundant data.
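
The idea can be illustrated with a toy model. This is not InfluxDB's actual storage engine, only a sketch of the principle: points that share a measurement and tag set map to the same series key, so the tag text is held once per series while each point adds only a timestamp and a value. The class and function names, the measurement name, and the sample values are all hypothetical.

```python
from collections import defaultdict


def series_key(measurement, tags):
    """Build a key from the measurement and the sorted tag set."""
    return measurement + "," + ",".join(
        f"{key}={val}" for key, val in sorted(tags.items())
    )


class ToySeriesIndex:
    """Toy model: one entry per series, many (timestamp, value) pairs each."""

    def __init__(self):
        self.series = defaultdict(list)

    def write(self, measurement, tags, timestamp, value):
        # The tag set only selects the series; it is not copied per point.
        self.series[series_key(measurement, tags)].append((timestamp, value))


index = ToySeriesIndex()
actual = {"tag": "T1", "usage": "actual", "variable": "temperature"}
setpoint = {"tag": "T1", "usage": "setpoint", "variable": "temperature"}

# Four points are written, each "sending" its full tag set.
index.write("sensor", actual, "2019-02-13T14:00:02Z", 64.0)
index.write("sensor", setpoint, "2019-02-13T14:00:02Z", 65.0)
index.write("sensor", actual, "2019-02-13T14:00:03Z", 64.5)
index.write("sensor", setpoint, "2019-02-13T14:00:03Z", 65.0)

series_count = len(index.series)
point_count = sum(len(points) for points in index.series.values())
```

After the four writes, the toy index holds two series keys and four points, so the tag text exists twice in total and not four times.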

Bigman74066 · August 24, 2022, 10:00am

If tag keys and values are stored only once, is there a way to mention them only once as well? For example, when storing the units of a signal, I don’t want to state the units every time I write a value to InfluxDB. I want to state once that the units are “bar” and then store only the value for subsequent timestamps.
