Telegraf: route aggregations to different buckets

tmcatalin · January 2, 2023, 10:54am

Hi,
I have few sensors sending data over MQTT and for which I would like to create 3 buckets:

one with raw data, to be kept for max 30 days
one with aggregated data, per hour, to be kept for 90 days
one with aggregated data, per day, to be kept for long periods
Below is the way I implemented the first two but I have the feeling could be done easier. In addition, I don’t know how to do the 3rd one.

[[aggregators.basicstats]]
  alias = "basic_stat"
  ## The period on which to flush & clear the aggregator.
  period = "1h"

  ## If true, the original metric will be dropped by the
  ## aggregator and will not get sent to the output plugins.
  drop_original = false

  ## Configures which basic stats to push as fields
  # stats = ["count","diff","rate","min","max","mean","non_negative_diff","non_negative_rate","stdev","s2","sum","interval"]  
  stats = ["count","min","max","mean"]
  
  fieldpass = ["humidity" , "temperature_C" , "rssi"]

[[outputs.influxdb_v2]]
    alias = "raw_data"

  urls = ["http://${DOCKER_INFLUXDB_INIT_HOST}:8086"]

  ## Token for authentication.
  token = "$DOCKER_INFLUXDB_INIT_ADMIN_TOKEN"

  ## Organization is the name of the organization you wish to write to; must exist.
  organization = "$DOCKER_INFLUXDB_INIT_ORG"

  ## Destination bucket to write into.
  bucket = "$DOCKER_INFLUXDB_INIT_BUCKET_RAW"

  insecure_skip_verify = false
  fielddrop = ["*_count", "*_min", "*_max", "*_mean"]

# # Configuration for sending metrics to InfluxDB
[[outputs.influxdb_v2]]
    alias = "aggregated_data"

  urls = ["http://${DOCKER_INFLUXDB_INIT_HOST}:8086"]

  ## Token for authentication.
  token = "$DOCKER_INFLUXDB_INIT_ADMIN_TOKEN"

  ## Organization is the name of the organization you wish to write to; must exist.
  organization = "$DOCKER_INFLUXDB_INIT_ORG"

  ## Destination bucket to write into.
  bucket = "$DOCKER_INFLUXDB_INIT_BUCKET_AGG"

  insecure_skip_verify = false
  fieldpass = ["*_count", "*_min", "*_max", "*_mean"]

I tried to tag, as below, all aggregated data but didn’t work.

    [aggregators.basicstats.tags]
	  influxdb_database = "aggregated"

In my view, this would solve the implementation of 3rd bucket (bucket tags for: inputs.mqtt_consumer, aggregation per hour and aggregation per day) but not working put me in the position to remain with the first 2 buckets.

Thank you

jpowers · January 5, 2023, 4:27pm

Hi,

Are you sending an enormous amount of data that you only want to keep 30 days of raw data? Is it possible to use the aggregation in your queries in InfluxDB, rather than trying to do this at collection time?

tmcatalin · January 7, 2023, 5:08pm

Hi,
In the end I would like to have the 3 buckets to keep a reasonable low size.
The question could be who is doing the aggregation: Telegraf or InfluxDB?
What would be the pros and cons of one or another?
Thanks

ThunderStone · December 23, 2024, 9:07am

Old topic, I know, but there is not a lot of information around this situation to be found it seems. I seem to have this working. You can add name_override to [[aggregators.basicstats]]. Or you can use name_suffix or name_prefix there for example. This change the name of the measurement for the aggregations. Now you can use the new name in name_pass in your outputs.

Update: Did not work with the name_suffix for me. Changed it to name_override and it started working again.

Topic		Replies	Views
BasicStats Aggregator Plugin - Multiple aggregator periods and custom tags Telegraf	2	2146	January 17, 2018
Write data in different buckets using telegraf Telegraf telegraf	2	4452	September 23, 2022
Send data between local buckets using Telegraf Telegraf influxdb , telegraf	3	820	April 3, 2023
Best practice on telegraf to multiple influxdb v2 buckets Telegraf telegraf	2	9802	August 17, 2021
Bucket_Tag to direct metrics to different buckets Telegraf telegraf	14	5965	April 13, 2021

Telegraf: route aggregations to different buckets

Related topics