Hi,
We are working with Telegraf to collect data from a JSON file through the "tail" input plugin and write it to InfluxDB. Each point has 10 tags and 350 fields, and 80 points are appended to the JSON file every second. Everything runs on a single server.
Our Telegraf configuration is:
- interval = 5s
- flush_interval = 5s
- InfluxDB timeout = 5s
- time format is unix_ns
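For reference, this maps to a telegraf.conf roughly like the sketch below; the file path, JSON time key, and database name are placeholders rather than our actual values:

```toml
# Sketch of the setup described above; paths and key names are placeholders.
[agent]
  interval = "5s"
  flush_interval = "5s"

[[inputs.tail]]
  files = ["/var/log/data.json"]   # placeholder path
  watch_method = "inotify"
  data_format = "json"
  json_time_key = "timestamp"      # placeholder key name
  json_time_format = "unix_ns"

[[outputs.influxdb]]
  urls = ["http://127.0.0.1:8086"]
  database = "metrics"             # placeholder database name
  timeout = "5s"
```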
We are facing some data loss: out of roughly 3 lakh (300,000) records, about 1,000 are lost.
I did not find any drops on the InfluxDB side (checking the _internal database).
I am not able to figure out the reason for the loss. The data load is only moderate according to the InfluxDB hardware sizing guidelines.
We cannot afford to lose data. Please help us find the reason for the data loss. Is Telegraf the culprit?
I would start by turning on the internal plugin and then checking the internal_write measurement for the metrics_dropped field. It is tagged per output plugin; does it remain at zero?
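Enabling it is a one-line addition to telegraf.conf (collect_memstats is optional):

```toml
# Collect Telegraf's own runtime metrics, including the internal_write
# measurement, which reports metrics_dropped per output plugin.
[[inputs.internal]]
  collect_memstats = true
```

You can then check it with a query like `SELECT max("metrics_dropped") FROM "internal_write" GROUP BY "output"` against the database Telegraf writes to.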
Right now we are neither rotating nor truncating the file. We will run the test for 2-3 hours and check the data. We are still in the initial phases.
Is there any chance that Telegraf misses data during collection? Can the input plugin hang while the output plugin is dispatching data?
We are using "inotify" as the watch method for the tail plugin.
We are still not able to find the reason for the drops. Is there a specific Telegraf configuration recommended for this kind of load?
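For example, we are wondering whether the agent buffer settings need tuning for our rate. A sketch of what we have in mind is below; the values are guesses on our part, not something we have validated:

```toml
[agent]
  # At 80 points/s with a 5 s interval, ~400 metrics arrive per flush.
  # Telegraf's defaults are metric_batch_size = 1000 and
  # metric_buffer_limit = 10000; raising the buffer limit gives more
  # headroom if a flush is slow or times out, so metrics are retried
  # instead of dropped.
  metric_batch_size = 1000
  metric_buffer_limit = 50000
```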