Read/Write 1 million metrics per second?

jminardi · September 20, 2024, 11:49pm

I am designing a system that should be able to process around 1 million metrics per second. Is this within the realm of possible with telegraf?

I can supply adequate compute resources if that’s all it takes, but I want to know what sort of scaling limitations there may be. Will I need to scale horizontally with more telegraf processes? Can I just run one process if I feed it enough resources?

My input is OPCUA and my output is prometheus.

Henjoe_Gutierrez · September 21, 2024, 4:13am

I supposed that 1 million of data is not coming from a single source right?
That’s too much for a single node opcua server.

If it’s cominf from multiple source you could have setup clustered influxdb and use a load balancer for read /write data to influxdb.

You could scale up horizontal (putting more instance of influxdb behind the load balancer) in order to accomodate the high ingestion of data (1M/secs).

jminardi · September 23, 2024, 10:27pm

It’s not coming from a single source, it’s coming from multiple sources. I am actually writing to Victoria metrics (prometheus compatible TSDB). Our current pipeline is able to handle this many metrics so I know the db can ingest the amount we need.

My question is specifically about telegraf. Can we use that to read OPCUA and write to a prometheus database with 1 million metrics per second? Will we need to run multiple instances of telegraf? Are there any write-ups or case studies of people doing something similar?

srebhan · October 2, 2024, 8:40am

I’d say it’s at least not out-of-range from Telegraf side. It of course depends on if you are doing other processing etc in Telegraf and if your machine can handle the potential peaks…

I would say give it a try! Let us know how it is going and feel free to ask if you need any help!

Topic		Replies	Views
Telegraf conf size Telegraf	3	186	March 7, 2024
Collection took longer than expected; not complete after interval of 500m Telegraf telegraf	6	1869	January 11, 2023
Help in tracking down performance issue in telegraf + opcua input plugin (1s interval not keeping up) Telegraf telegraf , performance	11	122	March 25, 2025
Scalability of a single telegraf docker Telegraf performance	2	336	December 18, 2023
Input and output capacity of telegraf Telegraf telegraf	6	1769	September 18, 2019

Read/Write 1 million metrics per second?

Related topics