OOM issues with Influx

gcyre · May 15, 2017, 2:40pm

Hello

We’ve had Influx running in production for a few months and recently started having issues with OOM-killer shutting down the influx service.

The server is decently powered, dual quad core CPUs/w 128GB of RAM and lots of disk. We have about 800 measurements with roughly 800K series. Is there any documentation I can look at that will give me some areas to look at to determine the health of our installation?

Not sure where to start to troubleshoot this.

thanks
Garry

jackzampolin · May 15, 2017, 5:46pm

@gcyre do you have a graph of memory utilization on the box? Also are there any other processes running on that host?

gcyre · May 15, 2017, 6:21pm

@jackzampolin

Here are 2 graphs I’m looking at

gcyre · May 15, 2017, 6:25pm

the only services running on this server are Influxdb, telegraf and kapacitor but we haven’t implemented any tick scripts yet.

gcyre · May 16, 2017, 7:58pm

I’ve been digging into the issue more and created some graphs based on the influx internal metrics, from what I can see there hasn’t really been any pressure on memory and cpu. I’m beginning to think there isn’t an issue and the problems we noticed are an isolated issue.

Is there a way to limit the information being logged to influxd.log? its being filled with [httpd] messages, all I would really want to see is any errors that are happening

thanks
Garry

jackzampolin · May 17, 2017, 3:23pm

@gcyre you can always use some grep-foo to look for 500’s: journalctl -u influxdb | grep -v " 500 "

Topic		Replies	Views
Influxd 2.4 : killed by oom InfluxDB 2	0	346	September 6, 2023
Sudden Spike in query requests killed server InfluxDB 2 influxdb , telegraf , grafana	0	412	April 12, 2022
[InfluxDB 1.8] Out of memory every 45-60 days Store influxdb	1	1165	December 23, 2020
Influxdb 1.7 - high ram - No queries influxdb	2	601	May 20, 2021
Influx crashes/stop showing data and we can't query InfluxDB 2 windows	5	828	August 2, 2022

OOM issues with Influx

Related topics