Disk I/O, CPU and Swap

MaxDiOrio · April 1, 2019, 1:08pm

Hi all,

I’m still pretty new to Influx - only have had it running a few months. We’re using it to do remote writes from several Prometheus instances in our Kubernetes clusters for long term storage. I have RP’s and CQ’s set up to downsample the data and everything is working well for the most part.

What I’ve been struggling with is InfluxDB’s use of memory, cpu and disk I/O.

Influx is running in a VMWare VM, with 8vCPU’s and 64GB of RAM. The disk is backed by a NetApp all flash storage array which is pretty screaming fast for our modest infrastructure. I’ve switched all the DB’s over to using TSI1 to help with memory.

Influx usually runs fine for a day or two. Then I start seeing CPU load go from 1-2 up to 8+ and during this time, Swap usage goes up, taking up the full 4GB of swap space defined, and suddenly disk I/O for reads goes through the roof.

I also see tsm.tmp files being generated and not being consolidated at all. I’m currently sitting with 116 .tsm.tmp files, and they are almost all from the same database, and in fact, most from the same shard.

I’m a little stumped as to why this is happening or how to troubleshoot it. Any help is greatly appreciated.

Thanks!

Max

MarcV · April 5, 2019, 8:31am

Hi @MaxDiOrio

there has been a similar issue in 1.2.4 ,
what is your version ?

this is the link to the issue ( which is solved )

github.com/influxdata/influxdb

InfluxDB 1.2.4 leaving .tsm.tmp files, consuming all disk

opened 08:07AM - 26 Jun 17 UTC

closed 02:01PM - 12 Oct 17 UTC

forsberg

### Bug report I seem to be hitting #7712, although with InfluxDB 1.2.4. So o…pening new issue as requested. __System info:__ [Include InfluxDB version, operating system name, and other relevant details] * InfluxDB 1.2.4-1 * Linux ospmon01 3.16.0-4-amd64 #1 SMP Debian 3.16.43-2 (2017-04-30) x86_64 GNU/Linux * Debian GNU/Linux 8.8 (jessie) * 12 cores (24 HTs), 48G of memory __Steps to reproduce:__ 1. Start InfluxDB 2. Wait a week. 3. Have a look at available disk. It has decreased significantly. 4. Restart InfluxDB. Disk size decreases significantly. Attaching debug info (note: If I try to do the curl provided as example, I just get back "no such profile all") as well as output of 'find /var/lib/influxdb'. [influxdb-debug.ospmon01.1498462213.tar.gz](https://github.com/influxdata/influxdb/files/1101430/influxdb-debug.ospmon01.1498462213.tar.gz) [var-lib-influxdb-files.txt](https://github.com/influxdata/influxdb/files/1101444/var-lib-influxdb-files.txt)

MaxDiOrio · April 5, 2019, 1:09pm

I’m running 1.7.5 and have always been newer than 1.2.4

Lostboy · April 5, 2019, 1:50pm

Do you have subscriptions setup ?
I have seen inadvertent loops set up that exhibit this behavior.

How about Kapacitor ?
Sometimes when Influx gets restarted it can create extra subscription jobs.
It doesn’t hurt to go in and delete all the Kapacitor made subscriptions as valid ones will be recreated.

MaxDiOrio · April 5, 2019, 3:30pm

No subscriptions anywhere. Really don’t use Kapacitor at all at this point.

Topic		Replies	Views
InfluxDB write IO - sky rocketing on fast hardware influxdb	0	426	September 14, 2020
InfluxDB as Prometheus backend: suggested configuration?	1	1044	April 7, 2020
Disk usage grows rapidly after a few days, cleans up with restart	6	1072	October 9, 2020
Internal data and cache management Store influxdb , time-series	1	4437	December 7, 2018
InfluxDB memory usage Store	0	3085	August 14, 2018

Disk I/O, CPU and Swap

Related topics