I need help understanding the CPU usage trend of InfluxDB.
In the picture below, the CPU usage rises to 68% every 2 minutes. Telegraf runs at a 10-second interval and sends data to InfluxDB every 10 seconds.
What I don't understand is why the spikes happen every 2 minutes.
Also, does InfluxDB use RAM to cache recently written and read data for fast access? If so, how can I limit the size of that cache?
First off, regarding CPU usage, take a look at some of the _internal metrics (which you should enable for collection) to see whether garbage collection or something else is happening at those times.
There are situations where compactions will consume CPU when moving data from one level of compression to another. That said, the spikes you're seeing are quite drastic, and I'd look at what else is happening on the box during those intervals.
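If it helps, here is a quick way to poke at what the _internal database exposes once self-monitoring is enabled (see the config further down in this thread). Measurement names such as tsm1_engine come from my 1.x install and may differ by version, so treat this as a sketch run from the influx CLI:

USE _internal
SHOW MEASUREMENTS
SHOW FIELD KEYS FROM "tsm1_engine"

If compactions are the cause, the compaction- and cache-related stats in there should line up timestamp-wise with your CPU spikes.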
As for the in-mem store, the distinction between the "wal" and "disk" is something to note with InfluxDB. If properly configured (you might want to paste your config here, masking any sensitive info if necessary), all writes go into the in-mem index (backed by the write-ahead log, the 'wal') and are flushed to disk asynchronously. Your writes are therefore ack'd once the WAL write completes and are added to .tsm files on disk asynchronously.
The compactions from one level of TSM to another also happen asynchronously and concurrently with ongoing reads/writes. The InfluxDB team has made major improvements to fork this work off from the incoming writes so that locking is not a problem under heavy write loads.
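For reference, the memory used by that in-mem cache can be bounded in the [data] section of influxdb.conf. The option names below exist in the 1.x config; the sizes shown are roughly the shipped defaults, so double-check them against your version:

[data]
dir = "/var/lib/influxdb/data" # where compacted .tsm files live
wal-dir = "/var/lib/influxdb/wal" # the write-ahead log is persisted here, on disk
cache-max-memory-size = 1048576000 # ~1GB cap on the in-memory cache; writes error out above this
cache-snapshot-memory-size = 26214400 # ~25MB threshold at which the cache is snapshotted to a new .tsm file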
Can I limit the amount of main memory given to this in-mem store? How much does it generally use? And where is the WAL stored, in main memory?
"This in-mem store holds a copy of all the data in the WAL and an index of the data in the TSM files, and serves read queries. Writes go to the WAL first and are then sent to the TSM files." Am I correct here?
I've asked a lot of questions in this reply, sorry about that.
Those are the InfluxDB collector measurements coming from Telegraf; they're close, but not the internal ones I was referring to.
Instead, this is configured on the InfluxDB side itself. For example, my config is at /etc/influxdb/influxdb.conf:
###
### Controls the system self-monitoring, statistics and diagnostics.
###
### The internal database for monitoring data is created automatically if
### it does not already exist. The target retention policy within this database
### is called 'monitor' and is also created with a retention period of 7 days
### and a replication factor of 1, if it does not exist. In all cases this
### retention policy is configured as the default for the database.
[monitor]
store-enabled = true # Whether to record statistics internally.
store-database = "_internal" # The destination database for recorded statistics
store-interval = "10s" # The interval at which to record statistics
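Once that is enabled, you can query the recorded stats directly. A minimal sketch, assuming the runtime measurement is present (its fields mirror Go's runtime.MemStats, so names like NumGC and HeapInUse may vary slightly by version):

SELECT non_negative_derivative(last("NumGC"), 1m) AS gc_per_min,
       mean("HeapInUse") AS heap_in_use
FROM "_internal"."monitor"."runtime"
WHERE time > now() - 1h
GROUP BY time(1m)

If gc_per_min or heap_in_use jumps on the same 2-minute cadence as your CPU graph, that points at garbage collection or cache snapshots rather than your query load.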
@Luv We mmap recent data and let the OS page it out as needed, so the reported usage can look higher than it actually is. But ~2GB of RAM is not an unusual amount of memory for the process to use.
You want to look at RSS more than virtual memory; the latter can look inflated, per @jackzampolin's point.
Overall though, 600MB is a pretty small amount of memory to consume, unless you're really not sending much data. For example, our machine processes about 500k writes/sec and sits at ~46GB RSS.
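If you want a rough split between the Go heap and everything else, the _internal runtime stats can help (again assuming _internal is enabled; Sys and HeapInUse come from Go's runtime.MemStats):

SELECT last("HeapInUse") AS heap_in_use, last("Sys") AS go_sys
FROM "_internal"."monitor"."runtime"
WHERE time > now() - 10m

Whatever RSS shows beyond go_sys is largely mmap'd TSM data that the OS can page out under memory pressure, per @jackzampolin's note above.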