Need suggestions to improve query performance on a large dataset (the bottleneck seems to be the CPU of the query thread)

I have installed InfluxDB v1.8.2 on CentOS 7.
My machine has 64 GB of memory and an Intel(R) Core™ i9-9900K 3.6 GHz CPU.
I have one measurement, "stock"."full_sz"."order_sz", which holds about 59,660,943 records per day.

influx
Connected to http://localhost:8086 version 1.8.2
InfluxDB shell version: 1.8.2

> show retention policies
name duration shardGroupDuration replicaN default
---- -------- ------------------ -------- -------
autogen 0s 168h0m0s 1 true
full_sz 0s 720h0m0s 1 false

> show tag keys on "stock" from "stock"."full_sz"."order_sz"
>
// No output, since there are no tags on the measurement

> show field keys on "stock" from "stock"."full_sz"."order_sz"
name: order_sz
fieldKey fieldType
-------- ---------
chan integer
ex string
exchtime integer
md string
price integer
qty integer
seq integer
side string
src string
sym string
type string

> show series on "stock" from "stock"."full_sz"."order_sz"
key

order_sz

> show shards
name: stock
id database retention_policy shard_group start_time end_time expiry_time owners
-- -------- ---------------- ----------- ---------- -------- ----------- ------
623 stock full_sz 623 2020-08-04T00:00:00Z 2020-09-03T00:00:00Z 2020-09-03T00:00:00Z
Currently there is only one shard in the retention policy "full_sz".

The lines written to the measurement look like:
"POST /write?db=stock&p=%5BREDACTED%5D&precision=ns&rp=full_sz&u=writer HTTP/1.1"
order_sz sym="000001",ex="Z",exchtime=20200821093000000i,chan=2001i,seq=131i,price=155800i,qty=100i,side="1",type="1",md="011",src="102" 1597973400000000011
order_sz sym="000001",ex="Z",exchtime=20200821093000000i,chan=2001i,seq=131i,price=156000i,qty=100i,side="1",type="1",md="011",src="102" 1597973400000000012

The timestamp is incremented by 1 ns for points within the same second:
1597973400.000000011
1597973400.000000012

1597973400.000050032
1597973401.000000000
1597973401.000000001
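In other words, each point within the same exchange second gets a unique nanosecond timestamp. A minimal shell sketch of the arithmetic (the epoch second 1597973400 corresponds to 2020-08-21 09:30:00 +08:00):

# base second in nanoseconds plus a per-second sequence number
base_ns=$((1597973400 * 1000000000))
echo $((base_ns + 11))   # 1597973400000000011
echo $((base_ns + 12))   # 1597973400000000012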

I have loaded several days of data; for the single day 2020-08-25 there are about 59,660,943 records.

Then I run the query:
SELECT sym, ex, exchtime, chan, seq, price, qty, side, type FROM stock.full_sz.order_sz WHERE time >= '2020-08-25' AND time < '2020-08-25' + 1d TZ('Asia/Shanghai')

It takes a long time:
> show queries;
qid query database duration status
--- ----- -------- -------- ------
44 SELECT sym, ex, exchtime, chan, seq, price, qty, side, type FROM stock.full_sz.order_sz WHERE time >= '2020-08-25' AND time < '2020-08-25' + 1d TZ('Asia/Shanghai') stock 16m36s running
59 SHOW QUERIES

Finally, the journalctl log shows:
Aug 29 13:18:28 ds26 influxd[18742]: ts=2020-08-29T05:18:28.373587Z lvl=info msg="Executing query" log_id=0Oti26c0000 service=query query="SELECT sym, ex, exchtime, chan, seq, price, qty, side, type FROM stock.full_sz.order_sz WHERE time >= '2020-08-25' AND time < '2020-08-25' + 1d TZ('Asia/Shanghai')"
Aug 29 13:37:45 ds26 influxd[18742]: [httpd] 192.168.0.23 - muse [29/Aug/2020:13:18:28 +0800] "GET /query?chunk_size=10000&chunked=true&db=stock&epoch=ns&p=%5BREDACTED%5D&q=SELECT+%22sym%22%2C%22ex%22%2C%22exchtime%22%2C%22chan%22%2C%22seq%22%2C%22price%22%2C%22qty%22%2C%22side%22%2C%22type%22+FROM+%22stock%22.%22full_sz%22.%22order_sz%22+WHERE+time%3E%3D%272020-08-25%27+AND+time%3C%272020-08-25%27%2B24h+tz%28%27Asia%2FShanghai%27%29&u=muse HTTP/1.1" 200 5104142042 "-" "-" 124fe3dc-e9b7-11ea-a0d5-9c5c8ebc1688 1156799961
The query took 19 minutes 17 seconds.

This exceeds my expectation; I think it should be possible to finish within 5 minutes.
Here are the stats for the full_sz shard:
> show stats
name: shard
tags: database=stock, engine=tsm1, id=580, indexType=tsi1, path=/home/influxdb/var/lib/influxdb/data/stock/full_sz/580, retentionPolicy=full_sz, walPath=/home/influxdb/var/lib/influxdb/wal/stock/full_sz/580
diskBytes fieldsCreate seriesCreate writeBytes writePointsDropped writePointsErr writePointsOk writeReq writeReqErr writeReqOk
--------- ------------ ------------ ---------- ------------------ -------------- ------------- -------- ----------- ----------
35278344877 1200651 4 0 0 0 872263918 87242 0 87242

name: tsm1_engine
tags: database=stock, engine=tsm1, id=580, indexType=tsi1, path=/home/influxdb/var/lib/influxdb/data/stock/full_sz/580, retentionPolicy=full_sz, walPath=/home/influxdb/var/lib/influxdb/wal/stock/full_sz/580
cacheCompactionDuration cacheCompactionErr cacheCompactions cacheCompactionsActive tsmFullCompactionDuration tsmFullCompactionErr tsmFullCompactionQueue tsmFullCompactions tsmFullCompactionsActive tsmLevel1CompactionDuration tsmLevel1CompactionErr tsmLevel1CompactionQueue tsmLevel1Compactions tsmLevel1CompactionsActive tsmLevel2CompactionDuration tsmLevel2CompactionErr tsmLevel2CompactionQueue tsmLevel2Compactions tsmLevel2CompactionsActive tsmLevel3CompactionDuration tsmLevel3CompactionErr tsmLevel3CompactionQueue tsmLevel3Compactions tsmLevel3CompactionsActive tsmOptimizeCompactionDuration tsmOptimizeCompactionErr tsmOptimizeCompactionQueue tsmOptimizeCompactions tsmOptimizeCompactionsActive
----------------------- ------------------ ---------------- ---------------------- ------------------------- -------------------- ---------------------- ------------------ ------------------------ --------------------------- ---------------------- ------------------------ -------------------- -------------------------- --------------------------- ---------------------- ------------------------ -------------------- -------------------------- --------------------------- ---------------------- ------------------------ -------------------- -------------------------- ----------------------------- ------------------------ -------------------------- ---------------------- ----------------------------
1411213589462 0 6969 0 2027161615251 0 0 17 0 367887072257 0 0 871 0 595477493339 0 0 217 0 706036649570 0 0 54 0 0 0 0 0 0

name: tsm1_cache
tags: database=stock, engine=tsm1, id=580, indexType=tsi1, path=/home/influxdb/var/lib/influxdb/data/stock/full_sz/580, retentionPolicy=full_sz, walPath=/home/influxdb/var/lib/influxdb/wal/stock/full_sz/580
WALCompactionTimeMs cacheAgeMs cachedBytes diskBytes memBytes snapshotCount writeDropped writeErr writeOk
------------------- ---------- ----------- --------- -------- ------------- ------------ -------- -------
1407453 157492999 231507354732 0 0 0 0 0 87242

name: tsm1_filestore
tags: database=stock, engine=tsm1, id=580, indexType=tsi1, path=/home/influxdb/var/lib/influxdb/data/stock/full_sz/580, retentionPolicy=full_sz, walPath=/home/influxdb/var/lib/influxdb/wal/stock/full_sz/580
diskBytes numFiles
--------- --------
35278344877 23

name: tsm1_wal
tags: database=stock, engine=tsm1, id=580, indexType=tsi1, path=/home/influxdb/var/lib/influxdb/data/stock/full_sz/580, retentionPolicy=full_sz, walPath=/home/influxdb/var/lib/influxdb/wal/stock/full_sz/580
currentSegmentDiskBytes oldSegmentsDiskBytes writeErr writeOk
----------------------- -------------------- -------- -------
0 0 0 87242

During the query, the top command shows one thread using almost a full CPU core:
top - 13:19:34 up 33 days, 21:14, 3 users, load average: 0.85, 0.53, 0.27
Threads: 347 total, 2 running, 345 sleeping, 0 stopped, 0 zombie
%Cpu(s): 6.6 us, 0.0 sy, 0.0 ni, 93.4 id, 0.0 wa, 0.0 hi, 0.0 si, 0.0 st
KiB Mem : 32614340 total, 262644 free, 9475420 used, 22876276 buff/cache
KiB Swap: 16449532 total, 16352252 free, 97280 used. 21435168 avail Mem

PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
18753 influxdb 20 0 16.9t 12.8g 4.6g R 99.7 41.3 13:27.31 influxd
18789 influxdb 20 0 16.9t 12.8g 4.6g S 2.3 41.3 11:06.70 influxd
18767 influxdb 20 0 16.9t 12.8g 4.6g S 1.0 41.3 15:58.36 influxd
18778 influxdb 20 0 16.9t 12.8g 4.6g S 1.0 41.3 13:15.00 influxd
18754 influxdb 20 0 16.9t 12.8g 4.6g S 0.7 41.3 15:52.25 influxd
18761 influxdb 20 0 16.9t 12.8g 4.6g S 0.3 41.3 13:55.76 influxd
18766 influxdb 20 0 16.9t 12.8g 4.6g S 0.3 41.3 13:04.77 influxd
18777 influxdb 20 0 16.9t 12.8g 4.6g S 0.3 41.3 13:59.17 influxd
1 root 20 0 46396 4764 2752 S 0.0 0.0 0:09.38 systemd
2 root 20 0 0 0 0 S 0.0 0.0 0:00.24 kthreadd
4 root 0 -20 0 0 0 S 0.0 0.0 0:00.00 kworker/0:0H
6 root 20 0 0 0 0 S 0.0 0.0 0:12.47 ksoftirqd/0
7 root rt 0 0 0 0 S 0.0 0.0 0:00.59 migration/0
8 root 20 0 0 0 0 S 0.0 0.0 0:00.00 rcu_bh
9 root 20 0 0 0 0 S 0.0 0.0 0:34.42 rcu_sched
10 root 0 -20 0 0 0 S 0.0 0.0 0:00.00 lru-add-drain
11 root rt 0 0 0 0 S 0.0 0.0 0:02.28 watchdog/0
12 root rt 0 0 0 0 S 0.0 0.0 0:02.16 watchdog/1
13 root rt 0 0 0 0 S 0.0 0.0 0:00.10 migration/1
14 root 20 0 0 0 0 S 0.0 0.0 0:00.20 ksoftirqd/1
16 root 0 -20 0 0 0 S 0.0 0.0 0:00.00 kworker/1:0H

I don't know why this one thread takes so much CPU. Maybe it is decompressing the data? Could the work be done by multiple threads?
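As a possible workaround, I could split the time range on the client side and run several smaller queries concurrently. An untested sketch with curl (auth parameters omitted; I am not sure two concurrent queries over the same shard will actually scale, but it may be worth a try):

# query the two halves of the day in parallel and collect the chunked responses
curl -sG 'http://localhost:8086/query?db=stock&chunked=true&chunk_size=10000&epoch=ns' \
  --data-urlencode "q=SELECT sym, ex, exchtime, chan, seq, price, qty, side, type FROM stock.full_sz.order_sz WHERE time >= '2020-08-25' AND time < '2020-08-25' + 12h TZ('Asia/Shanghai')" > am.json &
curl -sG 'http://localhost:8086/query?db=stock&chunked=true&chunk_size=10000&epoch=ns' \
  --data-urlencode "q=SELECT sym, ex, exchtime, chan, seq, price, qty, side, type FROM stock.full_sz.order_sz WHERE time >= '2020-08-25' + 12h AND time < '2020-08-25' + 1d TZ('Asia/Shanghai')" > pm.json &
wait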

Could someone give some suggestions on how to improve the query performance?
I just want the query to finish within 5 minutes instead of 19 minutes 17 seconds.
Do I need to add a tag, or is there some other suggestion?

thanks!!

What kind of hard drives do you have?

Hello philjb, I have ordinary hard disk drives installed, not SSDs, since the historical data is huge.
But when I use the iotop command to monitor the I/O status, it shows:
(base) [lanzhou@ds26 ~]$ sudo iotop -o
Total DISK READ : 1263.61 K/s | Total DISK WRITE : 0.00 B/s
Actual DISK READ: 1263.61 K/s | Actual DISK WRITE: 0.00 B/s
PID PRIO USER DISK READ DISK WRITE SWAPIN IO> COMMAND
1467 be/4 influxdb 1263.61 K/s 0.00 B/s 0.00 % 0.00 % influxd -config /etc/influxdb/influxdb.conf
The influxd process I/O is not high.

From the top command, it seems one thread's CPU is saturated:
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
1546 influxdb 20 0 17.0t 11.7g 3.6g R 99.9 9.3 11:06.63 influxd
16340 lanzhou 20 0 41.1g 40.9g 40.9g S 2.7 32.6 0:09.72 ProdInfluxT+
16338 lanzhou 20 0 41.1g 40.9g 40.9g S 2.0 32.6 0:06.73 ProdInfluxT+

Do you have any suggestions? Thanks!!

Here is the information from EXPLAIN ANALYZE:

EXPLAIN ANALYZE SELECT sym, ex, exchtime, chan, seq, price, qty, side, type FROM stock.full_sz.order_sz WHERE time >= '2020-08-25' AND time < '2020-08-25' + 1d TZ('Asia/Shanghai')
EXPLAIN ANALYZE


.
└── select
├── execution_time: 37m16.20906922s
├── planning_time: 536.019268ms
├── total_time: 37m16.745088488s
└── build_cursor
├── labels
│ └── statement: SELECT sym::string, ex::string, exchtime::integer, chan::integer, seq::integer, price::integer, qty::integer, side::string, type::string FROM stock.full_sz.order_sz TZ('Asia/Shanghai')
└── iterator_scanner
├── labels
│ └── auxiliary_fields: chan::integer, ex::string, exchtime::integer, price::integer, qty::integer, seq::integer, side::string, sym::string, type::string
└── create_iterator
├── labels
│ ├── measurement: order_sz
│ └── shard_id: 623
├── cursors_ref: 0
├── cursors_aux: 9
├── cursors_cond: 0
├── float_blocks_decoded: 0
├── float_blocks_size_bytes: 0
├── integer_blocks_decoded: 298305
├── integer_blocks_size_bytes: 715065733
├── unsigned_blocks_decoded: 0
├── unsigned_blocks_size_bytes: 0
├── string_blocks_decoded: 238644
├── string_blocks_size_bytes: 389713653
├── boolean_blocks_decoded: 0
├── boolean_blocks_size_bytes: 0
└── planning_time: 535.919868ms

Hi,
Can you show a detailed shard report for the shards in RP full_sz, please? Something like:
influx_inspect report -detailed /home/influxdb/var/lib/influxdb/data/stock/full_sz/623

I don't know the possible cause, but it would be interesting to see the number of .tsm files and some other stats.

Many thanks! Here is the report for RP full_sz, shard 623:
DB RP Shard File Series New (est) Min Time Max Time Load Time
stock full_sz 623 000000896-000000005.tsm 66 66 2020-08-25T01:14:33.5Z 2020-08-25T06:19:21.500003104Z 131.076ms
stock full_sz 623 000000896-000000006.tsm 59 58 2020-08-25T01:14:33.5Z 2020-08-25T06:19:21.500004219Z 118.846201ms
stock full_sz 623 000000896-000000007.tsm 2 1 2020-08-25T01:15:00.50000046Z 2020-08-25T06:19:21.500004219Z 25.361204ms
stock full_sz 623 000001792-000000005.tsm 66 0 2020-08-25T06:19:13.500000068Z 2020-08-26T05:30:10.500006872Z 64.113537ms
stock full_sz 623 000001792-000000006.tsm 59 0 2020-08-25T06:19:13.500000068Z 2020-08-26T05:30:07.500006847Z 47.851088ms
stock full_sz 623 000001792-000000007.tsm 2 0 2020-08-25T06:19:21.500004221Z 2020-08-26T05:30:07.500006847Z 29.978241ms
stock full_sz 623 000002688-000000005.tsm 65 0 2020-08-26T05:29:40.500003362Z 2020-08-27T03:24:50.500004594Z 59.494575ms
stock full_sz 623 000002688-000000006.tsm 60 0 2020-08-26T05:30:03.500006896Z 2020-08-27T03:24:52.50000477Z 88.209547ms
stock full_sz 623 000002688-000000007.tsm 2 0 2020-08-26T05:30:07.50000685Z 2020-08-27T03:24:52.50000477Z 32.421542ms
stock full_sz 623 000003681-000000005.tsm 58 0 2020-08-27T03:24:04.500004937Z 2020-08-28T03:13:10.500005602Z 63.648877ms
stock full_sz 623 000003681-000000006.tsm 63 0 2020-08-27T03:24:42.500002471Z 2020-08-28T03:13:12.50000624Z 77.307414ms
stock full_sz 623 000003681-000000007.tsm 6 0 2020-08-27T03:24:52.500004771Z 2020-08-28T03:13:12.50000624Z 53.602722ms
stock full_sz 623 000004739-000000005.tsm 55 0 2020-08-28T03:11:58.500004512Z 2020-08-31T03:09:44.500004008Z 106.049µs
stock full_sz 623 000004739-000000006.tsm 62 0 2020-08-28T03:13:04.500002877Z 2020-08-31T03:09:44.500003144Z 104.482µs
stock full_sz 623 000004739-000000007.tsm 10 0 2020-08-28T03:13:12.500006243Z 2020-08-31T03:09:44.500003144Z 51.067µs
stock full_sz 623 000005764-000000005.tsm 57 0 2020-08-31T03:08:49.500003915Z 2020-09-01T03:18:52.500001304Z 94.033µs
stock full_sz 623 000005764-000000006.tsm 62 0 2020-08-31T03:09:34.500003656Z 2020-09-01T03:18:53.500002909Z 108.912µs
stock full_sz 623 000005764-000000007.tsm 8 0 2020-08-31T03:09:44.500003145Z 2020-09-01T03:18:53.500002909Z 52.813µs
stock full_sz 623 000006244-000000004.tsm 121 0 2020-09-01T03:17:46.500003463Z 2020-09-01T07:31:03.500000001Z 161.508µs
stock full_sz 623 000006244-000000005.tsm 5 0 2020-09-01T03:18:53.500002911Z 2020-09-01T07:30:00.500001875Z 30.419µs

Summary:
Files: 20
Time Range: 2020-08-25T01:14:33.5Z - 2020-09-01T07:31:03.500000001Z
Duration: 174h16m30.000000001s

Statistics
Series:
- stock (est): 125 (100%)
Total (est): 125

Measurements (est):
- index_sz: 14 (11%)
- order_sz: 11 (8%)
- snap_sz: 88 (70%)
- trade_sz: 12 (9%)

Fields (est):
- index_sz: 14
- order_sz: 11
- snap_sz: 88
- trade_sz: 12

Tags (est):
Completed in 796.297896ms

The shard contains 4 measurements; I don't know whether this has an impact on performance?

You might see better performance if you change some of your field values to tags. Since fields are not indexed, they require scans to evaluate queries. Having tags will also increase your series count (larger series cardinality is not always better). Influx parallelizes on series, so you might see better CPU usage across cores with a few more series. Generally, you want a tag to have a fixed (relatively small) number of values. For example, the "type" field might be a good candidate for a tag if it has a well-defined list of possible values, but "price" is not a good candidate for a tag.

InfluxDB key concepts | InfluxDB OSS 1.8 Documentation is a good resource.
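For example, if "ex", "side", and "type" each have a small, fixed set of values (a sketch only; verify the cardinality against your own data first), the write and the query could look like:

order_sz,ex=Z,side=1,type=1 sym="000001",exchtime=20200821093000000i,chan=2001i,seq=131i,price=155800i,qty=100i,md="011",src="102" 1597973400000000011

SELECT sym, exchtime, chan, seq, price, qty FROM stock.full_sz.order_sz WHERE "type" = '1' AND time >= '2020-08-25' AND time < '2020-08-25' + 1d TZ('Asia/Shanghai')

Note that tag values are always strings, and existing points cannot be converted in place; you would have to re-write the historical data with the new schema.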

Got it! Thanks for your suggestion! I will give it a try!
