The internet / GitHub is full of examples and issues.
Could someone kindly guide me to the best approach?
I've tried a lot of approaches, and since telegraf --test mode doesn't work for this case, I'm at a dead end with this topic.
What metrics do you want to extract from the logs or see in Grafana (number of response codes, bytes received, ...)? I did a lot of work on this topic in the last month, but I found many obstacles for a busy site, so it is not easy to say what exactly you have to do...
What is your actual problem now, and what metrics are you missing? What metrics does the logparser extract, and what do you already have in your DB?
Thank you for the support.
My main focus is on the KPIs below: response codes (e.g. HTTP 1xx, 2xx, 3xx, 4xx, 5xx, etc.).
TCP, UDP, dropped packets, errors and traffic are already monitored.
I don't know if I can extract other relevant information from the NGINX open source version.
I've also attached a screenshot of what I already have in the dashboard regarding NGINX.
There are many examples using non_negative_derivative.
Regarding the logparser, nothing so far, because I've tried many implementations of [[inputs.logparser]] with no output in --test mode and nothing in the InfluxDB database, and I'm trying to understand better what I could do to get those metrics into InfluxDB.
What I want to have is related to response codes, if possible.
I started from this telegraf.conf example from the internet:
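(The example itself isn't reproduced here, but for illustration a minimal [[inputs.logparser]] block for the stock "combined" access log format usually looks something like the sketch below; the file path and measurement name are my assumptions, so adjust them to your setup.)

[[inputs.logparser]]
  ## nginx access log(s) to tail
  files = ["/var/log/nginx/access.log"]
  from_beginning = false
  [inputs.logparser.grok]
    ## predefined Telegraf pattern for the default "combined" format
    patterns = ["%{COMBINED_LOG_FORMAT}"]
    measurement = "nginx_access_log"

If even this produces nothing, the format configured in nginx is probably not the plain "combined" one.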
In fact you can extract every piece of information from the access log, but then you need to write your own pattern matching for your log_format. We have added fields to the nginx log_format, so my patterns will not solve all your problems, but they are a start. I had a steep learning curve with the Grok stuff. Also, you can add the response code as a value or as a tag, depending on what you want to achieve. I use the response code as a tag and have values for response_time etc.
So first you need to check if your nginx log_format has all the information you need:
e.g.:
log_format compression '$remote_addr - $remote_user [$time_local] '
                       '"$request" $status $bytes_sent '
                       '"$http_referer" "$http_user_agent" +$request_time $upstream_response_time $pipe+ "$gzip_ratio" '
                       '"$host~$is_mobile $is_bot $sent_http_x_cache"';
An (incomplete) parser pattern for my use case can look like this:
%{CLIENT:client_ip:drop} %{NOTSPACE:ident:drop} %{NOTSPACE:auth:drop} \[%{HTTPDATE:ts:ts-httpd}\] "%{WORD:http_method:drop} %{PATHLEVEL1:pathlevel1:tag}(/|\?)?.* HTTP/%{NUMBER:http_version:drop}" %{RESPONSE_CODE} (?:%{NUMBER:resp_bytes:int}|-) "%{DATA:referer:drop}" "%{DATA:user_agent:drop}" \+(?:%{NUMBER:request_time:float}|-)
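(For context on where such a pattern lives in telegraf.conf: it goes into the grok section of the logparser input, and any sub-pattern Telegraf does not ship, like PATHLEVEL1 above, has to be declared in custom_patterns. CLIENT, HTTPDATE and RESPONSE_CODE are already predefined by Telegraf's grok parser. The sketch below is only an assumption about how that might look; the PATHLEVEL1 regex in particular depends on which part of the URL you want as a tag.)

  [inputs.logparser.grok]
    measurement = "proxy_access_log"
    ## paste the full pattern from above here (shortened in this sketch)
    patterns = ['%{CLIENT:client_ip:drop} ... %{RESPONSE_CODE} ...']
    ## sub-patterns not shipped with Telegraf; this PATHLEVEL1 regex is an assumption
    custom_patterns = '''
    PATHLEVEL1 /[^/?\s]*
    '''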
For testing I use the Telegraf file output (and disable the InfluxDB output) to see what gets sent by Telegraf. Then you have to test pattern by pattern from the beginning to see if it matches correctly. In the debug output of telegraf.log you can see the "Grok no match" lines, but it gives no specific error message, so you need to find out yourself what the problem with the pattern is... Regex hell.
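(For what it's worth, the debug setup I mean is nothing more than a block like the one below; set debug = true in the [agent] section as well if you want to see the grok "no match" lines in telegraf.log. The output file path is just an example.)

## temporary debug output; disable [[outputs.influxdb]] while testing
[[outputs.file]]
  files = ["stdout", "/tmp/telegraf-test.out"]
  data_format = "influx"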
So I extract for example domain names, parts of URLs and so on, depending on what information I need from the logs.
In Grafana I created tables with a query like this:
SELECT count("resp_bytes") FROM "proxy_access_log" WHERE ("domain" =~ /^$domain$/ AND "cache_status" =~ /^$cache_status$/ AND "mobile" =~ /^$mobile$/ AND "bot" =~ /^$bot$/) AND $timeFilter GROUP BY "response_code"
I found that the basicstats aggregator could help in making some basic counts and aggregations for you, but not on response codes.
Another way would be to use response_code as a value together with a value_counter plugin, like in this pull request:
So far I haven't managed to test this aggregator, because I went the way of counting the resp_bytes field and grouping it by response_code to get the desired information.
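(If that aggregator ever lands in your Telegraf build, my understanding is that it would be configured roughly like the sketch below. Note that it can only count response_code if it is stored as a field/value, not as a tag, and the plugin and option names here follow the pull request as I read it, so treat them as assumptions.)

[[aggregators.valuecounter]]
  ## emit one count per distinct field value every period
  period = "30s"
  drop_original = false
  ## field whose distinct values are counted (must be a field, not a tag)
  fields = ["response_code"]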
So if your format is not "combined", your log fields, the type of timestamp and the like may look different, so you need to check whether the predefined Grok patterns really match your log style.
Read the bottom lines of the Telegraf Grok document and you will see that COMBINED_LOG_FORMAT and COMMON_LOG_FORMAT differ:
Also, you or other sysops might have configured other custom log_formats; I am not too experienced here, but as you can see in the example above my log_format is called "compression", not "combined", and is referenced like this:
access_log /var/log/nginx/wordpress.access.log compression;
So you need to find the "log_format main" in your configs, e.g.:
grep -R "log_format main" /etc/nginx/*
@fchiorascu
I see that this issue is really old, however I am stuck in exactly the same position... Your last post is also about a totally different aspect of monitoring: it only ships the nginx engine status, i.e. how many connections are established, handled and waiting; nothing related to log parsing, which is a totally different realm.
I wonder if anyone out there has had the same issue as me and has successfully log-parsed a specific part of the data for monitoring, in my case the response code, for a simple graph to watch the values and set alerts based on them...