Telegraf not collecting on all snmp inputs

#1

Hi,

i am having a issue with telegraf not reporting all my snmp inputs. Its odd as its working for a few, but not all??

When I am in grafana and I select the tag value for the hostname only 5 show of 38. Access is ok as i can perform a snmp walk to the device, but i can work out why or how to check why the others are not reporting

example of the snmp inputs. IP and community changes for the purpose of this question, but this is what i have in the telegraf.conf file.

All data is being written into influx dub as the output

[[inputs.snmp]]
agents = [ “10.0.0.1:161” ]
version = 2
name = "snmp"
community = “example”

[[inputs.snmp]]
agents = [ “10.0.0.2:161” ]
version = 2
name = "snmp"
community = “example”

[[inputs.snmp]]
agents = [ “10.0.0.3:161” ]
version = 2
name = "snmp"
community = “example”

[[inputs.snmp]]
agents = [ “10.0.0.4:161” ]
version = 2
name = "snmp"
community = “example”

[[inputs.snmp]]
agents = [ “10.0.0.5:161” ]
version = 2
name = "snmp"
community = “example”

[[inputs.snmp]]
agents = [ “10.0.0.6:161” ]
version = 2
name = "snmp"
community = “example”

[[inputs.snmp.field]]
name = "hostname"
oid = "RFC1213-MIB::sysName.0"
is_tag = true

[[inputs.snmp.table]]
name = "snmp"
inherit_tags = [ “hostname” ]
oid = “IF-MIB::ifXTable”

[[inputs.snmp.table.field]]
name = "ifName"
oid = "IF-MIB::ifName"
is_tag = true

Guys any ideas where or how i can troubleshoot this??

Thanks

Rob

#2

Do you have the [[inputs.snmp.table]] section defined for each input? Maybe you can upload the full config?

#3

Hi Daniel,

I only have 1 inputs.snmp.table defined, so i need this for every inmputes snmp defined?

I tried to create a new conf file in telegraph so I now have 2 files telegraf.conf and telegraf.conf.rpmnew
im assuming it will use telegraf.conf as default?

i have the default config and have ammended the snmp input. I have included the global settings, influxdb output and the snmp input below. IP and community have been changed.

# Telegraf Configuration
#
# Telegraf is entirely plugin driven. All metrics are gathered from the
# declared inputs, and sent to the declared outputs.
#
# Plugins must be declared in here to be active.
# To deactivate a plugin, comment out the name and any variables.
#
# Use 'telegraf -config telegraf.conf -test' to see what metrics a config
# file would generate.
#
# Environment variables can be used anywhere in this config file, simply prepend
# them with $. For strings the variable must be within quotes (ie, "$STR_VAR"),
# for numbers and booleans they should be plain (ie, $INT_VAR, $BOOL_VAR)


# Global tags can be specified here in key="value" format.
[global_tags]
  # dc = "us-east-1" # will tag all metrics with dc=us-east-1
  # rack = "1a"
  ## Environment variables can be used as tags, and throughout the config file
  # user = "$USER"


# Configuration for telegraf agent
[agent]
  ## Default data collection interval for all inputs
  interval = "20s"
  ## Rounds collection interval to 'interval'
  ## ie, if interval="10s" then always collect on :00, :10, :20, etc.
  round_interval = true

  ## Telegraf will send metrics to outputs in batches of at most
  ## metric_batch_size metrics.
  ## This controls the size of writes that Telegraf sends to output plugins.
  metric_batch_size = 1000

  ## For failed writes, telegraf will cache metric_buffer_limit metrics for each
  ## output, and will flush this buffer on a successful write. Oldest metrics
  ## are dropped first when this buffer fills.
  ## This buffer only fills when writes fail to output plugin(s).
  metric_buffer_limit = 10000

  ## Collection jitter is used to jitter the collection by a random amount.
  ## Each plugin will sleep for a random time within jitter before collecting.
  ## This can be used to avoid many plugins querying things like sysfs at the
  ## same time, which can have a measurable effect on the system.
  collection_jitter = "0s"

  ## Default flushing interval for all outputs. You shouldn't set this below
  ## interval. Maximum flush_interval will be flush_interval + flush_jitter
  flush_interval = "10s"
  ## Jitter the flush interval by a random amount. This is primarily to avoid
  ## large write spikes for users running a large number of telegraf instances.
  ## ie, a jitter of 5s and interval 10s means flushes will happen every 10-15s
  flush_jitter = "0s"

  ## By default or when set to "0s", precision will be set to the same
  ## timestamp order as the collection interval, with the maximum being 1s.
  ##   ie, when interval = "10s", precision will be "1s"
  ##       when interval = "250ms", precision will be "1ms"
  ## Precision will NOT be used for service inputs. It is up to each individual
  ## service input to set the timestamp at the appropriate precision.
  ## Valid time units are "ns", "us" (or "µs"), "ms", "s".
  precision = ""

  ## Logging configuration:
  ## Run telegraf with debug log messages.
  debug = true
  ## Run telegraf in quiet mode (error log messages only).
  quiet = false
  ## Specify the log file name. The empty string means to log to stderr.
  logfile = "/var/log/telegraf.log"

  ## Override default hostname, if empty use os.Hostname()
  hostname = ""
  ## If set to true, do no set the "host" tag in the telegraf agent.
  omit_hostname = false


###############################################################################
#                            OUTPUT PLUGINS                                   #
###############################################################################

# Configuration for influxdb server to send metrics to
[[outputs.influxdb]]
  ## The HTTP or UDP URL for your InfluxDB instance.  Each item should be
  ## of the form:
  ##   scheme "://" host [ ":" port]
  ##
  ## Multiple urls can be specified as part of the same cluster,
  ## this means that only ONE of the urls will be written to each interval.
  # urls = ["udp://localhost:8089"] # UDP endpoint example
  urls = ["http://localhost:8086"] # required
  ## The target database for metrics (telegraf will create it if not exists).
  database = "telegraf" # required

  ## Name of existing retention policy to write to.  Empty string writes to
  ## the default retention policy.
  retention_policy = ""
  ## Write consistency (clusters only), can be: "any", "one", "quorum", "all"
  write_consistency = "any"

  ## Write timeout (for the InfluxDB client), formatted as a string.
  ## If not provided, will default to 5s. 0s means no timeout (not recommended).
  timeout = "5s"
  # username = "telegraf"
  # password = "metricsmetricsmetricsmetrics"
  ## Set the user agent for HTTP POSTs (can be useful for log differentiation)
  # user_agent = "telegraf"
  ## Set UDP payload size, defaults to InfluxDB UDP Client default (512 bytes)
  # udp_payload = 512

  ## Optional SSL Config
  # ssl_ca = "/etc/telegraf/ca.pem"
  # ssl_cert = "/etc/telegraf/cert.pem"
  # ssl_key = "/etc/telegraf/key.pem"
  ## Use SSL but skip chain & host verification
  # insecure_skip_verify = false

  ## HTTP Proxy Config
  # http_proxy = "http://corporate.proxy:3128"

  ## Optional HTTP headers
  # http_headers = {"X-Special-Header" = "Special-Value"}

  ## Compress each HTTP request payload using GZIP.
  # content_encoding = "gzip"


# # Configuration for Amon Server to send metrics to.

# # Retrieves SNMP values from remote agents
  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"
  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"


#   ## SNMP community string.
#   community = "public"
#
#   ## The GETBULK max-repetitions parameter
#   max_repetitions = 10
#
#   ## SNMPv3 auth parameters
#   #sec_name = "myuser"
#   #auth_protocol = "md5"      # Values: "MD5", "SHA", ""
#   #auth_password = "pass"
#   #sec_level = "authNoPriv"   # Values: "noAuthNoPriv", "authNoPriv", "authPriv"
#   #context_name = ""
#   #priv_protocol = ""         # Values: "DES", "AES", ""
#   #priv_password = ""
#
#   ## measurement name
#   name = "system"
    [[inputs.snmp.field]]
     name = "hostname"
     oid = "RFC1213-MIB::sysName.0"
     is_tag = true
#
    [[inputs.snmp.table]]
#     ## measurement name
       name = "snmp"
       inherit_tags = [ "hostname" ]
       oid = "IF-MIB::ifXTable"

    [[inputs.snmp.table.field]]
       name = "ifName"
       oid = "IF-MIB::ifName"
       is_tag = true
#
#4

Yes, but if it is the same for each input you can add all the agent addresses to a single input:

  [[inputs.snmp]]
    agents = [ "xx.xx.xx.xx:161", "xx.xx.xx.xx:161" ]
    version = 2
    name = "snmp"
    community = "example"

    [[inputs.snmp.field]]
     name = "hostname"
     oid = "RFC1213-MIB::sysName.0"
     is_tag = true
#
    [[inputs.snmp.table]]
#     ## measurement name
       name = "snmp"
       inherit_tags = [ "hostname" ]
       oid = "IF-MIB::ifXTable"

    [[inputs.snmp.table.field]]
       name = "ifName"
       oid = "IF-MIB::ifName"
       is_tag = true
1 Like
#5

Ah ok!

let me try that.

Thanks Daniel

#6

Worked a treat! Thanks Daniel!

#7

Daniel can I ask another question??

If I want to collect a different SNMP value say up/down status or cpu can i just add another inputs.snmp.table and inputs.snmp.field witht he relevant oid? or do i need to have a new measurement and new inputs.snmp?

Thanks

Rob

#9

Yes, you can add another table or field.