Telegraf different measurements input from csv howto

aserkin · August 23, 2019, 8:56am

I’m having problems with CSV input to influxdb using telegraf plugins. Seems that file plugin uses csv_header_row_count only from the first file read. But i have multiple csv files with different sets of metrics.
Telegraf when started reads header row from the first file of “files = [”/data/siem/input/*.csv"]" and uses it for the next file which has its own different header row.
Is there a way to change this behavior ?

aserkin · August 23, 2019, 10:19am

Fixed that in source. Just removed the check if columns are already named. Below are diffs for
~/go/src/github.com/influxdata/telegraf/plugins/parsers/csv/parser.go

> 64c64
> <       if len(p.ColumnNames) == 0 {
> ---
> > //    if len(p.ColumnNames) == 0 {
> 84,89c84,89
> <       } else {
> <               // if columns are named, just skip header rows
> <               for i := 0; i < p.HeaderRowCount; i++ {
> <                       csvReader.Read()
> <               }
> <       }
> ---
> > //    } else {
> > //            // if columns are named, just skip header rows
> > //            for i := 0; i < p.HeaderRowCount; i++ {
> > //                    csvReader.Read()
> > //            }
> > //    }

daniel · August 26, 2019, 6:29pm

This is a bug, would you be able to open a new issue on the Telegraf GitHub page?

aserkin · August 27, 2019, 8:49am

Yup, did that

github.com/influxdata/telegraf

input.file plugin behavior with different input metrics files

opened 08:48AM - 27 Aug 19 UTC

closed 03:36PM - 13 Nov 20 UTC

aserkin

bug

### Relevant telegraf.conf: ``` [[inputs.file]] files = ["/data/siem/inpu…t/*.csv"] data_format = "csv" csv_header_row_count = 1 ``` ### System info: ### Steps to reproduce: 1. use two or more csv files for input but set different headers rows there ### Expected behavior: Read header row from every file in the input list and use it for ongoing metrics. ### Actual behavior: Telegraf when started reads header row only from the first file of “files = [”/data/siem/input/*.csv"]" and uses it for the next file which has its own different header row. ### Additional info: I just removed the column names check from the source. But ideally this should be configurable behavior. Below are diffs for ~/go/src/github.com/influxdata/telegraf/plugins/parsers/csv/parser.go ```diff > 64c64 - if len(p.ColumnNames) == 0 { > --- + // if len(p.ColumnNames) == 0 { > 84,89c84,89 - } else { - // if columns are named, just skip header rows - for i := 0; i < p.HeaderRowCount; i++ { - csvReader.Read() - } - } > --- + // } else { + // // if columns are named, just skip header rows + // for i := 0; i < p.HeaderRowCount; i++ { + // csvReader.Read() + // } + // } ```

toddwarrington · April 1, 2022, 1:30pm

Were you able to find a workaround for this problem? I tried creating an input section for each of the files I am reading. That works on the first time through, but subsequent intervals fail.

aserkin · April 4, 2022, 10:05am

Unfortunately not. I preferred to write my own processing script using python & influxdb module. The source files structure appeared too complicated to fight against it with telegraf. Or more likely i have not enough experience with telegraf.

toddwarrington · April 4, 2022, 12:21pm

I was planning to create multiple input sections, which still didn’t work with the previous version. After updating to the current version, it appears to be working just fine now. I am able to read multiple files, all with headers, without issue. Thanks for the response.

Topic		Replies	Views
Telegraf send only last line of CSV to influxDB Telegraf	3	734	October 4, 2022
Telegraf CSV input formatting problem Telegraf influxdb , telegraf , csv	10	2662	February 15, 2023
Inputs.file parsing Telegraf	5	1122	June 17, 2021
Help needed getting a CSV file into Influxdb using Telegraf Telegraf csv	1	551	March 7, 2022
Telegraf inputs sv Telegraf	1	263	April 3, 2023

Telegraf different measurements input from csv howto

Related topics