InfluxDB query to get distinct values from a table

sahebdatta · June 30, 2021, 9:15am

Hi Guys,

I am working on visualizing InfluxDB data in Grafana. I am having a table that shows the status of the devices connected to my system. The table is updated values every 1 minute (ie my software checks for connection status every minute for all the 500 devices and inserts them in the table those devices that are offline). Now I want to display the result. I display results for the last 1 minute using the query

SELECT * FROM "3_weeks"."STATUS" WHERE ("Connection" =~ /^$IPLike^*/ AND "Status" = 'Offline') AND time>now()-60s GROUP BY (*)

My table looks like this

Timestamp | Connection    | IPLike      | EdgeDeviceID | Status
================================================================
12345679  | 192.168.7.14  | 192.168.7.  | ED1          | Offline
12345678  | 192.168.14.15 | 192.168.14. | ED2          | Offline
12345667  | 192.168.14.15 | 192.168.14. | ED2          | Offline

Sometimes, the query outputs the same device twice.

Now I figured out I can restrict this by using the distinct command.

SELECT distinct("Connection") FROM "3_weeks"."STATUS" WHERE ("Connection" =~ /^$IPLike^*/ AND "Status" = 'Offline') AND time>now()-60s

This solves the repetition problem. But I don’t see the other columns. Can somebody help me with this, please?

Thanks in advance.

Cheers,
SD

Anaisdg · July 1, 2021, 12:52am

Hello @sahebdatta,
Apologies in advanced as my InfluxQL is rusty since I mostly use Flux now,
but have you tried selecting for the other fields as well?

SELECT distinct("Connection"), "IPLike", "EdgeDeviceID", "Status" FROM "3_weeks"."STATUS" WHERE ("Connection" =~ /^$IPLike^*/ AND "Status" = 'Offline') AND time>now()-60s

Or whatever your fields are?

sahebdatta · July 1, 2021, 6:32am

Hi @Anaisdg,

Thanks for the response. I also assumed that this way it should work but unfortunately this returns no results. It shows an error that says

InfluxDB Error: aggregate function distinct() cannot be combined with other functions or fields

My query looks like this

SELECT distinct("Connection"), "EdgeDeviceID" FROM "3_weeks"."STATUS" WHERE ("Connection" =~ /^$IPLike^*/ AND "Status" = 'Offline') AND time>now()-60s

Anaisdg · July 1, 2021, 2:51pm

Darn, then I’m not sure how you would do this with InfluxQL in 1.x other than to perform a join with kapacitor. What version of Influx are you using? We could do this with Flux. Otherwise I would suggest making an issue.

sahebdatta · July 2, 2021, 4:43am

InfluxDB version used: 1.8.x

Anaisdg · July 2, 2021, 5:50pm

If you enable Flux,
Your query would look something like:

from(bucket: "my_db")
  |> range(start: v.timeRangeStart, stop: v.timeRangeStop)
  |> filter(fn: (r) => r["_measurement"] == "my_meas")
  |> filter(fn: (r) => r["_field"] == "Connection" or r["_field"] == "IPLike" or r["_field"] == "EdgeDeviceID" or r["_field"] == "Status" )
  |> schema.fieldsAsCols()
  |> distinct(column: "Connection")

And the result will be a table with all of your fields as columns and it will be filtered for the distinct values of the Connection column and the values for the other fields at that those distinct timestamps.

Topic		Replies	Views
I need help with Flux InfluxDB 2	2	456	February 25, 2021
InfluxDB Flux - get count of distinct occurrences Fluxlang influxdb	4	8452	July 30, 2020
Sum col A where col B = X Fluxlang influxdb , flux	2	476	August 25, 2021
Query a counter of distinct values Telegraf influxdb	5	1725	July 19, 2019
InfluxDB Flux query how to get distinct values from query Fluxlang flux	2	1053	November 15, 2022

InfluxDB query to get distinct values from a table

Related topics