Looking for some insights as I am currently designing a monitoring platform for some nano computers (ARM), with SSD disks. We currently have a few hundred devices on field with very random Internet connection (some of which will never connect). Our frontend is Graphite and don’t plan to replace it.
Went out looking for collectd, statd, but telegraf/influxdb seems to do the job quite well (small fingerprint! <3).
Now the tricky bit seems to be the very unreliable Internet connections for these devices. Most of the time connecting once a month. So I’m struggling to find a way to store all these metrics, while controlling the db size (good point to influxdb with its retention policy) and uploading everything to our Graphite instance when connection is back. Maybe MQTT/AMQP would do the trick? But from what I read, data is kept either in memory or in a file on disk, which is a nogo as device often reboot and I don’t fancy working with huge text files.
Seen any similar use case around?
Thanks for any idea/help.