I'm thinking about monitoring backup execution via checkmk and wondering whether someone has already looked into this (providing a simple Nagios/NRPE-compatible agent)?
What exactly are you trying to achieve? Currently, you can track backup progress in the Pure UI, and you can configure email reports that give you an overview of the daily activities. Is there something specific you would like to track?
The emails and the Pure UI are fine, but that's not the same as being able to feed backup status into a central monitoring system (one that can generate alerts on its own based on policies).
I think a suitable way to generate output for an NRPE plugin would be per VM:
- backup status (ok, warning, failed)
- time of backup execution (can then be used to generate graphs)
Ok. Sounds reasonable.
What would you need Pure to do? How would you feed your NRPE plugin? Would it have to dynamically query live data or would it be enough to have Pure dump the relevant data to a file every time there is a change that needs tracking?
I think the best way to gather data would be for Pure to (over)write a specific file (e.g. /var/run/pure.status) with a list of all VMs, each with its status (OK, WARNING, CRITICAL) and the execution duration in minutes.
The plugin would then simply read this file and either parse the information in it or pipe it straight through. The plugin will be called every other minute, so the information it reports will change only when a backup run has finished.
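To make the idea concrete, here is a minimal sketch of such a plugin in Python. The file format is not defined anywhere in this thread yet, so the one-line-per-VM layout `<vm> <STATUS> <minutes>` is purely an assumption, as are the function names `parse_status`, `evaluate` and `main`. The exit codes and the `text | perfdata` output line follow the usual Nagios plugin conventions.

```python
# Standard Nagios plugin exit codes.
EXIT_CODES = {"OK": 0, "WARNING": 1, "CRITICAL": 2, "UNKNOWN": 3}


def parse_status(text):
    """Parse lines of the ASSUMED form '<vm> <STATUS> <minutes>'."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        vm, status, minutes = line.split()  # ValueError on malformed lines
        if status not in ("OK", "WARNING", "CRITICAL"):
            raise ValueError(f"unexpected status {status!r}")
        entries.append((vm, status, float(minutes)))
    return entries


def evaluate(entries):
    """Return (exit_code, output_line) using the worst status of all VMs."""
    if not entries:
        return EXIT_CODES["UNKNOWN"], "UNKNOWN - no VMs listed in status file"
    worst = max((status for _, status, _ in entries), key=EXIT_CODES.get)
    problems = [f"{vm}={status}" for vm, status, _ in entries if status != "OK"]
    summary = ", ".join(problems) if problems else f"{len(entries)} VM(s) backed up"
    # Perfdata: one unitless value per VM (duration in minutes) for graphing.
    perfdata = " ".join(f"'{vm}'={minutes:g}" for vm, _, minutes in entries)
    return EXIT_CODES[worst], f"{worst} - {summary} | {perfdata}"


def main(path="/var/run/pure.status"):
    """Read the status file, print one output line, return the exit code."""
    try:
        with open(path, encoding="utf-8") as handle:
            entries = parse_status(handle.read())
    except (OSError, ValueError) as exc:
        print(f"UNKNOWN - cannot read {path}: {exc}")
        return EXIT_CODES["UNKNOWN"]
    code, output = evaluate(entries)
    print(output)
    return code
```

To run it as an actual check, the script would end with `import sys` and `sys.exit(main())`, so that NRPE sees the exit code. A missing or malformed file is reported as UNKNOWN rather than OK, which seems the safer default for a backup check.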
From your side it's most probably almost the same as the mail generation, just in a different format and written to a file. Please give me a few days to come up with a proposal for the format of the text file. I would then write the plugin and eventually post a tutorial here in the forum (maybe we need a guides section soon) on how to integrate Pure into either Nagios or checkmk.
Sounds like a very nice idea. Could the plugin support a database format (e.g. SQLite database in a single file)?
In any case, let me know when you have a list of necessary data fields.
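If the SQLite route were taken, the plugin side could stay small. The sketch below assumes a hypothetical table `backup_status(vm, status, duration_minutes)`; neither the table nor the helper name `read_entries` comes from Pure, they are only placeholders until the data fields are agreed on. It uses Python's standard `sqlite3` module and opens the database read-only.

```python
import sqlite3
from pathlib import Path


def read_entries(db_path):
    """Return (vm, status, minutes) rows from the ASSUMED backup_status table."""
    # Open read-only via a file: URI so the check can never modify the data.
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        rows = conn.execute(
            "SELECT vm, status, duration_minutes FROM backup_status ORDER BY vm"
        ).fetchall()
    finally:
        conn.close()
    return [(vm, status, float(minutes)) for vm, status, minutes in rows]
```

The returned list has the same shape a text-file parser would produce, so the rest of a plugin would not need to care which storage format is used.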