I've been thinking about monitoring backup execution via checkmk and am wondering whether anyone has already looked into this (i.e. providing a simple Nagios/NRPE-compatible agent).
The mails and the Pure UI are fine, but that's not the same as being able to feed backup status into a central monitoring system that can generate alerts on its own based on policies.
I think a suitable set of outputs for an NRPE plugin would be, per VM:
- backup status (OK, WARNING, CRITICAL)
- backup execution time (can then be used to generate graphs)
I think the best way to gather the data would be for Pure to write (and overwrite on each run) a specific file, e.g. /var/run/pure.status, listing all VMs with their status (OK, WARNING, CRITICAL) and the execution duration in minutes.
The plugin would then simply read this file and either parse the information in it or pass it through unchanged. The plugin will be called every other minute, so its output only changes once a backup run has finished.
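To make the idea a bit more concrete, here is a rough sketch of what the plugin side could look like. The file format is purely a placeholder of mine (one line per VM: `<vm> <STATUS> <minutes>`, whitespace-separated), and the function names are made up for illustration; nothing here reflects what Pure actually writes today. The exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and the `text | perfdata` output line follow the usual Nagios plugin conventions.

```python
# Standard Nagios plugin exit codes.
EXIT_CODES = {"OK": 0, "WARNING": 1, "CRITICAL": 2, "UNKNOWN": 3}

# Example path from the proposal; the file does not exist yet.
STATUS_FILE = "/var/run/pure.status"


def parse_status(text):
    """Parse lines of the ASSUMED form '<vm> <STATUS> <minutes>'."""
    results = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        parts = line.split()
        if len(parts) != 3 or parts[1] not in EXIT_CODES:
            raise ValueError(f"unparsable line: {line!r}")
        vm, status, minutes = parts
        results.append((vm, status, float(minutes)))
    return results


def summarize(results):
    """Return (exit_code, output_line) using the worst status of all VMs."""
    if not results:
        return EXIT_CODES["UNKNOWN"], "UNKNOWN - no VMs listed in status file"
    worst = max(results, key=lambda r: EXIT_CODES[r[1]])[1]
    bad = [vm for vm, status, _ in results if status != "OK"]
    message = f"{len(results)} VMs, {len(bad)} not OK"
    if bad:
        message += " (" + ", ".join(bad) + ")"
    # One perfdata value per VM so the monitoring system can graph durations.
    perfdata = " ".join(f"{vm}_duration={minutes:g}" for vm, _, minutes in results)
    return EXIT_CODES[worst], f"{worst} - {message} | {perfdata}"


def main(path=STATUS_FILE):
    """Read the status file, print one Nagios-style line, return the exit code."""
    try:
        with open(path) as fh:
            results = parse_status(fh.read())
    except (OSError, ValueError) as exc:
        print(f"UNKNOWN - {exc}")
        return EXIT_CODES["UNKNOWN"]
    code, line = summarize(results)
    print(line)
    return code
```

To run it as an actual plugin, the script would end with `import sys; sys.exit(main())` so the monitoring system sees the exit code. Reporting the worst status across all VMs as a single service is just one option; one service per VM would work equally well if that fits the alerting policies better.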
On your side it's most probably almost the same as the mail generation, just in a different format and written to a file. Please give me a few days to come up with a proposal for the format of the text file. I would then write the plugin and eventually post a tutorial here in the forum (maybe we need a guides section soon) on how to integrate Pure into either Nagios or checkmk.