I have a Nagios service which has correctly noticed a problem. I am running a command on that machine that will fix the problem, but it will take a few hours to run. Until then, Nagios still (correctly) detects it as a problem. I can "acknowledge" the problem so that I don't get further notifications, but if something goes wrong with my cleanup command, I won't know about it unless I remember to recheck.
Is there any way to "acknowledge" a Nagios problem for a certain amount of time, and after that time, if it's still a problem, have Nagios send an alert as normal? Something like "ignore this problem for the next X minutes/hours/days", a "snooze" button for a Nagios problem?
Best Answer
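Yes, that's called scheduled downtime. While a host or service is in downtime, Nagios suppresses notifications for it.
The documentation describes two variants of downtime: fixed, which starts and stops at the exact times you specify, and flexible, which starts when the host or service enters a problem state at some point within the window you specify and then lasts for a set duration.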
In this case you would want a fixed downtime that starts now and ends when you expect your command to finish. If the service is still in a problem state once the downtime has ended, notifications resume as normal.
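Besides the web interface, downtime can be scheduled by writing a `SCHEDULE_SVC_DOWNTIME` line to the Nagios external command file. The following is a minimal sketch of that approach, not something from the original answer. It assumes external commands are enabled, and the command file path, host name, service description, author, and comment are all placeholders that you would need to adjust for your installation.

```python
import time

# Assumed location of the external command file; it varies between installs
# (check the command_file setting in nagios.cfg).
COMMAND_FILE = "/usr/local/nagios/var/rw/nagios.cmd"


def build_downtime_command(host, service, hours, author, comment, now=None):
    """Build a SCHEDULE_SVC_DOWNTIME line for a fixed downtime starting now."""
    now = int(time.time()) if now is None else int(now)
    duration = int(hours * 3600)   # seconds; only used by flexible downtime
    end = now + duration
    fixed = 1                      # 1 = fixed downtime, 0 = flexible
    trigger_id = 0                 # 0 = not triggered by another downtime
    return (
        f"[{now}] SCHEDULE_SVC_DOWNTIME;{host};{service};"
        f"{now};{end};{fixed};{trigger_id};{duration};{author};{comment}\n"
    )


def schedule_downtime(line, command_file=COMMAND_FILE):
    """Write one command line to the Nagios external command file."""
    with open(command_file, "w") as f:
        f.write(line)
```

For example, `schedule_downtime(build_downtime_command("web01", "Disk Space", 3, "admin", "cleanup running"))` would request three hours of fixed downtime for a hypothetical "Disk Space" service on a hypothetical host "web01".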