0
Fixed
Anturis central system alerting intermitten loss of server agent metrics
In recent days, we have been receiving a flood of alerts about the loss of server agent metric reports, only to have them restored shortly after.
Is the central system that collects the agents' data going through some sort of update? There does not appear to be anything wrong with our servers/agents (and for all of them to fail at the same time is one massive coincidence).
We tried configuring the monitors to alert at a higher number of successive failures, but it seems that rule does not apply to this type of situation; we still getting massive alert burst whenever the central system fails to pick up metrics from our servers.
Is the central system that collects the agents' data going through some sort of update? There does not appear to be anything wrong with our servers/agents (and for all of them to fail at the same time is one massive coincidence).
We tried configuring the monitors to alert at a higher number of successive failures, but it seems that rule does not apply to this type of situation; we still getting massive alert burst whenever the central system fails to pick up metrics from our servers.
Customer support service by UserEcho
At that time we had a problem with one of our server that affected part of our customers. The problem was caused by our current hosting provider. It is solved by now. We are already in process of migration to another provider's dedicated hosting.
Sorry for any inconvenience.
Would like the rule to follow the configuration we set (currently to 5 subsequent failures).
The system now should respect what you configured in Status Rule. If this is not the case please forward such fake incident notifications to support@anturis.com and we will check what's going on. E.g. one such glitch happened yesterday at 09:54 GMT and affected a portion of customers.
Thank you again for bringing this up.
Have adjusted memory monitors to 5 failures, will see how that goes. thanks.