Incidents

A complete incident management system with timeline, communication, and post-mortems. Incidents can be created manually or automatically when a critical/high alert goes unacknowledged.

Lifecycle

StateMeaning
OpenActive incident, needs attention
InvestigatingTeam analyzing root cause
IdentifiedRoot cause found, working on a fix
MonitoringFix applied, verifying stability
ResolvedIncident closed

Timeline

  • Every state change is recorded with timestamp and user.
  • Add manual notes to the timeline.
  • Correlated alerts are linked automatically.
  • Total duration and MTTR are calculated on resolution.

Notifications

Creating an incident notifies all channels configured for that severity. State changes send updates to the same channels, configured in Routing.