Skip to main content

Incidents & Alerts

Track, manage, and resolve incidents for your Nife deployments in one place.

What Is an Incident?

An incident is a detected service disruption or degradation — automatically created when an alert fires or when SRE Intelligence detects a critical anomaly.

Incident Lifecycle

Detected → Triggered → Acknowledged → Resolved
StateDescription
TriggeredAlert condition met, incident created
AcknowledgedTeam member has taken ownership
ResolvedIssue fixed, service restored

Viewing Incidents

  1. Navigate to your application in the Nife dashboard
  2. Go to InsightsIncidents & Alerts
  3. Filter by status, severity, or time range

Incident Details

Each incident shows:

  • Trigger — which alert or anomaly caused it
  • Affected service — the application and region
  • Timeline — when it started, acknowledged, and resolved
  • Related metrics — graphs of affected metrics during the incident
  • Linked alerts — all alerts that fired during the incident

Setting Up Alerts

Configure alert rules to automatically create incidents. See Creating Alert Rules.