Incident Analytics & Metrics
Measure your team's incident response performance, identify recurring patterns, and track improvement over time with FanDesk's incident analytics dashboard.
Key Metrics
MTTA — Mean Time to Acknowledge
The average time between an incident being triggered (created) and the first acknowledgment by a responder.
| Target | Description |
|---|---|
| < 5 minutes | Excellent — alert and response system is working |
| 5-15 minutes | Good — responders are monitoring effectively |
| 15-60 minutes | Needs improvement — consider better alerting or on-call coverage |
| > 1 hour | Critical gap — incidents are sitting unattended |
A high MTTA usually points to alerting problems, on-call coverage gaps, or notifications that reach responders too slowly (WhatsApp notifications can help here).
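If you want to check the MTTA figure by hand, the calculation can be sketched as below. This is a minimal illustration, not FanDesk's internal implementation: the function names are hypothetical, unacknowledged incidents are assumed to be skipped, and because the bands in the table share their boundary values, the sketch assumes 15 minutes counts as Good and 60 minutes as Needs improvement.

```python
from datetime import datetime, timedelta
from statistics import mean

def mean_time_to_acknowledge(incidents):
    """Return MTTA as a timedelta, or None if nothing was acknowledged.

    `incidents` is an iterable of (created, acknowledged) datetime pairs;
    `acknowledged` may be None for incidents nobody has picked up yet.
    """
    waits = [
        (acked - created).total_seconds()
        for created, acked in incidents
        if acked is not None  # assumption: unacknowledged incidents are skipped
    ]
    return timedelta(seconds=mean(waits)) if waits else None

def mtta_band(mtta):
    """Map an MTTA value onto the bands from the target table."""
    minutes = mtta.total_seconds() / 60
    if minutes < 5:
        return "Excellent"
    if minutes <= 15:  # assumption: boundary values fall in the better band
        return "Good"
    if minutes <= 60:
        return "Needs improvement"
    return "Critical gap"

sample = [
    (datetime(2024, 3, 4, 9, 0), datetime(2024, 3, 4, 9, 4)),      # 4 minutes
    (datetime(2024, 3, 4, 13, 30), datetime(2024, 3, 4, 13, 50)),  # 20 minutes
    (datetime(2024, 3, 5, 2, 15), None),                           # not yet acknowledged
]
mtta = mean_time_to_acknowledge(sample)
print(mtta, mtta_band(mtta))  # 0:12:00 Good
```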
MTTR — Mean Time to Resolution
The average time between an incident being triggered (created) and being marked resolved.
| Severity | Target MTTR |
|---|---|
| Critical | Under 2 hours |
| High | Under 4 hours |
| Medium | Under 24 hours |
| Low | Under 3 days |
MTTR varies significantly by severity and incident type. Whether the trend is improving or worsening over time matters more than how a single value compares against an absolute number.
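To compare your own numbers against the targets above, one possible approach is to average resolution times per severity and flag the severities that miss their target. The sketch below assumes you already have each resolved incident as a (severity, time to resolve) pair; the function names are illustrative and not part of FanDesk.

```python
from collections import defaultdict
from datetime import timedelta
from statistics import mean

# Targets copied from the table above.
TARGET_MTTR = {
    "Critical": timedelta(hours=2),
    "High": timedelta(hours=4),
    "Medium": timedelta(hours=24),
    "Low": timedelta(days=3),
}

def mttr_by_severity(resolved):
    """Group (severity, time_to_resolve) pairs and average each group."""
    buckets = defaultdict(list)
    for severity, duration in resolved:
        buckets[severity].append(duration.total_seconds())
    return {sev: timedelta(seconds=mean(vals)) for sev, vals in buckets.items()}

def missed_targets(mttr):
    """Return the severities whose MTTR is not under its target."""
    return sorted(sev for sev, value in mttr.items() if value >= TARGET_MTTR[sev])

sample = [
    ("Critical", timedelta(hours=1)),
    ("Critical", timedelta(hours=4)),
    ("High", timedelta(hours=3)),
    ("Low", timedelta(days=1)),
]
report = mttr_by_severity(sample)
print(missed_targets(report))  # ['Critical']
```

In the sample, the two Critical incidents average 2.5 hours, which misses the 2-hour target even though one of them was resolved in an hour.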
MTTA vs MTTR
The gap between MTTA and MTTR is the investigation-and-fix time. A very low MTTA with high MTTR means responders acknowledge quickly but struggle to resolve. Improve with better runbooks, clearer escalation paths, and postmortem action items.
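A small worked example makes the relationship concrete. It assumes both averages are taken over the same set of resolved incidents; if MTTA also includes incidents that are still open, the subtraction is only an approximation. The sample durations are invented for illustration.

```python
from datetime import timedelta

def average(durations):
    """Arithmetic mean of a non-empty list of timedeltas."""
    return sum(durations, timedelta()) / len(durations)

# (time to acknowledge, time to resolve), both measured from creation
resolved = [
    (timedelta(minutes=3), timedelta(hours=5)),
    (timedelta(minutes=4), timedelta(hours=7)),
    (timedelta(minutes=2), timedelta(hours=6)),
]

mtta = average([ack for ack, _ in resolved])
mttr = average([res for _, res in resolved])
fix_time = average([res - ack for ack, res in resolved])

# Over the same incidents, the mean fix time equals MTTR minus MTTA.
assert fix_time == mttr - mtta
print(mtta, mttr, fix_time)  # 0:03:00 6:00:00 5:57:00
```

Here acknowledgment is fast (3 minutes on average) while almost all of the 6-hour MTTR is spent investigating and fixing, which is the pattern described above.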
Analytics Dashboard
The Incidents analytics dashboard brings together summary cards, trend and distribution charts, a responder leaderboard, and an incident heatmap.
Summary Cards
| Card | Description |
|---|---|
| Total Incidents (30 days) | All incidents created in the past 30 days |
| Open Incidents | Currently active (Triggered + Acknowledged + Investigating) |
| Critical / High Open | Highest severity incidents requiring immediate attention |
| Average MTTA | Calculated across all incidents in the selected period |
| Average MTTR | Calculated across all resolved incidents in the selected period |
Incident Trends Chart
Bar chart showing incident count over time (daily or weekly granularity):
- Spot spikes in incident frequency, which often correlate with major deployments or infrastructure changes
- Identify recurring problem periods (e.g., Monday mornings after weekend maintenance)
- Track whether incident frequency is trending up or down
Distribution by Severity
Pie or bar chart showing the breakdown of incidents by severity level:
- A healthy pattern has more Low/Medium incidents than Critical/High
- An increasing proportion of Critical incidents is a warning sign of systemic instability
- Use this to justify stability investment to stakeholders
Distribution by Category
Which types of incidents are most common:
- Heavy on Outages → reliability and infrastructure investment needed
- Heavy on Security → security posture review needed
- Heavy on Performance → capacity planning or optimization needed
- Heavy on Bugs → QA and testing process improvements needed
MTTA and MTTR Over Time
Line charts showing how your average acknowledgment and resolution times change week over week:
- A downward trend means your team is getting faster
- Spikes often correspond to periods of high complexity, team changes, or coverage gaps
- Use this in team retrospectives to measure the impact of process changes
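If you want to reproduce a week-over-week series outside the dashboard, one way is to bucket incidents by the ISO week in which they were created. This is a sketch under assumptions: the function name is hypothetical, and FanDesk may draw its week boundaries differently.

```python
from collections import defaultdict
from datetime import datetime, timedelta
from statistics import mean

def weekly_mean_minutes(rows):
    """Average a per-incident duration by ISO week of creation.

    `rows` holds (created, duration) pairs, where duration is the time to
    acknowledge or the time to resolve. Returns {(year, week): minutes}.
    """
    buckets = defaultdict(list)
    for created, duration in rows:
        year, week, _ = created.isocalendar()
        buckets[(year, week)].append(duration.total_seconds() / 60)
    return {key: mean(vals) for key, vals in sorted(buckets.items())}

sample = [
    (datetime(2024, 3, 4, 9, 0), timedelta(minutes=20)),
    (datetime(2024, 3, 6, 16, 0), timedelta(minutes=10)),
    (datetime(2024, 3, 12, 11, 0), timedelta(minutes=8)),
]
trend = weekly_mean_minutes(sample)
print(trend)  # {(2024, 10): 15.0, (2024, 11): 8.0}
```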
Top Responders
Leaderboard showing who handles the most incidents in the selected time period:
- Name, avatar, and incident count
- Use for workload balancing — one person handling 80% of incidents is a bus factor risk
- Use for recognition — acknowledge top contributors in team meetings
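The workload check can also be run on exported data. The sketch below counts incidents per assignee and flags anyone above a share threshold; the function names and the 50% default are assumptions chosen for illustration, so pick a threshold that suits your team size.

```python
from collections import Counter

def responder_shares(assignees):
    """Return (name, incident_count, share_of_total), busiest first."""
    counts = Counter(assignees)
    total = sum(counts.values())
    return [(name, n, n / total) for name, n in counts.most_common()]

def overloaded(assignees, threshold=0.5):
    """Names of responders handling more than `threshold` of all incidents."""
    return [name for name, _, share in responder_shares(assignees) if share > threshold]

sample = ["ana"] * 8 + ["ben", "chen"]
print(responder_shares(sample)[0])  # ('ana', 8, 0.8)
print(overloaded(sample))           # ['ana']
```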
Incident Heatmap
Visual map of when incidents typically occur (day of week × hour of day):
- Identify whether incidents cluster on Monday mornings (a sign of risk from changes deployed over the weekend)
- Identify whether incidents cluster during business hours or off-hours (useful for on-call coverage planning)
- Find patterns that suggest systemic causes rather than random failures
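The grid behind a heatmap like this is a count of incidents per (day of week, hour of day) cell. A minimal sketch, assuming you have each incident's created time as a `datetime` in your team's local time zone (the function name is hypothetical):

```python
from collections import Counter
from datetime import datetime

DAYS = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]

def heatmap_counts(created_times):
    """Count incidents per (weekday, hour) cell."""
    return Counter((DAYS[t.weekday()], t.hour) for t in created_times)

sample = [
    datetime(2024, 3, 4, 9, 5),    # Monday
    datetime(2024, 3, 11, 9, 40),  # Monday
    datetime(2024, 3, 6, 22, 10),  # Wednesday
]
cells = heatmap_counts(sample)
print(cells.most_common(1))  # [(('Mon', 9), 2)]
```

Cells with consistently high counts are the ones worth investigating for a systemic cause.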
Exporting Incident Data
Export all incident data for external analysis or reporting:
- Go to the Incidents analytics page
- Set the date range using the range picker
- Click Export
- Download as CSV
- Open in Excel, Google Sheets, or your BI tool of choice
The CSV includes: incident ID, title, severity, category, status, created time, acknowledged time, resolved time, MTTA, MTTR, and assignee.
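Once you have the file, a short script can load it for further analysis. The sketch below is hedged: the header names (`incident_id`, `created_at`, and so on), the ID format, and the ISO 8601 timestamp format are assumptions made for illustration, so check them against the first rows of your actual export and adjust as needed.

```python
import csv
import io
from datetime import datetime

# Hypothetical header names and values; a real export may differ.
SAMPLE_EXPORT = """\
incident_id,severity,created_at,acknowledged_at,resolved_at
INC-1,Critical,2024-03-04T09:00:00,2024-03-04T09:04:00,2024-03-04T10:30:00
INC-2,Low,2024-03-05T14:00:00,,
"""

TIME_COLUMNS = ("created_at", "acknowledged_at", "resolved_at")

def parse_time(value):
    """Parse an ISO 8601 timestamp, treating an empty cell as missing."""
    return datetime.fromisoformat(value) if value else None

def load_incidents(handle):
    """Read export rows into dicts with the time columns parsed."""
    rows = []
    for row in csv.DictReader(handle):
        for column in TIME_COLUMNS:
            row[column] = parse_time(row[column])
        rows.append(row)
    return rows

incidents = load_incidents(io.StringIO(SAMPLE_EXPORT))
first = incidents[0]
print(first["resolved_at"] - first["created_at"])  # 1:30:00
```

To read a downloaded file instead of the inline sample, pass `open(path, newline="")` to `load_incidents`.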
Using Analytics to Drive Improvement
Identify Patterns
Ask the right questions of your data:
- Do incidents spike after every deployment? → Improve deployment testing and staging validation
- Does MTTA jump on weekends? → Check for an on-call coverage gap or alerts that are not reaching responders
- Does one category dominate? → Systemic issue in that area needing dedicated investment
Improve Response with Runbooks
- High MTTR for a specific category? → Document resolution steps as runbooks linked from the category
- Repeated postmortem action items that don't get done? → Ensure they are tracked as real FanDesk tasks with owners and due dates
Track Progress Month Over Month
- Set a quarterly MTTA target (e.g., reduce from 45 minutes to under 15 minutes)
- Review the trend chart in monthly incident retrospectives
- Celebrate wins when metrics improve — it reinforces the behavior changes that caused improvement
Using DeskMate for Incident Analytics
Ask DeskMate for insights:
- "How many critical incidents did we have last month?"
- "What's our average MTTR for the last quarter?"
- "Which team member resolved the most incidents in Q1?"
- "Are there any patterns in when our outages occur?"
Next: Learn about the daily digest in Daily Digest.
Need help? Contact us at hello@fandesk.ai