Episode Details

Back to Episodes
The Problem with MTTR: Learning from Incident Reports | Courtney Nash

The Problem with MTTR: Learning from Incident Reports | Courtney Nash

Season 3 Episode 6 Published 3 years, 1 month ago
Description

Tracking Mean Time To Restore (MTTR) is standard industry practice for incident response and analysis, but should it be? 

Courtney Nash, an Internet Incident Librarian, argues that MTTR is not a reliable metric - and we think she's got a point.

We caught up with Courtney at the DevOps Enterprise Summit in Las Vegas, where she was making her case against MTTR in favor of alternative metrics (SLOs and cost of coordination data), practices (Near Miss analysis), and mindsets (humans are the solution, not the problem) to help organization better learn from their incidents.

Show Notes

OFFERS

  • Start Free Trial: Get started with LinearB's AI productivity platform for free.
  • Book a Demo: Learn how you can ship faster, improve DevEx, and lead with confidence in the AI era.

LEARN ABOUT LINEARB

  • AI Code Reviews: Automate reviews to catch bugs, security risks, and performance issues before they hit production.
  • AI & Productivity Insights: Go beyond DORA with AI-powered recommendations and dashboards to measure and improve performance.
  • AI-Powered Workflow Automations: Use AI-generated PR descriptions, smart routing, and other automations to reduce developer toil.
  • MCP Server: Interact with your engineering data using natural language to build custom reports and get answers on the fly.
Listen Now

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us