Learn from Google Cloud

Could Your Team Handle a Multi-Region Outage Like Google's?

In June 2019, a configuration bug brought Google Cloud's network in multiple US regions to its knees for hours. Incident Drill lets your team simulate this scenario and practice responding under pressure, so you're prepared when disaster strikes.

Google Cloud | 2019 | Outage (Network)

The High Stakes of Cloud Infrastructure

Modern infrastructure is complex. A single misconfiguration, like the one behind Google's 2019 outage, can cascade into service disruptions, data loss, and reputational damage. Reactive measures alone are no longer enough.

PREPARE YOUR TEAM

Incident Drill: Practice Makes Perfect

Incident Drill provides a safe and realistic environment to practice responding to incidents like the 2019 Google Cloud Network Outage. We recreate the environment, inject the failure, and guide your team through the troubleshooting process. We help you identify weaknesses, improve communication, and build resilience.

  • 🔥 Realistic Simulations: Experience the pressure of a real-world incident.
  • 🔎 Root Cause Analysis: Uncover the underlying causes of complex failures.
  • 🗣️ Team Collaboration: Improve communication and coordination during incidents.
  • 📈 Performance Tracking: Measure your team's performance and identify areas for improvement.
  • 📚 Post-Incident Review: Learn from mistakes and prevent future incidents.
  • ⚙️ Customizable Scenarios: Tailor simulations to your specific infrastructure and needs.

WHY TEAMS PRACTICE THIS

Master Incident Response

  • Reduce downtime and minimize impact
  • Improve team communication and collaboration
  • Identify and address infrastructure weaknesses
  • Build confidence in your team's ability to handle crises
  • Meet compliance requirements for incident response training
  • Reduce the risk of future incidents

Google Cloud Network Outage - 2019

Time | Event | Status
Initial Trigger | Configuration Bug Introduced | CRITICAL
~30 mins | Network Control Plane Jobs Descheduled | ERROR
~1 hour | Multi-Region Network Failure | MAJOR OUTAGE
~3-4 hours | Service Restored | RESOLVED

How It Works

Step 1: Select the Scenario

Choose the Google Cloud Network Outage simulation.

Step 2: Assemble Your Team

Gather your engineers and incident responders.

Step 3: Run the Simulation

Work together to diagnose and resolve the incident.

Step 4: Review and Learn

Analyze your performance and identify areas for improvement.

Ready to Level Up Your Incident Response?

Join the Incident Drill waitlist and be among the first to access our Google Cloud Network Outage simulation and other realistic incident scenarios.

Get Early Access
  • Founding client discounts
  • Shape the roadmap
  • Direct founder support

Join the Incident Drill waitlist

Drop your email and we'll reach out with private beta invites and roadmap updates.