Learn from Amazon Web Services
When Automated Capacity
Brought Down Half the Internet
In 2021, a seemingly routine AWS capacity increase triggered a cascading failure that crippled us-east-1, impacting countless services. Incident Drill helps engineering teams practice responding to similar high-pressure scenarios, turning chaos into calm.
WHY TEAMS PRACTICE THIS
Prepare for the Unthinkable
- ✓ Reduce downtime and minimize impact
- ✓ Improve incident response time
- ✓ Enhance team collaboration and communication
- ✓ Build confidence in handling critical incidents
- ✓ Identify weaknesses in your infrastructure
- ✓ Prevent future outages
How It Works
1
Step 1: Understand the Incident
Review the official AWS post-mortem and related resources.
2
Step 2: Simulate the Scenario
Use Incident Drill to recreate the conditions that led to the outage.
3
Step 3: Practice Your Response
Work with your team to diagnose the problem and implement solutions.
4
Step 4: Analyze and Improve
Review your team's performance and identify areas for improvement.
EXPLORE MORE
Related Incidents
Ready to Level Up Your Incident Response?
Join the Incident Drill waitlist and be among the first to experience realistic incident simulations.
Get Early Access →
✓ Founding client discounts
✓ Shape the roadmap
✓ Direct founder support