Learn from Various (Linux OS)

Remember when a single second
brought down Reddit and LinkedIn?

On June 30, 2012, the introduction of a leap second triggered a kernel timing bug in Linux, leading to widespread server lockups. Incident Drill lets your team practice responding to similar OS-level incidents in a safe, controlled environment.

Various (Linux OS) | 2012 | Bug/Outage (OS)

The Danger of Unforeseen Timing Issues

Unexpected events like leap seconds can expose hidden vulnerabilities in your systems. The 2012 incident highlighted the importance of robust time handling and the potential for seemingly minor changes to cause major outages. Teams must be prepared to diagnose and mitigate these unpredictable failures quickly.

PREPARE YOUR TEAM

How Incident Drill Prepares Your Team

Incident Drill provides realistic incident simulations based on real-world events like the Linux Leap Second Crash. Teams practice troubleshooting, collaborating, and implementing solutions under pressure, improving their ability to handle critical incidents effectively and minimizing downtime.

⏱️

Realistic Simulations

Experience the chaos of a real incident without the real-world consequences.

🧑‍💻

Collaborative Environment

Work together with your team to diagnose and resolve the issue.

🔎

Root Cause Analysis

Deeply understand the underlying causes of the incident.

📈

Performance Tracking

Measure your team's performance and identify areas for improvement.

📚

Post-Incident Review

Learn from your mistakes and prevent future incidents.

🛠️

Customizable Scenarios

Tailor the simulation to your specific infrastructure and needs.

WHY TEAMS PRACTICE THIS

Become Incident-Ready

  • Improve incident response time
  • Reduce downtime and impact
  • Strengthen team collaboration
  • Identify vulnerabilities in your systems
  • Enhance problem-solving skills
  • Build confidence in handling critical situations
June 30, 2012, 23:59:60 UTC
Leap Second Introduced
Moments Later
Kernel Timing Bug Triggered
Minutes to Hours
High CPU Spin-Wait Loops Occur Error
Ongoing
System Lockups & Service Disruptions (Reddit, LinkedIn)
Hours/Days Later
Patches Applied / Systems Rebooted Success

How It Works

1

Step 1: Simulate the Incident

Run a realistic simulation of the Linux Leap Second Crash.

2

Step 2: Diagnose the Root Cause

Identify the kernel timing bug and its impact on system performance.

3

Step 3: Implement a Solution

Apply patches or implement workarounds to mitigate the issue.

4

Step 4: Review and Learn

Analyze the incident and identify areas for improvement in your systems.

Ready to Practice Incident Response?

Join the Incident Drill waitlist and be among the first to experience realistic incident simulations. Prepare your team for anything!

Get Early Access
Founding client discounts Shape the roadmap Direct founder support

Join the Incident Drill waitlist

Drop your email and we'll reach out with private beta invites and roadmap updates.