We test Corax against real-world infrastructure failures across every vendor, platform, and scenario. Browse the results below.
The hub firewall's IKE daemon crashes, tearing down all 6 site-to-site IPSec VPN tunnels simultaneously. All branch offices lose connectivity to the data center. File shares, ERP, email, and VoIP between sites all fail.
The HA synchronization between a FortiGate firewall cluster pair fails due to a mismatched firmware version after one unit was updated. Session tables are out of sync. If the primary fails, the secondary has a stale configuration that will break VPN tunnels and NAT rules.
A junior admin pushes a firewall rule that blocks TCP port 443 outbound for the production server VLAN. All HTTPS-dependent services fail — API calls to payment gateways, cloud backups, software license checks, and update services all stop.
The Windows Print Spooler service crashes on the central print server after processing a corrupted print job from an updated driver. All 45 network printers become inaccessible. 400 users across 3 floors cannot print.
After a Lambda function deployment, cold start times spike from 2 seconds to 28 seconds due to a new heavy SDK dependency. API Gateway returns 504 Gateway Timeout for cold-start invocations. Provisioned concurrency was removed to save costs last month.
An AWS RDS PostgreSQL Multi-AZ instance experiences a hardware failure in the primary AZ. Automatic failover to the standby in the secondary AZ triggers. Applications experience 60-120 seconds of downtime. DNS endpoint resolves to new primary.
Azure AD Connect sync fails due to an expired service account password. Password hash sync stops working. New users created in on-prem AD are not provisioned in Azure AD. Existing cloud users with changed passwords cannot authenticate to M365 services.
An Azure App Service Plan hosting 3 production web apps starts returning 502 Bad Gateway errors after an Azure platform update. The apps intermittently crash with out-of-memory exceptions. Azure Status page shows degraded performance in East US 2 region.
The FreePBX server crashes due to an Asterisk segfault after a problematic module update. All 150 SIP phones lose registration. IVR, voicemail, call queues, and ring groups all go offline. No redundant PBX in place.
The SIP trunk between the on-premises PBX and the ITSP loses registration after the provider changes their SBC IP without notice. All outbound and inbound PSTN calls fail. Internal extension-to-extension calls still work.
The offsite backup copy job to Wasabi S3 stalls at 23% due to ISP bandwidth throttling during business hours. The 8TB backup set cannot complete within the backup window. Offsite RPO violated for disaster recovery compliance.
A Veeam incremental backup fails because the previous restore point was corrupted by a storage error. The backup chain is broken, requiring a new active full backup. With 95 VMs, the full backup will take 18 hours and consume 4TB of additional space.
The Veeam backup repository runs out of disk space during the nightly backup window. All backup jobs fail with 'insufficient disk space' errors. No backups have completed for 3 nights. RPO violated for all protected VMs.
An Exchange Database Availability Group (DAG) member server experiences a storage failure causing the active database copy to dismount. The passive copy on the second server activates but is 5 minutes behind due to log shipping lag.
The Exchange 2019 Hub Transport service freezes after a malformed email triggers an anti-malware scanning loop. The mail queue grows to 15,000 messages. Internal and external email delivery stops completely. Users unaware until critical business emails bounce.
Group Policy processing fails across the domain after a SYSVOL replication issue leaves the DFS-R replicated SYSVOL share inconsistent between DCs. Workstations receive incomplete or conflicting policies. Security baselines not enforcing.
The domain controller holding all 5 FSMO roles (PDC Emulator, RID Master, Infrastructure Master, Schema Master, Domain Naming Master) goes down due to a motherboard failure. Password changes, account creation, and domain joins all fail.
Active Directory replication between the two domain controllers fails due to a lingering object conflict. Users at the branch office (authenticating against DC-02) see stale group memberships and GPOs. Password changes on DC-01 not replicating to DC-02.
The internal enterprise CA intermediate certificate is revoked by mistake during a PKI cleanup. All certificates issued by the intermediate CA are now untrusted. Internal web apps, RADIUS 802.1X auth, and LDAPS all fail certificate validation.
The SSL/TLS certificate on the public-facing customer portal expires at midnight. Chrome and Edge users see NET::ERR_CERT_DATE_INVALID. HSTS enforcement prevents bypass. API clients receiving TLS handshake failures. Revenue-impacting for e-commerce.
Every scenario is tested against Corax's Neural Engine in a production environment with AI-powered root cause analysis.
Tests run continuously as new infrastructure patterns are added.