Timeframe: 5:55 pm - 6:45 pm PST, September 14, 2017
Affected Systems: US Production environment
Symptoms: Internal synthetic tests detected Zuora API and UI failures
Root Cause: A virtual server failed because of a hardware failure, disrupting service in a subset of our systems. One of the affected systems was unable to fail over to its redundant standby system because of a system lock. This is an initial analysis; we are continuing to investigate why this service was unable to fail over as expected.
Resolution: The impacted services were migrated to a different virtual host and service was restored.
Future Preventative Measures: Reduce affinity in our virtualization environment for key component services so that they are less concentrated on a single host, and improve the resiliency of our high-availability components.