Incident Start Date: 19/08/2024 17:59 GMT
Incident End Date: 20/08/2024 01:50 GMT
Issue
During the incident, some users were unable to schedule content for publishing. The UI showed the publishing process as hanging, so affected users could not complete the scheduling action.
Root Cause
The issue was traced to a faulty shard in our infrastructure that had gone undetected. Publishing jobs routed to this shard were not processed, which caused the hang observed in the UI.
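For illustration only: a shard that accepts jobs but does not process them can be spotted by checking how long its oldest pending job has been waiting. The sketch below is not our production monitoring. The ShardStatus record, the find_stalled_shards function, and the 5-minute threshold are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import List, Optional


@dataclass
class ShardStatus:
    """Hypothetical snapshot of one shard's publishing queue."""
    shard_id: str
    pending_jobs: int
    # Enqueue time of the oldest job still waiting, or None if the queue is empty.
    oldest_pending_enqueued_at: Optional[datetime]


def find_stalled_shards(
    statuses: List[ShardStatus],
    now: datetime,
    max_wait: timedelta = timedelta(minutes=5),  # assumed threshold
) -> List[str]:
    """Return IDs of shards whose oldest pending job has waited longer than max_wait."""
    stalled = []
    for status in statuses:
        # An empty queue cannot be stalled.
        if status.pending_jobs == 0 or status.oldest_pending_enqueued_at is None:
            continue
        if now - status.oldest_pending_enqueued_at > max_wait:
            stalled.append(status.shard_id)
    return stalled


# Example with made-up data: shard-b has a job that has waited 40 minutes.
now = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
example = [
    ShardStatus("shard-a", 3, now - timedelta(seconds=30)),
    ShardStatus("shard-b", 12, now - timedelta(minutes=40)),
    ShardStatus("shard-c", 0, None),
]
stalled_ids = find_stalled_shards(example, now)
```

A check of this kind, run on a schedule and wired to an alert, would flag a shard that silently stops processing instead of relying on user reports.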
Corrective Actions
To prevent this from happening in the future, we have implemented the following measures: