Resolved
All services are verified to be operational.
Investigating
The Atlas service is up and running now. The team is continuing to monitor all services.
Investigating
All services are up and stable.
We are fixing a minor issue with the Atlas business manager portal, which should be resolved shortly.
Investigating
The fix has been applied. Services have recovered. We're actively monitoring.
Investigating
The team has identified the cause of the issue and is working out a fix that can be applied with minimal effort and time.
Investigating
The team is working directly with the AWS engineering team to identify a resolution.
Context:
This was a planned upgrade of our Kubernetes cluster, which resulted in a disruption after the upgrade.
Identified
We’re having an issue while upgrading our compute cluster. It’s affecting some of our production workloads.