CockroachDB Outage History

Past incidents and downtime events

Complete history of CockroachDB outages, incidents, and service disruptions. Showing 50 most recent incidents.


February 2026(1 incident)

majorresolvedFeb 2, 04:56 PM — Resolved Feb 2, 07:46 PM

Advanced cluster creation on AWS taking longer than expected

4 updates

resolvedFeb 2, 07:46 PM

We have deployed and verified a fix. Advanced clusters should now create within normal time frames.

identifiedFeb 2, 07:31 PM

We are continuing to work on the fix for this issue. Next update will be at 15:00 US/Eastern.

identifiedFeb 2, 05:57 PM

We are preparing a fix for deployment, which should restore AWS cluster creation times to normal levels. We will update the status page when the fix is deployed, or by 14:30 US/Eastern with more information.

identifiedFeb 2, 04:56 PM

We have identified an issue that is causing Advanced cluster creation on AWS to take longer than usual. We are preparing a fix, and will update this status page with more information when we have an ETA for resolution.

October 2025(3 incidents)

criticalresolvedOct 31, 03:54 PM — Resolved Oct 31, 10:09 PM

Standard and Basic cluster creation on AWS delayed

7 updates

resolvedOct 31, 10:09 PM

All cluster operations are working normally.

monitoringOct 31, 08:18 PM

We have resolved the underlying issue that was preventing cluster creation and other cluster operations. We have re-enabled cluster creation on all clouds. We will monitor the situation to ensure that everything continues to operate normally.

identifiedOct 31, 07:58 PM

We are continuing to work on a fix for this issue. Next update will be by 4:30 PM US/Eastern time.

identifiedOct 31, 06:59 PM

We are continuing to work on a fix for this issue. Next update will be by 4:00 PM US/Eastern time.

identifiedOct 31, 06:22 PM

We have identified the cause of cluster creation failures impacting Basic and Standard AWS clusters. As a precautionary measure, while we apply a fix, we have disabled all cluster creation in Cockroach Cloud. We will restore cluster creation when it is safe to do so. There is no SQL availability impact to any existing clusters. We will update here by 3:00 PM US/Eastern with more information.

investigatingOct 31, 04:21 PM

We are investigating errors with Basic and Standard clusters on AWS. Cluster creation, certain cluster edit operations (such as adding a region), and certain CockroachDB Cloud Console functionality (such as listing databases or users) are also impacted. Basic and Standard clusters on GCP are operating normally at this time.

investigatingOct 31, 03:54 PM

We are investigating errors that are causing delays in creation of Basic and Standard clusters on AWS. Basic and Standard cluster creation on GCP is operating normally at this time.

majorresolvedOct 30, 02:50 PM — Resolved Oct 30, 04:05 PM

Advanced Cluster Creation Delayed

3 updates

resolvedOct 30, 04:05 PM

Cluster creation is operating normally, and all delayed clusters have finished creation.

monitoringOct 30, 03:25 PM

We have applied a mitigation and observe that cluster creations are proceeding again. We will monitor all impacted cluster creations to ensure they succeed.

investigatingOct 30, 02:50 PM

We are investigating errors that are causing Advanced cluster creation to take longer than usual. Standard and Basic cluster creation is operating normally.

majorresolvedOct 20, 01:28 PM — Resolved Oct 21, 02:56 PM

AWS us-east-1 outage impacting cluster operations and backups

3 updates

resolvedOct 21, 02:56 PM

The earlier AWS outage in the us-east-1 region has been fully resolved. All CockroachDB Cloud operations, including cluster creation, scaling, and backups, are functioning normally again. We’ve verified recovery across affected systems and customer clusters, and normal operations have resumed.

monitoringOct 20, 10:52 PM

AWS has resolved the underlying outage in the us-east-1 region, and we’re seeing recovery across CockroachDB Cloud operations. Cluster creation, scaling, and backups are now succeeding again. We’re continuing to monitor for full stability before marking this incident as resolved.

identifiedOct 20, 01:28 PM

We’re currently impacted by an ongoing AWS outage in the us-east-1 region. Customers may experience failures or delays when creating new clusters, adding nodes, or running backups in this region. Existing clusters remain operational, but backup operations may be delayed or fail intermittently. We’re continuing to monitor AWS recovery efforts and will provide updates as more information becomes available.

September 2025(1 incident)

majorresolvedSep 24, 12:31 PM — Resolved Sep 24, 04:00 PM

Google Kubernetes incident impacting cluster operations in GCP.

2 updates

resolvedSep 24, 04:00 PM

Google has reported that the issue has been resolved, and we've verified that cluster operations are functional.

identifiedSep 24, 12:31 PM

We are currently experiencing an incident impacting certain cluster operations on Google Cloud Platform due to a known, ongoing Google Kubernetes-related issue. Customers may encounter errors when creating or editing clusters. Our engineering teams are actively monitoring the situation and communicating with Google regarding resolution.

July 2025(5 incidents)

majorresolvedJul 29, 11:08 PM — Resolved Jul 29, 11:35 PM

SQL availability impacted across multiple Serverless clusters.

3 updates

resolvedJul 29, 11:35 PM

We have confirmed resolution of the incident impacting SQL availability for the affected cluster. SQL availability is restored and the associated cluster is now healthy.

monitoringJul 29, 11:17 PM

Correction to the original status update: this issue appears to have been isolated to cluster crl-prod-6rw. We have identified the cause, mitigated it, and confirmed that cluster availability has returned. We are currently completing additional validations.

investigatingJul 29, 11:08 PM

We are observing an incident impacting SQL availability across multiple Serverless clusters. We are currently investigating the issue.

minorresolvedJul 21, 09:39 PM — Resolved Jul 22, 04:22 AM

Dependent Service incident intermittently impacting CockroachCloud Operations

2 updates

resolvedJul 22, 04:22 AM

Let's Encrypt has indicated the issue has been mitigated. We have validated that cluster creation behaviour is now returning to normal.

identifiedJul 21, 09:39 PM

We are aware of an ongoing incident with the dependent service "Let's Encrypt" (https://letsencrypt.status.io/), which may impact cluster creation and add-region operations for CockroachDB Cloud Advanced clusters. We will update this status page with more information as it becomes available.

maintenanceresolvedJul 16, 12:30 PM — Resolved Jul 16, 04:58 PM

Cloud Console and API Planned Maintenance 16:00 - 17:00 UTC

3 updates

resolvedJul 16, 04:58 PM

The maintenance has been completed, and the cloud console and API are working normally.

identifiedJul 16, 04:01 PM

Maintenance is beginning now. The next update will be before 17:00 UTC.

identifiedJul 16, 12:30 PM

We will be performing a planned maintenance operation on the CockroachDB Cloud Console and the CockroachDB Cloud API today from 16:00 - 17:00 UTC. During this time, you may experience errors when attempting cluster create or edit operations. CockroachDB Cloud Advanced, Standard, and Basic clusters will remain fully available during this time. (An earlier version of this notice incorrectly stated the time of maintenance as 14:00 - 15:00 UTC; the maintenance will be from 16:00 - 17:00 UTC.)

majorresolvedJul 3, 04:58 PM — Resolved Jul 3, 05:39 PM

Cluster operations impaired

2 updates

resolvedJul 3, 05:39 PM

We have corrected a misconfiguration which was preventing cluster operations from succeeding. We are monitoring the success of cluster operations already in progress and new operations.

investigatingJul 3, 04:58 PM

We are investigating an issue that may prevent or delay cluster operations from succeeding.

majorresolvedJul 2, 05:42 PM — Resolved Jul 2, 08:46 PM

Cluster operations taking longer than expected

4 updates

resolvedJul 2, 08:46 PM

All cluster operations are succeeding normally at this time.

monitoringJul 2, 07:32 PM

We continue to see cluster operations succeeding, and will actively monitor to ensure the issue does not reoccur.

identifiedJul 2, 06:50 PM

We have identified the issue preventing cluster operations from proceeding, and we see that operations are moving forward. We will next update this incident by 16:00 EDT.

investigatingJul 2, 05:42 PM

We are investigating cluster operations, including creating or editing clusters, that are taking longer than usual.

June 2025(4 incidents)

majorresolvedJun 27, 10:29 PM — Resolved Jun 27, 10:36 PM

Issue impacting Basic and Standard tier cluster creation in AWS.

2 updates

resolvedJun 27, 10:36 PM

We have identified and rolled out a fix for the issue impacting cluster creation in AWS. This incident is now resolved.

identifiedJun 27, 10:29 PM

We have identified an issue impacting Basic and Standard tier cluster creation in AWS. We have conducted an initial investigation and are actively working to resolve the issue.

majorresolvedJun 12, 06:22 PM — Resolved Jun 12, 09:31 PM

Upstream cloud provider API is unavailable

4 updates

resolvedJun 12, 09:31 PM

Google has reported that mitigations have been applied and services are now recovering.

monitoringJun 12, 07:54 PM

Google has updated their public status page to indicate the underlying issue is resolved, and individual Google Cloud services are in the process of restoring full service. We are continuing to monitor the issue and impact to CockroachDB Cloud.

identifiedJun 12, 07:08 PM

In addition to cluster operations, backups to Google Cloud Storage, including managed backups from CockroachDB Cloud clusters on GCP, may be impacted.

identifiedJun 12, 06:22 PM

We are aware of errors in the Google Cloud API which may impact cluster creation and edit operations for CockroachDB Cloud Advanced clusters in GCP. We will update this status page with more information as it becomes available.

majorresolvedJun 4, 07:10 PM — Resolved Jun 4, 09:35 PM

Metrics Pages Unavailable in Cockroach Cloud Console for Basic and Standard Clusters

4 updates

resolvedJun 4, 09:35 PM

The issue that was preventing metrics pages from loading in the Cockroach Cloud console for Basic and Standard tier clusters has been fully resolved. Metrics are now loading as expected, and we’ve received confirmation from our service provider that the underlying issue has been addressed. We will continue to monitor for any signs of recurrence, but no further impact is expected at this time. Thank you for your patience.

monitoringJun 4, 08:19 PM

Metrics visualizations are now loading as expected. We are continuing to monitor system behavior to ensure stability and will mark this incident as resolved once we confirm full recovery.

identifiedJun 4, 07:31 PM

We’ve identified the source of the issue that was preventing metrics pages from loading in the Cockroach Cloud console for Basic and Standard tier clusters. The underlying service has recovered, and metrics visualizations are now loading as expected. We are continuing to monitor the situation to ensure full stability.

investigatingJun 4, 07:10 PM

We are currently investigating an issue that is preventing some metrics pages from loading in the Cockroach Cloud console for Basic and Standard tier clusters. Metrics data collection remains unaffected; however, we are seeing unexpected errors when retrieving metrics for display in the console. We are actively working to identify the root cause and restore full functionality. Further updates will be provided as our investigation continues.

majorresolvedJun 1, 07:29 AM — Resolved Jun 1, 07:29 AM

A brief network disruption affecting clusters in the AWS eu-west-1, eu-west-2, and eu-central-1 regions.

1 update

resolvedJun 2, 07:45 AM

Incident Start: AWS experienced a brief network blip in the eu-west-1, eu-west-2, and eu-central-1 regions. Clusters in these regions were unavailable between June 1, 19:29–19:35 UTC.

Incident End: The network issue affecting clusters in the impacted AWS regions has been fully resolved *automatically* as of 19:35 UTC. All affected clusters are now operating normally. Please contact support if you have any concerns.

November 2024(2 incidents)

majorresolvedNov 19, 04:48 PM — Resolved Nov 19, 06:36 PM

Elevated error rate for some Standard and Basic clusters

3 updates

resolvedNov 19, 06:36 PM

The Engineering team has isolated and resolved the elevated error rate for all affected clusters.

monitoringNov 19, 05:06 PM

The root cause of the issue was identified and resolved. Engineering is monitoring the situation.

investigatingNov 19, 04:48 PM

We have received alerts indicating elevated error rates on isolated Standard and Basic clusters. The Engineering team is currently investigating the issue.

noneresolvedNov 12, 01:00 AM — Resolved Nov 12, 01:00 AM

Elevated errors on serverless platform

1 update

resolvedNov 13, 04:24 PM

On Tuesday Nov 12, at approximately 1am UTC, we received alerts of elevated errors due to an underlying host hardware failure. The workloads on this host were migrated to a healthy host and the errors resolved once the migration was complete.

October 2024(1 incident)

minorresolvedOct 24, 07:00 PM — Resolved Oct 24, 09:05 PM

AWS Advanced cluster creation delayed

3 updates

resolvedOct 24, 09:15 PM

The upstream incident has been resolved, and we have confirmed that cluster creation is working normally.

identifiedOct 24, 08:06 PM

We are continuing to monitor the upstream incident.

identifiedOct 24, 08:04 PM

We are aware of an AWS incident that is preventing successful completion of Advanced cluster creation and region addition. Once the cloud provider has resolved their incident, we will ensure clusters create successfully.

September 2024(2 incidents)

minorresolvedSep 25, 02:15 PM — Resolved Sep 25, 02:30 PM

Small number of serverless clusters unavailable

2 updates

resolvedSep 25, 02:35 PM

We have deployed a mitigation and all clusters are available again.

investigatingSep 25, 02:27 PM

We are aware of an issue affecting a small number of serverless clusters that is causing unavailability. Our SRE team is investigating.

minorresolvedSep 18, 07:24 PM — Resolved Sep 18, 09:33 PM

Some AWS clusters not showing available backups on the "Backup and Restore" page

2 updates

resolvedSep 18, 09:33 PM

We have deployed a fix which has resolved the issue.

identifiedSep 18, 07:24 PM

We have identified an issue where some AWS clusters are unable to list available backups on the "Backup and Restore" page. Backups are still being performed and are available if a restore of a database or table is required.

August 2024(1 incident)

majorresolvedAug 9, 09:37 PM — Resolved Aug 9, 10:33 PM

Increased error rate in Serverless restore operations

2 updates

resolvedAug 9, 10:33 PM

We have corrected a misconfiguration that was causing restore errors, and have applied the fix to all serverless clusters.

investigatingAug 9, 09:37 PM

We are investigating increased errors in restore operations to serverless clusters.

July 2024(2 incidents)

majorresolvedJul 18, 11:20 PM — Resolved Jul 19, 03:09 PM

CockroachDB Unavailable in Azure Central US

3 updates

resolvedJul 19, 03:09 PM

We have now verified that all Azure clusters are operational and have completed backup jobs.

monitoringJul 19, 05:49 AM

The latest update on Microsoft’s status page shows all green, with some residual effects. CockroachDB clusters with nodes in Microsoft’s Central US region are all up and running, though some background jobs, such as backups, are failing. We are continuously monitoring the status of the clusters.

identifiedJul 18, 11:36 PM

CockroachDB clusters with nodes in Microsoft’s Central US region may be unavailable. We are monitoring the situation and waiting for an update from Microsoft. For more information see Microsoft’s status page at https://azure.status.microsoft/en-us/status/

majorresolvedJul 18, 01:56 AM — Resolved Jul 18, 02:45 PM

Dedicated Cluster Operations Errors

7 updates

resolvedJul 18, 02:45 PM

Cluster operations and Cloud Console pages are restored to normal working order.

monitoringJul 18, 04:20 AM

We have resolved the underlying cause of the outages, and are monitoring to ensure the system remains operational.

identifiedJul 18, 03:41 AM

We are continuing to work on a fix for this issue.

identifiedJul 18, 03:16 AM

We have further identified that this issue may impact pages within the Cloud Console that connect to Serverless and Dedicated clusters. SQL connections to both Serverless and Dedicated clusters are working normally. We are continuing to investigate the cause of this outage and working towards a solution.

identifiedJul 18, 03:04 AM

We are continuing to work on a fix for this issue.

identifiedJul 18, 02:34 AM

We are continuing to work on a fix for this issue.

identifiedJul 18, 02:22 AM

We are investigating an issue that is preventing CockroachDB Cloud Dedicated cluster creation and edit operations from running successfully.

June 2024(2 incidents)

minorresolvedJun 18, 08:31 PM — Resolved Jun 20, 07:56 PM

Azure cluster creation and modification impaired

4 updates

resolvedJun 20, 07:56 PM

Azure is no longer experiencing capacity issues in eastus.

monitoringJun 19, 04:32 PM

Due to capacity issues in the Azure East US region, some cluster operations such as cluster creation and modification may fail if East US is the target region.

monitoringJun 18, 11:07 PM

Azure cluster creation is now operational. Some existing clusters will be updated during their pre-scheduled maintenance windows to fix cluster modification on those clusters.

identifiedJun 18, 08:31 PM

Due to a change made by our cloud provider, cluster creation and cluster modification operations on Azure (only) are currently impaired while we prepare a fix.

minorresolvedJun 17, 02:48 PM — Resolved Jun 17, 02:48 PM

Cluster setting update caused a brief latency increase and availability impact on a single Serverless cluster.

1 update

resolvedJun 17, 05:17 PM

A cluster setting update resulted in higher than expected CPU activity, which caused increased latency and unavailability for a single Serverless cluster. The setting was rolled back to prevent further impact. The issue has been resolved. There should be no further availability impact.

May 2024(2 incidents)

minorresolvedMay 17, 07:24 PM — Resolved May 18, 12:58 PM

Intermittent failures when creating Azure CC dedicated clusters

2 updates

resolvedMay 18, 12:58 PM

This incident has been resolved.

monitoringMay 17, 07:24 PM

Azure is currently experiencing capacity issues for VMs in US regions. Users may experience intermittent failures creating clusters. Cockroach Labs is monitoring for errors. Azure notes that users "may receive error notifications when performing service management operations on a zonal level- such as create, delete, update, scaling, start, stop - for resources hosted in these regions. This issue affected various Intel and AMD general purpose VM sizes such as DSv4, DDSV4, DSv5, Dasv5, Bv2, etc. series."

majorresolvedMay 17, 12:42 AM — Resolved May 17, 02:01 AM

GCP Networking Outage

2 updates

resolvedMay 17, 02:01 AM

The Google Cloud Platform networking issue has been resolved. If you have any questions or concerns please submit a ticket to our support portal.

monitoringMay 17, 12:42 AM

We are aware of an ongoing Google Cloud Platform networking issue. This is impacting a select number of GCP Cockroach Cloud clusters. Cockroach Labs engineers are actively monitoring the situation. More information can be found here: https://status.cloud.google.com/incidents/xVSEV3kVaJBmS7SZbnre

February 2024(1 incident)

minorresolvedFeb 16, 04:04 PM — Resolved Feb 16, 09:48 PM

CockroachDB Cloud Dedicated cluster creation in AWS us-east-1 degraded

3 updates

resolvedFeb 16, 09:48 PM

This incident has been resolved.

monitoringFeb 16, 04:14 PM

We are continuing to monitor for any further issues.

monitoringFeb 16, 04:04 PM

We are currently monitoring an incident with AWS firehose creation in `us-east-1` that is causing cluster creation in `us-east-1` to take longer than normal. There is no impact to existing clusters. Our engineers are monitoring the situation.

January 2024(2 incidents)

minorresolvedJan 24, 04:51 AM — Resolved Jan 24, 04:05 PM

Degraded connectivity between clusters and external services.

4 updates

resolvedJan 24, 04:05 PM

A fix has been deployed on all affected clusters. CockroachDB Dedicated is mitigated as well. All services are back to normal and the incident is closed.

identifiedJan 24, 10:56 AM

A fix has been identified and deployment to impacted clusters is in progress. All CockroachDB Serverless environments are now mitigated.

identifiedJan 24, 07:00 AM

We are continuing to work on a fix for this issue.

identifiedJan 24, 04:51 AM

A subset of clusters are unable to reach external services, affecting features like CMEK, log export, and metrics export on impacted clusters. The clusters themselves remain available. We're actively working on a mitigation and will post updates until the issue is resolved.

minorresolvedJan 19, 12:29 AM — Resolved Jan 19, 01:30 AM

Clusters on Azure may be experiencing instability

2 updates

resolvedJan 19, 01:30 AM

We have replaced the failing nodes and all clusters are stable now.

investigatingJan 19, 12:29 AM

Some customers on Azure may be experiencing instability with their clusters, but all clusters remain up and functional at this time.

December 2023(1 incident)

majorresolvedDec 22, 02:01 AM — Resolved Dec 23, 02:46 AM

Failures on cluster operations

4 updates

resolvedDec 23, 02:46 AM

All cluster operations should now succeed. We are monitoring for any additional failures.

monitoringDec 22, 10:27 PM

Vertical scaling of clusters on AWS may take longer than expected.

identifiedDec 22, 05:07 PM

We've identified a root cause and are working on a solution. Current impact is limited to vertical scaling of AWS clusters only.

identifiedDec 22, 02:01 AM

We are currently seeing some operations on AWS clusters failing, including cluster creation and modification. We've investigated and found a root cause and are working on a mitigation.

November 2023(2 incidents)

majorresolvedNov 21, 12:00 PM — Resolved Nov 21, 12:00 PM

Serverless clusters in GCP europe-west1 inaccessible

1 update

resolvedNov 21, 02:14 PM

Serverless clusters in GCP europe-west1 were inaccessible.

minorresolvedNov 14, 09:53 PM — Resolved Nov 14, 11:35 PM

Cluster operations failing due to quay.io registry outage

2 updates

resolvedNov 14, 11:35 PM

Cluster operations appear to once again be healthy. We will continue to monitor for subsequent events/impacts due to the ongoing Quay.io issue.

monitoringNov 14, 09:53 PM

The Quay.io container registry is experiencing issues at the moment. CockroachDB Cloud is monitoring updates at https://status.quay.io/

October 2023(2 incidents)

majorresolvedOct 30, 08:25 PM — Resolved Oct 30, 08:36 PM

Single sign-on to console is failing

2 updates

resolvedOct 30, 08:36 PM

The upstream provider incident has been resolved, and SSO authentication is working again.

monitoringOct 30, 08:25 PM

Due to an upstream provider outage, all authentication methods for the Cockroach Cloud console are failing, other than authenticating via username and password. Existing sessions will continue to work until they time out.

noneresolvedOct 13, 04:06 PM — Resolved Oct 13, 05:14 PM

Cluster operations failing due to quay.io registry outage

3 updates

resolvedOct 13, 05:14 PM

Quay.io has indicated that the incident is resolved on their end.

monitoringOct 13, 05:12 PM

A fix has been implemented and we are monitoring the results.

identifiedOct 13, 04:06 PM

The Quay.io container registry is experiencing issues at the moment. Cockroach Cloud is monitoring updates at https://status.quay.io/incidents/3dq7w1tvs16f?u=bcn3g37v5rgb

September 2023(2 incidents)

minorresolvedSep 28, 09:09 PM — Resolved Sep 29, 02:33 PM

AWS is experiencing networking issues in the us-east-1 region

3 updates

resolvedSep 29, 02:33 PM

This incident has been resolved.

monitoringSep 29, 02:32 PM

We are continuing to monitor for any further issues.

monitoringSep 28, 09:09 PM

We've been advised of a networking issue in the AWS us-east-1 region that may affect cluster creation. We're currently waiting on an update from AWS for an estimated recovery time. https://health.aws.amazon.com/health/status

majorresolvedSep 8, 03:47 PM — Resolved Sep 8, 05:01 PM

Connectivity Issues

4 updates

resolvedSep 8, 05:01 PM

This incident has been resolved.

monitoringSep 8, 04:09 PM

A fix has been implemented and we are monitoring the results.

investigatingSep 8, 03:50 PM

We are continuing to investigate this issue.

investigatingSep 8, 03:47 PM

We are currently looking into connectivity issues in our Serverless environment. The impact is currently isolated to our `Europe-West-1` Serverless cluster in GCP.

August 2023(3 incidents)

minorresolvedAug 11, 04:45 PM — Resolved Aug 12, 04:50 PM

Some serverless clusters may be unable to restore backups

4 updates

resolvedAug 12, 04:50 PM

Incident has been resolved.

monitoringAug 12, 04:13 PM

Incident has been mitigated and we are currently monitoring.

identifiedAug 11, 05:57 PM

We've identified the root cause and are currently working on a mitigation.

investigatingAug 11, 04:45 PM

We are currently investigating the issue.

majorresolvedAug 3, 01:12 AM — Resolved Aug 3, 03:53 AM

Serverless intermittent connection failures

2 updates

resolvedAug 3, 03:53 AM

Our engineers have deployed a fix and the issue is now resolved.

investigatingAug 3, 01:12 AM

Serverless clusters are experiencing intermittent connection failures in the AWS us-east-1 region.

majorresolvedAug 2, 04:05 PM — Resolved Aug 2, 07:44 PM

Serverless Cluster Resource Limit Throttling

4 updates

resolvedAug 2, 07:44 PM

This issue has now been resolved. Users with clusters that were previously throttled will now see their changes to resource limits reflected.

identifiedAug 2, 07:02 PM

The scope of the impact has been limited to serverless clusters running AWS eu-west-1 and AWS ap-south-1. Clusters that have hit their resource limits will be throttled despite having their limits increased. One of two issues identified has been mitigated.

investigatingAug 2, 04:10 PM

Some Serverless clusters that have hit their resource limits are still being throttled even after limits have been increased.

investigatingAug 2, 04:05 PM

Some Serverless clusters that have hit their resource limits are still being throttled even after limits have been increased. Currently, only AWS eu-west-1 and ap-south-1 host clusters are impacted.

July 2023(3 incidents)

majorresolvedJul 25, 09:55 PM — Resolved Jul 25, 11:07 PM

Serverless clients unable to access GCP Multi-Region Serverless Public Preview hosts

3 updates

resolvedJul 25, 11:07 PM

Issues impacting GCP Multi-Region Serverless Public Preview host cluster connectivity are now resolved.

identifiedJul 25, 10:55 PM

We have identified the issue impacting connectivity to Serverless clusters on the GCP Multi-Region Serverless Public Preview host and are currently working towards resolution.

investigatingJul 25, 09:55 PM

Serverless customers may experience issues when connecting to serverless clusters running on the GCP Multi-Region Serverless Public Preview host. Other serverless clusters continue to be fully functional.

minorresolvedJul 18, 03:15 PM — Resolved Jul 18, 04:14 PM

Intermittent failures when creating CC dedicated clusters on Azure

3 updates

resolvedJul 18, 04:14 PM

This incident has been resolved.

identifiedJul 18, 03:24 PM

We are continuing to work on a fix for this issue.

identifiedJul 18, 03:15 PM

Azure is currently experiencing capacity issues for certain VM sizes in eastus2 and westeurope. Cockroach Labs is working with support to fully diagnose and mitigate any errors.

noneresolvedJul 11, 03:28 PM — Resolved Jul 11, 11:14 PM

Intermittent failures when creating CC dedicated clusters on Azure

2 updates

resolvedJul 11, 11:14 PM

This incident has been resolved.

identifiedJul 11, 03:28 PM

Azure is currently experiencing capacity issues for certain VM sizes in eastus2 and westeurope. Cockroach Labs is working with support to fully diagnose and mitigate any errors.

June 2023(3 incidents)

noneresolvedJun 14, 10:24 PM — Resolved Jun 15, 01:57 PM

Intermittent failures when creating CC dedicated clusters on Azure

2 updates

resolvedJun 15, 01:57 PM

We're no longer seeing elevated error rates for Azure clusters.

identifiedJun 14, 10:24 PM

The issue has been identified and Cockroach Labs is working with Azure to fully diagnose and mitigate any errors.

minorresolvedJun 13, 02:15 PM — Resolved Jun 14, 12:54 AM

Multi-Region Serverless Degraded Performance

5 updates

resolvedJun 14, 12:54 AM

This incident has been resolved.

monitoringJun 14, 12:51 AM

A fix has been implemented and we are monitoring the results.

identifiedJun 13, 05:43 PM

The issue has been identified and a fix is being implemented.

investigatingJun 13, 04:09 PM

We are continuing to investigate this issue and have confirmed it is isolated to a small number of queries.

investigatingJun 13, 03:06 PM

We are currently investigating an issue with our AWS Multi-Region Public Preview Serverless clusters where a small percentage of queries are failing.

majorresolvedJun 13, 08:13 PM — Resolved Jun 13, 11:24 PM

AWS us-east-1 seeing elevated errors

4 updates

resolvedJun 13, 11:24 PM

This incident has been resolved.

monitoringJun 13, 09:55 PM

Cluster creation is still delayed or failing in the AWS us-east-1 region.

monitoringJun 13, 09:22 PM

AWS us-east-1 appears to be recovering and we are seeing recovery of impacted systems. Please see https://health.aws.amazon.com/health/status for details.

investigatingJun 13, 08:13 PM

AWS is seeing elevated error rates in us-east-1; clusters in that region will be in a degraded state.

May 2023(3 incidents)

minorresolvedMay 24, 12:13 AM — Resolved May 24, 03:12 AM

Cluster creation on AWS failing in some regions

6 updates

resolvedMay 24, 03:12 AM

This incident has been resolved.

monitoringMay 24, 02:47 AM

A fix has been implemented and we are monitoring the results.

identifiedMay 24, 02:27 AM

The issue has been identified and a fix is being implemented.

investigatingMay 24, 12:17 AM

We are currently investigating this issue.

monitoringMay 24, 12:13 AM

Cluster creation on AWS is failing in eu-west-1. We have not observed this issue in any other regions at this time. We've identified a root cause and expect a fix to be deployed within the next 24 hours.

investigatingMay 24, 12:13 AM

majorresolvedMay 23, 06:51 PM — Resolved May 23, 07:15 PM

Multi-Region Serverless Clusters Are Experiencing Connection Issues

4 updates

resolvedMay 23, 07:15 PM

This incident has been resolved.

monitoringMay 23, 07:07 PM

A fix has been implemented and we are monitoring the results.

identifiedMay 23, 06:56 PM

The issue has been identified and a fix is being implemented.

investigatingMay 23, 06:51 PM

We are currently investigating this issue.

majorresolvedMay 17, 09:00 PM — Resolved May 17, 11:00 PM

Partial outage for Serverless clusters in us-east-1.

1 update

resolvedMay 18, 12:51 AM

Partial outage for Serverless clusters in us-east-1. Some users may be unable to connect to their clusters.

April 2023(1 incident)

minorresolvedApr 28, 06:23 PM — Resolved Apr 28, 09:23 PM

Edit cluster currently disabled.

4 updates

resolvedApr 28, 09:23 PM

This incident has been resolved.

monitoringApr 28, 09:03 PM

Cluster edit is re-enabled and should be functioning as expected. We are currently validating GKE cluster creation.

identifiedApr 28, 06:35 PM

We are currently investigating issues related to Edit Cluster and GKE cluster creation. Edit Cluster functionality is currently disabled. GKE cluster creation is currently failing. No other stability issues are expected.

identifiedApr 28, 06:23 PM

We are currently investigating an issue related to Edit Cluster. Edit Cluster functionality is currently disabled. No other stability issues are expected.

March 2023(1 incident)

minorresolvedMar 24, 08:52 PM — Resolved Mar 27, 03:34 PM

Elevated error rate for GCP cluster creation

3 updates

resolvedMar 27, 03:34 PM

No further instances of related job failures were observed while monitoring over the past 48 hours.

monitoringMar 24, 10:18 PM

We have identified the likely cause of the issue. We have confirmed this is isolated to specific GCP regions with limited reach. We are continuing to monitor the situation.

investigatingMar 24, 08:52 PM

We are observing an elevated error rate for cluster creation in GCP. We are currently investigating the issue.