Learn how YugabyteDB handles node loss and recovery to ensure consistent, continuous availability. With features like a 3-second leader re-election process and a robust fallback mechanism, your cluster can tolerate disruptions and resume optimal operation efficiently.
Learn how YugabyteDB ensures stability when a downed node rejoins the cluster. Our latest blog covers the essential 15-minute Raft follower-failure timeout and the built-in fault tolerance mechanisms that keep your data secure and accessible.
If a Node Becomes Unavailable, Does YugabyteDB Propagate Client Requests to Surviving Nodes? Or Do Clients Have to Handle That Retry Logic?
This blog explores YugabyteDB’s ability to propagate client requests to surviving nodes when a node becomes unavailable. The behavior depends on factors such as the client connection method, the driver used, and connection pooling.
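One common way clients avoid hand-rolled retry logic at connect time is a multi-host connection string: YSQL is PostgreSQL-compatible, and libpq-based drivers will try each listed host in order, skipping ones that are down. The sketch below only builds such a DSN; the hostnames, port, and database name are hypothetical placeholders, not taken from this post.

```python
def build_multi_host_dsn(hosts, port=5433, dbname="yugabyte", user="yugabyte"):
    """Build a libpq-style DSN listing every node in the cluster.

    A libpq-compatible driver attempts each host in order, so if the
    first node is unavailable the connection falls through to the next
    surviving node without any retry code in the application.
    """
    host_list = ",".join(hosts)
    # libpq expects one port per host when multiple hosts are listed
    port_list = ",".join(str(port) for _ in hosts)
    return f"host={host_list} port={port_list} dbname={dbname} user={user}"

# Hypothetical node names for illustration only
dsn = build_multi_host_dsn(["node1.example", "node2.example", "node3.example"])
print(dsn)
```

Smart drivers (such as the YugabyteDB JDBC smart driver) go further and refresh the node list from the cluster itself, but a plain multi-host DSN already covers connect-time failover.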
The purpose of this blog post is to show how you can increase the failure threshold when a node goes down, which can happen for a variety of reasons, including hardware or network issues and, most commonly, maintenance. The failure threshold is the amount of time YugabyteDB waits, after a node goes down, for it to potentially come back up. Once this threshold is reached, the tablet replicas that lived on the dead node are re-replicated onto other nodes in the cluster to restore the replication factor.
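As a concrete illustration, the threshold is controlled by a YB-TServer flag, `follower_unavailable_considered_failed_sec`, which defaults to 900 seconds (15 minutes). The sketch below raises it to 30 minutes to accommodate a longer maintenance window; the master addresses and data directory are hypothetical placeholders, and you should check the flag against the YugabyteDB docs for your version.

```shell
# Sketch: start a yb-tserver with a 30-minute failure threshold instead of
# the 900-second default. Addresses and paths are placeholders.
./bin/yb-tserver \
  --tserver_master_addrs=master1:7100,master2:7100,master3:7100 \
  --fs_data_dirs=/data/yb \
  --follower_unavailable_considered_failed_sec=1800
```

Raising the threshold trades a longer window of reduced replication for avoiding unnecessary data movement when a node is expected to return shortly.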