Find out more

The Distributed SQL Blog

YugabyteDB 1.1 New Feature: Speeding Up Queries with Secondary Indexes

YugabyteDB 1.1 New Feature: Speeding Up Queries with Secondary Indexes

Welcome to another post from our ongoing series where we highlight a new feature from the latest 1.1 release! Today we are going to look at secondary indexes.

Defining Secondary Indexes

A database index is a data structure that improves the speed of data retrieval operations on a database table. Typically, databases are very efficient at looking up data by the primary key. A secondary index can be created using one or more columns of a database table,

Read more

Ravi Murthy: Starting a New Chapter with YugaByte

Ravi Murthy: Starting a New Chapter with YugaByte

In case you missed the announcement, Ravi Murthy has joined YugaByte as our VP of Engineering. Read on to learn more about his experiences leading the teams who managed the explosive growth of applications and data at Facebook, plus what’s next at YugaByte!

After almost 7 years at Facebook, I am super-excited to start in my new role as YugaByte’s VP of Engineering. This post provides a brief history of my career through the years,

Read more

Understanding How YugabyteDB Runs on Kubernetes

Understanding How YugabyteDB Runs on Kubernetes

As we reviewed in “Docker, Kubernetes and the Rise of Cloud Native Databases”, Kubernetes has benefited from rapid adoption to become the de-facto choice for container orchestration. This has happened in a short span of only 4 years since Google open sourced the project in 2014. YugabyteDB’s automated sharding and strongly consistent replication architecture lends itself extremely well to containerized deployments powered by Kubernetes orchestration. In this post we’ll look at the various components involved in getting YugabyteDB up and running as Kubernetes StatefulSets.

Read more

Benchmarking an 18 Terabyte YugabyteDB Cluster with High Density Data Nodes

Benchmarking an 18 Terabyte YugabyteDB Cluster with High Density Data Nodes

For ever-growing data workloads such as time series metrics and IoT sensor events, running a highly dense database cluster where each node stores terabytes of data makes perfect sense from a cost efficiency standpoint. If we are spinning up new data nodes only to get more storage-per-node, then there is a significant wastage of expensive compute resources. However, running multi-terabyte data nodes with Apache Cassandra as well as other Cassandra-compatible databases (such as DataStax Enterprise) is not an option.

Read more

Apache Cassandra DB Architecture Fundamentals

Apache Cassandra DB Architecture Fundamentals

What is the Apache Cassandra Database?

Apache Cassandra is a distributed open source database that can be referred to as a “NoSQL database” or a “wide column store.” Cassandra was originally developed at Facebook to power its “Inbox” feature and was released as an open source project in 2008. Cassandra is designed to handle “big data” workloads by distributing data, reads and writes (eventually) across multiple nodes with no single point of failure.

Read more

Explore Distributed SQL and YugabyteDB in Depth

Discover the future of data management.
Learn at Yugabyte University
Get Started
Browse Yugabyte Docs
Explore docs
PostgreSQL For Cloud Native World
Read for Free