The Distributed SQL Blog

Thoughts on distributed databases, open source, and cloud native

Distributed SQL Tips and Tricks – February 15, 2022

Welcome back to our tips and tricks blog! I have the pleasure of recapping distributed SQL questions from around the Internet.

This blog series would not be possible without Dorian Hoxha, Franck Pachot, and Frits Hoogland. We also thank our incredible user community for not being afraid to ask questions.

Do you have questions? Make sure to ask them on our YugabyteDB Slack channel,

Read more

TPC-C Benchmark: Scaling YugabyteDB to 100,000 Warehouses

We’re excited to announce a new milestone in our TPC-C benchmark journey! YugabyteDB can now scale up to 100,000 warehouses with an efficiency of 99.78%. More specifically, this results in 630,000 operations per second on the database cluster, observed over a two-hour period.

For those new to TPC-C, it’s an OLTP system benchmarking tool used to measure performance when handling transactions generated by a real-world OLTP application. It models a business that has a warehouse,

Read more

YugabyteDB and Apache Superset: Explore and Visualize Open Source Data

Enterprises today collect and store vast amounts of data. However, realizing business value from this data requires modern data exploration and visualization tools.

Apache Superset is a widely popular, open source data exploration and visualization platform used and supported by enterprises such as Airbnb, Lyft, and Nielsen. More specifically, Superset enables users to quickly and easily explore and visualize large amounts of data stored in YugabyteDB. As a result,

Read more

Linux Performance Tuning: Dealing with Memory and Disk IO

Linux is a general-purpose operating system. This means it is designed to do the right thing in the general case, rather than having specific code paths that are right for one specific task and potentially wrong for others.

Unlike operating systems such as HP-UX (dbc_min_pct, dbc_max_pct) or AIX (minperm%, maxperm%), Linux does not have tunable parameters for reserving memory for caching disk pages (the page cache). Instead,

Read more

Securing YugabyteDB: Part 1 – Server-to-Server Encryption in Transit

Encryption in transit is a common requirement for client-to-server communication. It is particularly important for YugabyteDB, a 100% open source, distributed SQL database built to accelerate cloud native agility, because YugabyteDB typically stores an organization’s important user and customer data.

In this ongoing blog series, we take a look at the different aspects of encryption in transit for YugabyteDB. This first post will focus on encryption in transit for the database’s internal RPC communication protocol,

Read more

A Matter of Time: Evolving Clock Sync for Distributed Databases

Distributed clock synchronization is critical for many applications, including distributed SQL databases. Clock synchronization needs to keep up with the other demands in our modern infrastructure, such as:

  • Applications that have increasing performance requirements while distributing data across different geographic regions
  • Network infrastructure and computing power that are improving constantly

A distributed SQL database is highly available and resilient to failures when deployed across a cluster of nodes.

Read more

My Yugabyte Journey: From Intern to Full-Time Software Engineer

Hello. My name is Tim Elgersma, and I’m a software engineering intern on the YSQL team at Yugabyte. I have one semester left in my bachelor of Computer Science program at the University of Waterloo. In this blog post, I’d like to talk about my experience interning here over the past several months, and why I’m excited to join the company full time upon graduation.

Adding tablespaces to tablegroups

My onboarding at Yugabyte went pretty smoothly.

Read more

PostgreSQL Timestamps and Timezones: How to Navigate the Interval Minefield

This is the second of a two-part blog post series about the date-time data types that PostgreSQL, and therefore YSQL, support. The first part dealt with the basic business of representing moments (when things happen). The relevant data types here are time, date, and timestamp, where the latter has a without time zone and a with time zone variant.

Read more

PostgreSQL Timestamps and Timezones: What You Need to Know—and What You Don’t

Anecdotal reports indicate that some PostgreSQL programmers are daunted by the date and time data types, and by how operations that use values of these data types might be affected by the session’s timezone setting. Even experienced developers struggle when they first embark on a critical project that relies on this functionality. YugabyteDB’s YSQL subsystem gives the application developer the same experience as PostgreSQL. So some YSQL users will find the topic challenging, too.

I recently completed a careful and exhaustive study of the topic so that I could write it all up in YugabyteDB’s YSQL documentation.

Read more

YugabyteDB Integrates with Arctype SQL Client

PostgreSQL has eaten the world. And so Yugabyte set out to build the most Postgres-compatible, scalable, and resilient database. YugabyteDB is not only wire compatible with PostgreSQL but also code compatible, because it reuses the upper half of PostgreSQL. Thanks to this compatibility, YugabyteDB can connect to the vast majority of database tools that integrate with Postgres.

In a previous blog post, Yugabyte Developer Advocate Franck Pachot showed how to connect to YugabyteDB from Arctype.

Read more