The Distributed SQL Blog

Thoughts on distributed databases, open source, and cloud native

Working with PostgreSQL Data Types in YugabyteDB

In the world of databases, data types restrict what can be considered valid values in a table’s column. For example, if we want a column to store only integer values, we can specify that the column be an int column. Enforcing what type of data can go into a column has the added benefit of helping with storage and, in some cases, query performance.

SQL data types can be broadly divided into the following categories.

Read more

YugabyteDB Community Engineering Update, Tricks and Tips – Oct 18, 2019

Welcome to this week’s community update, where we recap a few interesting questions that have popped up in the last week or so on the YugabyteDB Slack channel, the Forum, GitHub, or Stack Overflow. We’ll also review upcoming events, new blogs, and documentation that have been published since the last update. OK, let’s dive right in:

Running yb-ctl status does not give node information


AndrewLiuRM over on the forums asked why the yb-ctl command doesn’t give node information when issued after a manual deployment onto a Docker container.

Read more

9 Techniques to Build Cloud-Native, Geo-Distributed SQL Apps with Low Latency

This post is an in-depth look at the techniques that applications needing low latency and high availability can use with a geo-distributed SQL database like YugabyteDB to minimize the negative impact of a high-latency, unreliable Wide Area Network (WAN).

Geo-Distributed SQL is the Future of RDBMS

Enterprises are increasingly moving to cloud-native applications powered by microservices architecture. These applications run on elastic cloud infrastructure such as serverless frameworks and containers.

Read more

How to: PostgreSQL Fuzzy String Matching In YugabyteDB

Before analyzing a large dataset that contains textual information, it’s important to scrub it and eliminate duplicates when necessary. To remove duplicates, you may need to compare strings that refer to the same thing but are written slightly differently, contain typos, or are misspelled. Alternatively, you might need to join two tables on a column (let’s say company name) whose values appear slightly different in each table.

Fuzzy String Matching (or Approximate String Matching) is the process of finding strings that approximately match a pattern.
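To make the idea of an approximate match concrete, here is a minimal sketch in Python. It assumes Levenshtein edit distance as the similarity measure, which is one common choice and not necessarily the method the post itself uses. The function names `levenshtein` and `fuzzy_matches`, and the `max_distance` threshold, are hypothetical and chosen only for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, or substitutions needed to turn a into b."""
    # Keep the shorter string as b so each row stays as small as possible.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty prefix of b
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def fuzzy_matches(pattern: str, candidates: list[str], max_distance: int = 2) -> list[str]:
    """Return the candidates within max_distance edits of pattern, ignoring case.

    max_distance is an illustrative threshold, not a recommended value.
    """
    needle = pattern.casefold()
    return [c for c in candidates if levenshtein(needle, c.casefold()) <= max_distance]
```

With a threshold of 2, a misspelling such as "Yugabite" would match the pattern "Yugabyte", while an unrelated name would not. A small threshold keeps false positives down, though the right value depends on how long and how noisy the strings are.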

Read more

YugabyteDB Engineering Update – Oct 14, 2019

We are pleased to announce that YugabyteDB 2.0.1 is live! You can read the release notes of this and previous versions here. This release is shipping with 36 new enhancements and bug fixes.

What’s YugabyteDB? It is an open source, high-performance distributed SQL database built on a scalable and fault-tolerant design inspired by Google Spanner. Yugabyte’s SQL API (YSQL) and drivers are PostgreSQL wire compatible.

[#1851] YSQL: Support for Pushdown of Aggregate Queries

With the YCQL API,

Read more

2019 Distributed SQL Summit Recap and Highlights

Well, that’s a wrap! Yugabyte would like to extend a special thanks to JD and Amanda from the team, and to all the speakers from Facebook, Google, Amazon, Pivotal, Salesforce, Narvar, Plume Design and others that presented at the first-ever Distributed SQL Summit on Sept 20, 2019.

If you couldn’t make it out to this year’s event, have no fear!

Read more

Recapping My Internship at Yugabyte – Jayden Navarro

It was a warm day in early October, and two large white tents occupied the lawn that sits between the Gates, Hewlett, and Packard buildings. Companies pasted the word “AI” in big bold letters across their banners, and students formed long lines, resumes in hand, eager to learn about the Next Big Thing and how often the company cafeteria serves Poké.

Six months prior I had made the decision to leave my comfortable job as a Software Engineer in the networking industry and pursue a Master’s degree in Computer Science at Stanford,

Read more

Getting Started with PostgreSQL Triggers in a Distributed SQL Database

Triggers are a basic feature that monolithic SQL systems like Oracle, SQL Server, and PostgreSQL have supported for many years. They are useful in a variety of scenarios, ranging from simple audit logging to advanced tasks like updating remote databases in a federated cluster. In this blog, we’ll look at examples of INSERT, UPDATE and INSTEAD OF triggers in Yugabyte DB.

What’s Yugabyte DB? It is an open source, high-performance distributed SQL database built on a scalable and fault-tolerant design inspired by Google Spanner.

Read more

The Effect of Isolation Levels on Distributed SQL Performance Benchmarking

This post addresses a concern raised about a benchmarking result we recently published comparing the performance of YugabyteDB, Amazon Aurora and CockroachDB. It was pointed out that we unfairly used the default isolation level for each database rather than using the serializable isolation level in all databases (even though the serializable level was not required for these workloads). We are also happy to share additional results with the workloads run at YugabyteDB’s serializable isolation level.

Read more

Announcing YugabyteDB 2.0 GA: Jepsen Tested, High-Performance Distributed SQL

We are excited to announce the general availability of YugabyteDB 2.0! The highlight of this release is that it delivers production readiness for Yugabyte SQL (YSQL), our high-performance, fully-relational distributed SQL API. For those of you new to distributed SQL, YugabyteDB is a Google Spanner-inspired, cloud-native distributed SQL database that is 100% open source. It puts a premium on high performance, data resilience, and geographic distribution while ensuring PostgreSQL compatibility.

Read more
