Zardosht.Kasheff, Author at Percona Database Performance Blog

Zardosht.Kasheff

Explaining Ark Part 4: Fixing Majority Write Concern

Aug 07, 2014 | MySQL

This is the fourth post in a series of posts that explains Ark, a consensus algorithm we’ve developed for TokuMX and MongoDB to fix known issues in elections and failover. The tech report we released describes the algorithm in full detail. These posts are a layman’s explanation. This post assumes the reader is familiar with […]

Explaining Ark Part 3: Why Data May Be Lost on a Failover

Jul 30, 2014 | MySQL

This is the third post in a series of posts that explains Ark, a consensus algorithm we’ve developed for TokuMX and MongoDB to fix known issues in elections and failover. The tech report we released last week describes the algorithm in full detail. These posts are a layman’s explanation. In the first post, I discussed […]

Explaining Ark Part 2: How Elections and Failover Currently Work

Jul 25, 2014 | MySQL

This is the second post in a series of posts that explains Ark, a consensus algorithm we’ve developed for TokuMX and MongoDB to fix known issues in elections and failover. The tech report we released last week describes the algorithm in full detail. These posts are a layman’s explanation. In the first post, I described […]

Explaining Ark Part 1: The Basics

Jul 22, 2014 | MySQL

Last week, we introduced Ark, a consensus algorithm similar to Raft and Paxos we’ve developed for TokuMX and MongoDB. The purpose of Ark is to fix known issues in elections and failover. While the tech report detailing Ark explains everything formally, over the next few blog posts, I will go over Ark in layman’s terms. […]

Introducing Ark: A Consensus Algorithm For TokuMX and MongoDB

Jul 18, 2014 | MySQL

Most of the time, our blog posts explain what’s great about the MongoDB improvements we’ve already shipped in TokuMX. Sometimes, though, it’s fun to talk about what’s coming soon, especially when user feedback would really help get the feature right. In my next series of blog posts, I get to geek out and talk about […]

Why a Partitioned Collection Cannot Be Sharded

Jun 27, 2014 | MySQL

In TokuMX 1.5, we introduced partitioned collections for non-sharded clusters. That is, one can have a partitioned collection in a replica set, but one cannot shard a partitioned collection. In this post, I explain why. As I mentioned here, partitioned collections are useful for time-series data where we would like to keep a rolling period […]

Use TokuMX Partitioned Collections in Place of TTL Indexes

Jun 20, 2014 | MySQL

Take the following scenario. You have a time-series data application for which you would like to store a rolling period of data. For example, you may want to maintain the last six months of traffic logs for a website, in order to analyze activity of different periods of time. Or, you have an application maintaining […]

Best Practices for Partitioned Collections and Tables in TokuDB and TokuMX

Jun 13, 2014 | MySQL

In my last post, I gave a technical explanation of the performance characteristics of partitioned collections in TokuMX 1.5 (which is right around the corner) and partitioned tables in relational databases. Given those performance characteristics, in this post, I will present some best practices when using this feature in TokuMX or TokuDB. Note that these […]

Understanding the Performance Characteristics of Partitioned Collections

Jun 10, 2014 | MySQL

In TokuMX 1.5 that is right around the corner, the big feature will be partitioned collections. This feature is similar to partitioned tables in Oracle, MySQL, SQL Server, and Postgres. A question many have is “why should I use partitioned tables?” In short, it’s complicated. The answer depends on your workload, your schema, and your […]

Why TokuMX Changed MongoDB’s Oplog Format for Operations

Jun 05, 2014 | MySQL

Over several posts, I’ve explained the differences between TokuMX replication and MongoDB replication, and why they are completely incompatible. In this (belated) post, I explain one last difference: the oplog format for operations. Specifically, TokuMX and MongoDB log updates and deletes differently. Suppose we have a collection foo, with the following element:


rs0:PRIMARY> db.foo.find()
{ "_id" : 0, "a" : 0, "b" : 0 }

rs0:PRIMARY> db.foo.find()

{ "_id" : 0, "a" : 0, "b" : 0 }

We perform […]

The Difference Between TokuMX Partitioning and Sharding

Jun 04, 2014 | MySQL

In my last post, I described a new feature in TokuMX 1.5—partitioned collections—that’s aimed at making it easier and faster to work with time series data. Feedback from that post made me realize that some users may not immediately understand the differences between partitioning a collection and sharding a collection. In this post, I hope […]

Introducing Partitioned Collections for MongoDB Applications

May 29, 2014 | MySQL

TokuMX 1.5 is around the corner. The big feature will be something we discussed briefly when talking about replication changes in 1.4: partitioned collections. Before introducing the feature, I wanted to mention the following. Although TokuMX 1.5 is not available as of this writing, we would love to hear feedback on partitioned collections, which we […]

How TokuMX Secondaries Work in Replication

Apr 15, 2014 | MySQL

As I’ve mentioned in previous posts, TokuMX replication differs quite a bit from MongoDB’s replication. The differences are large enough such that we’ve completely redone some of MongoDB’s existing algorithms. One such area is how secondaries apply oplog data from a primary. In this post, I’ll explain how. In designing how secondaries apply oplog data, […]

On TokuMX Oplog, Tailable Cursors, and Concurrency

Apr 02, 2014 | MySQL

In a post last week, I described the difference in concurrency behavior between MongoDB’s oplog and TokuMX’s oplog. In short, here are the key differences: MongoDB protects access to the oplog with a database level reader/writer lock, whereas TokuMX does not. TokuMX can write data to the oplog concurrently, whereas MongoDB cannot. As a result, […]

On TokuMX (and MongoDB) Replication and Transactions

Mar 28, 2014 | MySQL

In my last post, I describe the differences between a TokuMX oplog entry and a MongoDB oplog entry. One reason why the entries are so different is that TokuMX supports multi-statement and multi-document transactions. In this post, I want to elaborate on why multi-statement transactions cause changes to the oplog, and explain how we changed […]

Comparing a TokuMX and MongoDB Oplog Entry

Mar 26, 2014 | MySQL

As I mentioned in my last post, TokuMX replication is completely incompatible with MongoDB replication. Replica sets (and sharded clusters, but that is for another blog) must be either entirely TokuMX or entirely MongoDB. This is by design. While elections and failover are basically the same, we have completely changed the oplog protocol. In the […]

Why TokuMX Replication Differs from MongoDB Replication

Mar 24, 2014 | MySQL

MongoDB replication has some great features. As I discussed in my last post, MongoDB’s crash safety design is very elegant. In addition to that, MongoDB has automatic failover, parallel slave replication, and prefetch threads on secondaries. The latter, as Mark Callaghan points out, is similar to “InnoDB fake changes”, a feature that has helped Facebook […]

My Favorite MongoDB Replication Feature: Crash Safety

Mar 18, 2014 | MySQL

At an extremely high level, replication in MongoDB and MySQL are similar. Both databases have exactly one machine, the primary (or master), that accepts writes from clients. With a single transaction (or atomic operation, in MongoDB’s case), the tables and oplog (or binary log in MySQL) are modified to reflect the change. The log captures […]

Migrating off MongoDB? Try TokuMX First

Feb 12, 2014 | MySQL

Patrick McFadin recently wrote a post titled “MongoDB. This is not the database you are looking for”, where he mentions “In the past year there has been a rising chorus of users stuck on a cliff with MongoDB and are desperate to get out.” The key theme seems to be that MongoDB may run into […]

How TokuMX was Born

Feb 11, 2014 | MySQL

With TokuMX 1.4 coming out soon, with (teaser) wonderful improvements made to sharding and updates (and plenty of other goodies), I’ve recently reminisced about how we got TokuMX to this point. We (actually, really John) started dabbling with integrating Fractal Tree® indexes into MongoDB in the summer of 2012, where we (really, he) prototyped using […]

MySQL 5.7
Support

Compare Percona to Leading Database Solutions

Software
Downloads

Valkey Contribution

Product Documentation

Resource Hub

Why Percona for MongoDB?

Why Percona for PostgreSQL?

Percona Blog

Percona Community Hub

Percona Events Hub

About Percona

Percona in the News

Our Customers

Our Partners

Careers

Contact Us