
Why a Partitioned Collection Cannot Be Sharded

June 27, 2014 | Posted In: Tokutek, TokuView

In TokuMX 1.5, we introduced partitioned collections for non-sharded clusters. That is, one can have a partitioned collection in a replica set, but one cannot shard a partitioned collection. In this post, I explain why. As I mentioned here, partitioned collections are useful for time-series data where we would like to keep a rolling period […]


TokuMX Compression & Performance Innovation Awards @ MongoDB World 2014

June 25, 2014 | Posted In: Tokutek, TokuView

Today, at MongoDB World 2014 in New York City, we announced the winners of our inaugural TokuMX Compression & Performance Innovation Award. We selected two winners from amongst the nominees – Brightcove and Crowdtap. The announcement was made at the TokuMX Scale-up & Slim-down breakfast meet-up. This morning’s meet-up featured special guests Mark Callaghan of […]


TokuMX @ MongoDB World 2014

June 21, 2014 | Posted In: Tokutek, TokuView

On Monday, some of the Boston-area Tokutek team will board New York bound trains to join our Union Square colleagues at MongoDB World 2014. We couldn’t be more excited. You would be too. TokuMX, our high-performance distribution of MongoDB, just had its first birthday. The rapid acceptance by the MongoDB community has surpassed even our […]


Announcing TokuMX v1.5

June 18, 2014 | Posted In: Tokutek, TokuView

Tokutek is pleased to announce today’s release of TokuMX v1.5. Also worth noting is that TokuMX is exactly 1 year old tomorrow. But enough about birthdays, and more about features! This release brings with it the ability to partition a collection in unsharded TokuMX deployments. Zardosht Kasheff, one of Tokutek’s engineers, did an excellent job […]


Best Practices for Partitioned Collections and Tables in TokuDB and TokuMX

June 13, 2014 | Posted In: Tokutek, TokuView

In my last post, I gave a technical explanation of the performance characteristics of partitioned collections in TokuMX 1.5 (which is right around the corner) and partitioned tables in relational databases. Given those performance characteristics, in this post, I will present some best practices when using this feature in TokuMX or TokuDB. Note that these […]


Understanding the Performance Characteristics of Partitioned Collections

June 10, 2014 | Posted In: Tokutek, TokuView

In TokuMX 1.5, which is right around the corner, the big feature will be partitioned collections. This feature is similar to partitioned tables in Oracle, MySQL, SQL Server, and Postgres. A question many have is “why should I use partitioned tables?” In short, it’s complicated. The answer depends on your workload, your schema, and your […]


June 11 Webinar: Comparing MongoDB and TokuMX Replication

June 5, 2014 | Posted In: Tokutek, TokuView

MongoDB replication has a lot of great features, including crash safety, automatic failover, and parallel slave replication. Although MongoDB’s replication is impressive in many ways, TokuMX™ replication internals are purposely designed differently.

Speaker: Zardosht Kasheff, Sr. Engineer
Date: Wednesday, June 11th
Time: 1pm EDT/10am PDT

TokuMX is built on Fractal Tree® technology, unlike […]


Why TokuMX Changed MongoDB’s Oplog Format for Operations

June 5, 2014 | Posted In: Tokutek, TokuView

Over several posts, I’ve explained the differences between TokuMX replication and MongoDB replication, and why they are completely incompatible. In this (belated) post, I explain one last difference: the oplog format for operations. Specifically, TokuMX and MongoDB log updates and deletes differently. Suppose we have a collection foo, with the following element:

We perform […]


The Difference Between TokuMX Partitioning and Sharding

June 4, 2014 | Posted In: Tokutek, TokuView

In my last post, I described a new feature in TokuMX 1.5—partitioned collections—that’s aimed at making it easier and faster to work with time series data. Feedback from that post made me realize that some users may not immediately understand the differences between partitioning a collection and sharding a collection. In this post, I hope […]
