TokuView

An Updated Description of Clustering Keys for TokuDB

Posted On August 6, 2014 | By Rich Prohaska | 0 comments

Covering indexes can result in orders of magnitude performance improvements for queries. Bradley's presentation on covering indexes describes what a covering index is, how it can effect performance, and why it works. However, the definition of a covering index can get cumbersome since MySQL limits the number of columns in a…

New TokuMX Meetup Groups Formed

Posted On August 1, 2014 | By Dave Rosenlund | 0 comments

Tokutek is pleased to announce the formation of two new TokuMX Bay Area Meetup Groups, as well as each group's inaugural meet up event. These groups are for anyone who's interested in delivering high-performance big data applications built with the world's most popular NoSQL solution, MongoDB. Join Tokutek team members,…

Explaining Ark Part 3: Why Data May Be Lost on a Failover

Posted On July 30, 2014 | By Zardosht Kasheff | 0 comments

This is the third post in a series of posts that explains Ark, a consensus algorithm we’ve developed for TokuMX and MongoDB to fix known issues in elections and failover. The tech report we released last week describes the algorithm in full detail. These posts are a layman’s explanation. In…

Explaining Ark Part 2: How Elections and Failover Currently Work

Posted On July 25, 2014 | By Zardosht Kasheff | 0 comments

This is the second post in a series of posts that explains Ark, a consensus algorithm we’ve developed for TokuMX and MongoDB to fix known issues in elections and failover. The tech report we released last week describes the algorithm in full detail. These posts are a layman’s explanation. In…

Explaining Ark Part 1: The Basics

Posted On July 22, 2014 | By Zardosht Kasheff | 0 comments

Last week, we introduced Ark, a consensus algorithm similar to Raft and Paxos we’ve developed for TokuMX and MongoDB. The purpose of Ark is to fix known issues in elections and failover. While the tech report detailing Ark explains everything formally, over the next few blog posts, I will go…

Introducing Ark: A Consensus Algorithm For TokuMX and MongoDB

Posted On July 18, 2014 | By Zardosht Kasheff | 4 comments

Most of the time, our blog posts explain what’s great about the MongoDB improvements we've already shipped in TokuMX. Sometimes, though, it’s fun to talk about what’s coming soon, especially when user feedback would really help get the feature right. In my next series of blog posts, I get to…

Why a Partitioned Collection Cannot Be Sharded

Posted On June 27, 2014 | By Zardosht Kasheff | 0 comments

In TokuMX 1.5, we introduced partitioned collections for non-sharded clusters. That is, one can have a partitioned collection in a replica set, but one cannot shard a partitioned collection. In this post, I explain why. As I mentioned here, partitioned collections are useful for time-series data where we would like…

TokuMX Compression & Performance Innovation Awards @ MongoDB World 2014

Posted On June 25, 2014 | By Dave Rosenlund | 0 comments

Today, at MongoDB World 2014 in New York City, we announced the winners of our inaugural TokuMX Compression & Performance Innovation Award. We selected two winners from amongst the nominees – Brightcove and Crowdtap. The announcement was made at the TokuMX Scale-up & Slim-down breakfast meet-up. This morning's meet-up featured…

TokuMX @ MongoDB World 2014

Posted On June 21, 2014 | By Dave Rosenlund | 0 comments

On Monday, some of the Boston-area Tokutek team will board New York bound trains to join our Union Square colleagues at MongoDB World 2014. We couldn't be more excited. You would be too. TokuMX, our high-performance distribution of MongoDB, just had its first birthday. The rapid acceptance by the MongoDB…

Use TokuMX Partitioned Collections in Place of TTL Indexes

Posted On June 20, 2014 | By Zardosht Kasheff | 0 comments

Take the following scenario. You have a time-series data application for which you would like to store a rolling period of data. For example, you may want to maintain the last six months of traffic logs for a website, in order to analyze activity of different periods of time. Or,…