Author Archives: Martin Farach-Colton

Elephants on a Trapeze: Keeping Big Data Agile

Posted on by Martin Farach-Colton

On April 1st, the Department of Computer Science at Rutgers University, where I am a professor, held an open house. I gave a talk called “Elephants on a Trapeze: Keeping Big Data Agile”.
The talk is an introduction to…

Leave a comment

OldSQL Tricks or NewSQL Treats

Posted on by Martin Farach-Colton

Why do B-trees need “Tricks” to work?
Marko Mäkelä recently posted a couple of “tips and tricks” you can use to improve InnoDB performance. Tips and tricks. A general purpose relational database like MySQL shouldn’t need “tips and tricks”…

1 Comment

Hot Column Addition and Deletion Part II: How it...

Posted on by Martin Farach-Colton

Hot Column Addition and Deletion (HCAD)
In the previous HCAD post, I described HCAD and showed that it can reduce the downtime of column addition (or deletion) from 18 hours to 3 seconds. In fact, the downtime of InnoDB…

4 Comments

Hot Indexing Part I: New Feature

Posted on by Martin Farach-Colton

From 31 minutes to 2 seconds
Hot Indexing Overview
TokuDB v5.0 introduces several features that are new to the MySQL world. Recently, we posted on HCAD: Hot Column addition and Deletion. In this post, we talk about Hot…

2 Comments

Hot Column Addition and Deletion Part I –...

Posted on by Martin Farach-Colton

From 18 hours to 3 seconds!
Hot Column Addition and Deletion (HCAD) Overview
TokuDB v5.0 introduces several features that are new to the MySQL world. In this series of posts, we’re going to present some information on these features:…

5 Comments

MySQL Partitioning: A Flow Chart

Posted on by Martin Farach-Colton

In Part 1, and Part 2 of this series, I presented some thoughts on partitioning. I heard some great feedback on why people use partitioning. Here, I present a flow chart that summarizes what I’ve learned. In summary:…

7 Comments

Partitioning, Free Lunches, & Indexing, Part...

Posted on by Martin Farach-Colton

Review
In part one, I presented a very brief and particular view of partitioning. I covered what partitioning is, with hardly a mention of why one would use partitioning. In this post, I’ll talk about a few use cases…

8 Comments

Partitioning, Free Lunches, and Indexing

Posted on by Martin Farach-Colton

Why partition?
Partitioning is a commonly touted method for achieving performance in MySQL and other databases. (See here, here, here and many other examples.) I started wondering where the performance from partitions comes from, and I’ve summarized…

5 Comments

Avoiding Fragmentation with Fractal Trees

Posted on by Martin Farach-Colton

Summary
B-trees suffer from fragmentation. Fragmentation causes headaches — in query performance and space used. Solutions, like dump and reload or OPTIMIZE TABLE are a pain and not always effective. Fractal trees don’t fragment. So if fragmentation is a problem,…

9 Comments

Announcing TokuDB v4.1

Posted on by Martin Farach-Colton

Tokutek is pleased to announce immediate availability of TokuDB for MySQL, version 4.1. It is designed for continuous querying and analysis of large volumes of rapidly arriving and changing data, while maintaining full ACID properties.
New in TokuDB v4.1…

Leave a comment