Percona Live: Data Performance Conference 2016 Logo

April 18-21, 2016

Santa Clara, California

Massively Distributed Backup at Facebook Scale

Massively Distributed Backup at Facebook Scale

 20 April 04:30 PM - 05:20 PM @ Ballroom G
Experience level: 
Beginner
Duration: 
50 minutes conference
Tracks:
Operations and Management
Topics:
MySQL
Python
Data in the Cloud

Description

MySQL is at the core of Facebook's persistent storage. The graph itself, including every like, comment, post and status, is stored in MySQL, along with many other things. This data is the company's most important asset, and we take great care to make sure everything is properly backed up. Yes, even the lol cats and puppy picture posts. Everything! As you can imagine, backing up this behemoth of a dataset is quite a challenge. The backup system Facebook runs for MySQL is multi-tiered and massively distributed. We employ binary log, full, and differential backups and clever hacks to balance speed, space and reliability. In this talk you'll learn how we backup Facebook, every single day. We'll go over the design, engineering and operational challenges we've had to overcome, and wrap up with some fun war stories.

Speakers

Dan Reif's picture

Dan Reif

Production Engineer, Facebook

Biography:

I am Dan.

Shlomo Priymak's picture

Shlomo Priymak

MySQL Database Engineer, Facebook

Biography:

Shlomo has been on the MySQL Infrastructure team at Facebook since 2011, managing one of the biggest MySQL clusters in the world — mostly by being lazy and making automation manage it instead of him. Before making the switch to manage 1000s of MySQL servers, Shlomo was pretty happy with 100s of them at companies like Sears and Wix, where he was a DBA and a developer. Prior to diving into to the MySQL world in 2006, Shlomo used to be a SQL Server DBA at the Israeli Intelligence Corps, but he can't tell you how many servers he managed there.

Share this talk