Percona Live 2017 Open Source Database Conference

April 24 - 27, 2017

Santa Clara, California

Slides from Percona Live

Time-series databases are the fastest growing category in databases today. We even held a keynote panel discussing some of the options earlier in the conference. But what exactly is "time-series data"? And why do we need a special database to handle it?
ScaleDB is a clustered, distributed storage engine for MySQL and MariaDB, optimised for high volume, high velocity structured data. The engine primarily serves streaming data workloads, although the core has also a locking mechanism designed for OLTP workloads. The objective of this talk is to introduce the functionalities of ScaleDB as a Storage Engine for MariaDB, features, architecture, practical use cases and performance tests...
Clients require strong security measures for PCI, HIPPA, or PHI where data managed by MySQL should be encrypted "at rest." InnoDB’s built-in encryption features work, but there are some caveats to that solution. In this talk, you'll see how to encrypt your entire disk to protect everything from data, redo logs, and binary logs.
Amazon Relational Database Service (Amazon RDS) makes it easy to set up, operate, and scale a relational database in the cloud. It provides cost-efficient and resizable capacity while managing time-consuming database administration tasks, freeing you to focus on your applications and business. This talk will provide guidance and tips for optimizing MySQL-compatible workloads on RDS.
In this talk, I will introduce Airbnb's change data change system, Spinaltap. I will briefly cover its design and then focus on various use cases inside Airbnb. These use cases cover both online serving production serving and offline large distributed batch processing.
Amazon Relational Database Service (Amazon RDS) makes it easy to set up, operate, and scale a relational database in the cloud. It provides cost-efficient and resizable capacity while managing time-consuming database administration tasks, freeing you to focus on your applications and business. This talk will provide guidance and tips for optimizing MySQL-compatible workloads on RDS.
Today everything is instrumented, generating more and more time-series data streams that need to be monitored and analyzed. When it comes to storing this data, many developers often start with some well-trusted system like PostgreSQL, but when their data hits a certain scale, give up its query power and ecosystem by migrating to some NoSQL or other "modern" time-series architecture. They face the traditional trade-off: query...
Join Peter Zaitsev, CEO of Percona, as he discusses the growth and adoption of open source databases and Percona’s commitment to remaining an unbiased champion of the open source database ecosystem. He will also provide updates and highlight exciting new developments in Percona Server software for MySQL and MongoDB.
Recently, graph data is emerging everywhere including manufacturing industries, biological databases, heath cares, network device information, fraud detection, IOT, logistics and social networks. Traditionally, graph data is considered to be hard to process in relational databases because they have highly connected relationships. But to prepare the evolutionary connected data era, industries have to be able to make their data more...
Mariella Di Giacomo - Successful stories around MySQL and MariaDB Multi-Source Replication Manjot Singh - What is Sharding Michael Wang - Use slow logs to collect unique queries and their performance continuously Jean-François Gagné - The two little bugs that almost brought down Booking.com
EVCache is an open source, fast, distributed cache. It is used at Netflix as a cache as well as a data store. EVCache powers user-facing applications, personalization algorithms or various micro-services. Until recently, EVCache stored multiple terabytes of data in RAM with multiple copies of the data for resiliency. With the continuing growth of data, this became a costly proposition. In this talk, we'll mainly cover how we used...
Learn how to boost performance 1,000x with in-memory store of petabytes of data for your MySQL-based applications. Apache Ignite is an in-memory computing platform that is built on top of a distributed key-value store and provides full-fledged SQL support. Nikita will show how Apache Ignite handles auto-loading of SQL schema and data, SQL indexes, compound indexes support, and various forms of SQL queries including distributed SQL joins...
Amazon Aurora is a fully managed relational database engine that combines the speed and availability of high-end commercial databases with the simplicity and cost-effectiveness of open source databases. It is purpose-built for the cloud using a new architectural model and distributed systems techniques to provide far higher performance, availability and durability than previously possible using conventional monolithic database...
Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence tools. How we got there and what features are most important to our customers is another example of the Amazon Flywheel. In this talk, I am going to explain the main architecture of Amazon Redshift and a peak into the future of Cloud BI
How AWS Database Migration Service enables database freedom Database freedom is a powerful thing. If you look at how companies have been locked into places with their database providers for a number of years, you can understand the trepidation. Those proprietary offerings are expensive with punitive licensing and auditing terms. Since its launch in March 2016, AWS Database Migration Service (DMS) and AWS Schema Conversion Tool (SCT) have...
Join Jean-François Gagné as he presents a metric for observing Booking.com’s system health. This metric is not technical (not MySQL or Linux related) but it is one of the most important metric Booking.com has to detect a problem on the website. Many things impact this metric, including the time of the day, the day of the week or the season of the year. Moreover, some external events (political and cultural) also have visible effect on the...
The MySQL marketplace has quite a few High-Availability (HA) solutions such as Continuent Tungsten, Galera/XtraDB Cluster, various MySQL script/patch solutions, and RDS with multi-zone HA. Providing seamless, automatic failover with zero down-time maintenance in one data center can be challenging, but extending that same functionality across multiple sites in different continents truly makes that a difficult goal to achieve. Continuent has...
In this session we'll look at Tungsten Replicator, which enables real-time and efficient replication of data from your transactional database. We'll focus on the filtering side, for massaging your data before/during replication. During this session we will cover various solutions used by our customers, some of the complex deployment models, and how that information can be modified as part of the load into analytics targets. We...
VividCortex monitors lots of production database servers, which means we get to see lots of different database problems. One specific type of problem that we like to focus on is database stalls. We define stalls as short periods of time, typically one second, when work isn’t getting done. It’s easy to see when a database isn’t performing its work as usual, but trying to find the cause is much more difficult. I’ll talk about what kinds of...
I’ve been learning about database internals by learning the theory, seeing how other systems have implemented ideas, and trying to implement them myself. Of course, it’s not always easy, and I’m not making anything that compares to the scale of something like MySQL or MongoDB, but reinventing a few small wheels here and there has given me a better understanding of how bigger systems work. I’ll talk about how some fundamental design choices...