Percona Live 2017 Open Source Database Conference

April 24 - 27, 2017

Santa Clara, California

Slides from Percona Live

The world of technology is undergoing a rapid and permanent shift. We all know that we're moving to the cloud at an accelerating pace, but that's not the real news. The fundamental change that's taking place today is that our applications are far more distributed than they've ever been, in nearly invisible ways. And that's not good, because invisible means unmeasurable. This is how jaw-dropping performance problems...
Join Tomas Ulin, VP of MySQL Engineering at Oracle, to get the big picture overview of MySQL 8.0 development priorities, features that have already been delivered, and what is coming in upcoming releases.
The traditional open source infrastructure business model relied on the backing company providing either support and professional services or closed source tools to provide additional functionality like production tooling, enhanced security and monitoring. For a time this model was good and it worked. However, since the larger cloud vendors like AWS, Azure, and Google have started offering database as a service, those models are falling...
SQLite is the SQL database engine that runs the world. It is a critical component in all your devices. There are more instances of SQLite running today than all other database engines combined. This talk reviews the features, capabilities, limitations, and usage patterns of SQLite and asks why you are not using SQLite more yourself.
At Percona we see time series databases as a trend of 2017, hence the idea of having quick 5-minute lightning talks from projects that we think are stellar, followed by a quick panel on time series data stores in general. We discuss the "why a new time series data store" question, why these are ideal solutions, users, as well as where they see the future of this space.
Two years ago Continuent was acquired by VMware as the best-of-breed data services company to build their DBaaS solution. Changes in those plans led Continuent to be spun off back into an independent company. Continuent has been around since 2004, and our customers runs billions of dollars worth of business on our solutions. We have asked select Continuent customers join us to tell about how they use our multi-site/multi-master and...
Peter Zaitsev welcomes everyone to Percona Live Open Source Database Conference 2017
Since 5.7.2, MySQL implements parallel replication in the same schema, also known as LOGICAL_CLOCK (DATABASE based parallel replication is also implemented in 5.6 but this is not covered in this talk). In early 5.7 versions, parallel replication was based on group commit (like MariaDB) and 5.7.6 changed that to intervals. Intervals are more complicated but they are also more powerful. In this talk, I will explain in detail how they...
In my talk I will share with you our solution to the performance problem of MySQL. We were using MySQL and HandlerSocket over 5 years and at some point we decided to migrate from 5.6 to 5.7. The problem was that there was no HandlerSocket in 5.7. The other problem was that we could not use MySQL without HandlerSocket because MySQL alone didn’t stand our heavy workload (10-50K requests per second). We decided to solve it in a very...
Disaster recovery is perhaps the most ignored aspect of managing PostgreSQL in production systems, and its value is only realized in a moment of crisis. If not done right, absence of backup and recovery strategies can make one repent for days, if not months after a disaster. Majority of production systems only have at most one defined backup and recovery strategy, not realizing the benefits of using a hybrid solution or multiple types of...
This short talk will be about an incident that kept DBAs working on a weekend. Two bugs, one in our application code and one in the database, joined force and almost brought down Booking.com. And this occurred at one of the worst possible times. Curious about what happened: come to this talk to learn more.
ClickHouse is more than just a great «House» for analytics. It can also be used as data storage for Graphite. So, I will explain to you why we decided to try ClickHouse as time-series storage for Graphite despite other existing options (Whisper, Ceres, etc.) And we loved it. Still ClickHouse couldn’t provide for all needs of Graphite (for example, metric search). Thus, we developed Graphouse, an application we have recently opensourced...
There are many valid reasons to want your MongoDB data to interact with the relational/SQL ecosystem: the humongous amount of tools available; the ability to use the SQL query language; or the need for really fast aggregate queries. However, this process is not easy: it requires designing the schema, which is ever changing; and a batch ETL process, which might require external tools. Enter ToroDB Stampede: a secondary node for your...
Hawkular Metrics is a scalable, long-term, high performance storage engine for metric data. This session is an overview of the project that includes: history of the project, an overview of the Hawkular ecosystem, technical details of the project, developer features & APIs, and third party integrations.
Elasticsearch is a distributed, RESTful search and analytics engine. This workshop will cover foundational principles of search, and building a working search-engine application from scratch with the open source Elastic Stack (Elasticsearch, Logstash, Kibana, and Beats).
Traditionally, many mobile and web developers consider using stored procedures as a very old and obvious anti-pattern. But time goes on, Open Source (R)DBMSs become stronger, and it is a good moment to look at stored procedures and triggers at different angle. There are certain cases when server programming right inside the DBMS might be much better rather than developing code and deploing it to the "application" servers....
In December 2016, the health and performance monitoring team at Facebook open sourced our in-memory time series database: Beringei. Beringei is different from other in-memory systems, such as memcache, because it has been heavily optimized for storing time series data used specifically for health and performance monitoring. We optimized Beringei to have a very high write rate and low read latency, while being as efficient as possible in...
We are witnessing the advent of all-flash era. As pure read/write block device, flash SSD simply outperforms harddisk in terms of IOPS and cost-effectiveness. But this is not the whole story. As IO device, flash SSD has many different characteristics: to name a few, asymmetric read/write performance, abundant internal parallelism and an indirect address mapping layer, called FTL. We (SKKU MyFlashSQL Team) have been optimizing MySQL by...
"It's just a log, right?" How complicated could it be, how hard does it get? Basically the easiest thing in computer science, right? Wrong. Logs can impact your reliability, performance and quality of sleep in a million ways small and large. In this session we'll cover some of the lessons every engineer should know (and often learns the hard way), such as why good logging solutions are so expensive, why treating...
The storage subsystem has undergone tremendous innovation in order to keep up with the ever-increasing demand for throughput. Non Volatile Memory Express (NVMe) based solid state devices are the latest development in this domain, delivering unprecedented performance in terms of latency and peak bandwidth. NVMe drives are expected to be particularly beneficial for I/O intensive applications, with databases being one of the prominent use-...