Percona Live 2017 Open Source Database Conference

April 24 - 27, 2017

Santa Clara, California

Schedule - Day 1 | Tuesday 25 April 2017

Sponsor Percona Live Open Source Database Conference

Filter by Topics

Filter by Tracks

Tuesday 25 April 2017
  • 9:00am to 9:15am
    None
    • Business / Case Studies
    Peter Zaitsev (Percona)
    Peter Zaitsev welcomes everyone to Percona Live Open Source Database Conference 2017
    MySQL, Fundamentals, Business Intelligence
  • 9:20am to 9:45am
    None
    • Business / Case Studies
    Eero Teerikorpi (Continuent)
    Two years ago Continuent was acquired by VMware as the best-of-breed data services company to build their DBaaS solution. Changes in those plans led Continuent to be spun off back into an independent company. Continuent has been around since 2004, and our customers runs billions of dollars worth of business on our solutions. We have asked select Continuent customers join us to tell about how they use our multi-site/multi-master and advanced replication solutions.
    Fundamentals
  • 9:50am to 10:35am
    None
    • Business / Case Studies
    Paul Dix (InfluxData), Björn Rabenstein (SoundCloud Ltd.), Tal Levy (Elastic Search), Peter Zaitsev (Percona), Justin Teller (-)
    At Percona we see time series databases as a trend of 2017, hence the idea of having quick 5-minute lightning talks from projects that we think are stellar, followed by a quick panel on time series data stores in general. We discuss the "why a new time series data store" question, why these are ideal solutions, users, as well as where they see the future of this space.
    Fundamentals, Data as a Service, Data in the Cloud
  • 10:35am to 10:50am
    None
    • Developer
    Richard Hipp (SQLite.org)
    SQLite is the SQL database engine that runs the world. It is a critical component in all your devices. There are more instances of SQLite running today than all other database engines combined. This talk reviews the features, capabilities, limitations, and usage patterns of SQLite and asks why you are not using SQLite more yourself.
    Other OSDB

11:30am to 12:20pm

  • 11:30am to 12:20pm
    Ballroom A
    • Operations
    • Developer
    Sveta Smirnova (Percona)
    Performance Schema in version 5.6, released in February, 2013, is really powerful tool which can help DBA to find why every trickiest issue started to occur. Version 5.7 introduces even more instruments and tables. And while all these give you great power, you can stuck choosing which instrument to use.
    MySQL, Optimization, Tools and Techniques
  • 11:30am to 12:20pm
    Ballroom B
    • Wildcard
    John Schulz (The Pythian Group Inc)
    For many years the B-Tree has reigned supreme as the data structure of choice for most databases. There have always been a few other data structures used as well but they were generally not the way most data was stored. In the past few years a bewildering array of new storage structures seem to have appeared. Why have these alternatives come to be? What can they do for us? Each data structure has its strengths and optimizations for certain kinds of workload.
    Fundamentals
  • 11:30am to 12:20pm
    Ballroom C
    • Developer
    Geir Høydalsvik (Oracle)
    In this session, I will describe the new features that have all ready been announced for MySQL 8.0. In addition to Data Dictionnary, CTEs and windows function the session is covering: * Move to utf8(mb4) as MySQL's default character set * Language specific case insensitive collation for 21 languages (utf8) * Invisible index * Descending indexes * Improve usability of UUID and IPV6 manipulations * SQL roles * SET PERSIST for global variable values * Performance Schema, instrumenting data locks * Performance Schema, instrumenting error messages * Improved cost model with his
    MySQL, Fundamentals
  • 11:30am to 12:20pm
    Ballroom D
    • Operations
    • Business / Case Studies
    Marco "The Grinch" Tusa (Percona)
    ProxySQL is a flexible, fast and highly configurable piece of software that provides dramatic improvement to your cluster and high availability (HA) architecture. The presentation will discuss how to deploy ProxySQL in support of architectures using standard master/slave, Percona XtraDB Cluster/MySQL-Galera or MHA solutions. The presentation discusses details on: - ProxySQL host group - Using balancing and failover on standard master/slave - Integration with MHA - Integration with PXC/Galera - Sharding by hint Attendees will learn the best way to include ProxySQL to expand t
    MySQL, High Availability, Sharding
  • 11:30am to 12:20pm
    Ballroom E
    • Developer
    Peter Zaitsev (Percona)
    In this talk, I will introduce Percona’s latest open source software tool: Percona Monitoring and Management (PMM). I will briefly discuss high-level aspects of monitoring we’ve found work well in practice at Percona, and then spend the rest of the time with an online demonstration of how to use this tool to identify common MySQL issues. This demonstration will examine: · Base hardware performance and resource utilization assessment: CPU, disk, memory and network – all common performance problem causes.
    MySQL, Monitoring
  • 11:30am to 12:20pm
    Ballroom F
    • Operations
    Andrew Regner (Facebook)
    MySQL is by far the most common choice among Facebook engineering teams when they are looking for a persistent data store for their product or application. Not all of this data goes into the "Facebook Graph" as not everything developed inside Facebook applies to a user or something they are sharing. This creates a lot more unique use cases of MySQL inside Facebook than one team can operationally optimize for.
    MySQL, Tools and Techniques, Sharding
  • 11:30am to 12:20pm
    Ballroom G
    • Operations
    Art van Scheppingen (Severalnines AB)
    To operate MongoDB efficiently, you need to have insight into database performance. And with that in mind, we’ll dive into monitoring in this talk. MongoDB offers many metrics through various status overviews and commands, but which ones really matter to you? How do you trend and alert on them? What is the meaning behind the metrics? We’ll discuss the most important ones and describe them in ordinary plain MySQL DBA language. Finally we’ll have a look at the (open source) tools available for MongoDB monitoring and trending and compare them.
    MongoDB, Tools and Techniques, Monitoring
  • 11:30am to 12:20pm
    Ballroom H
    • Operations
    Ben Mildren (DigitalOcean)
    Percona Monitoring and Management (PMM) is a wonderful solution, it aims to provide out of the box monitoring and management for your MySQL (and/or MongoDB) environment.
    MySQL, Devops, Monitoring
  • 11:30am to 12:20pm
    Room 203
    • Operations
    • Wildcard
    Patrick King (Yelp)
    Abstract: Come on a journey with Patrick as he walks you through his transformation from a PostgreSQL DBA to a MySQL DBA. This talk will be a story of Patrick's journey where we'll walk through some of the major differences between PostgreSQL and MySQL and how we use both of them at Yelp. We'll talk about query plans and the query optimizer, index types and their structure, and the issues around schema changes; highlighting some fundamental differences between these two great database products.
    MySQL, Devops, Tools and Techniques
  • 11:30am to 12:20pm
    Room 204
    • Operations
    • Developer
    Kenny Gorman (Eventador.io)
    Apache Kafka has become very popular in the last few years. Kafka is a distributed pub/sub server for passing data in real-time. It's fault-tolerant, scalable, and extremely fast. In this talk I will discuss Kafka's core design, how it shares core architectural features of most modern databases, and how it can speed up certain workloads by amazing amounts.
    Other OSDB, Optimization, NoSQL
  • 11:30am to 12:20pm
    Room 209
    • Operations
    • Business / Case Studies
    Ignacio Iglesias (Percona)
    ---
    Devops, Tools and Techniques
  • 11:30am to 12:20pm
    Room 210
    • Developer
    Preetam Jinka (VividCortex)
    I’ve been learning about database internals by learning the theory, seeing how other systems have implemented ideas, and trying to implement them myself. Of course, it’s not always easy, and I’m not making anything that compares to the scale of something like MySQL or MongoDB, but reinventing a few small wheels here and there has given me a better understanding of how bigger systems work. I’ll talk about how some fundamental design choices regarding immutability, transactions, and ACID lead to some interesting trade-offs and implementation differences.
    MySQL
  • 12:20pm to 1:20pm
    None
    Lunch

1:20pm to 2:10pm

  • 1:20pm to 2:10pm
    Ballroom A
    • Developer
    Ike Walker (GitHub)
    In this session, I will discuss some of the practical uses of JSON in MySQL, focusing on version 5.7 but also discussing options for previous versions, and briefly discussing MySQL 8.0.
    MySQL, Tools and Techniques, Programming
  • 1:20pm to 2:10pm
    Ballroom B
    • Business / Case Studies
    Andrea Ponzo (Lastminute.com), Rick Pizzi (Lastminute.com)
    Even if InnoDB is universally recognized by DBAs as "the" engine to use for production data, there are use cases where InnoDB just does not fit. In this talk we will introduce the most popular engines available for MySQL to date, focusing on the limitations that plague InnoDB when it's about inserting rows at high rate and/or using a random primary key, like an UUID. We will describe how we worked around this limitations by using the TokuDB engine, first showcasing advantages of TokuDB versus InnoDB in terms of write performance.
    MySQL, Optimization, Tools and Techniques
  • 1:20pm to 2:10pm
    Ballroom C
    • Operations
    • Developer
    Manyi Lu (Oracle)
    There are substantial improvements in the Optimizer in MySQL 8.0. Most noticeably, we have added support for advanced SQL features like common table expressions, windowing functions and grouping() function. We also made DBAs’ life easier with invisible index, and additional hints that can be used together with the query rewrite plugin. On the performance side, cost model changes will make a huge impact. We have made JSON support even more powerful by adding functions requested by our users. Come and learn about new features in MySQL 8.0!
    MySQL, Optimization, NoSQL
  • 1:20pm to 2:10pm
    Ballroom D
    • Operations
    Shlomi Noach (GitHub)
    Orchestrator is a MySQL topology manager and a failover solution, used in production on many large MySQL installments. It allows for detecting, querying and refactoring complex replication topologies, and provides reliable failure detection and intelligent recovery & promotion. This session walks through orchestrator setup, deployment and usage best practices.
    MySQL, Tools and Techniques, High Availability
  • 1:20pm to 2:10pm
    Ballroom E
    • Operations
    • Developer
    Colin Charles (Percona), Ronald Bradford (Pythian)
    This presentation will discuss the features of MySQL/MariaDB that when enabled and used improve the default usage of MySQL. Many cloud-based applications fail to - Using appropriate filesystem permissions - Using TLS/SSL for connections - Using TLS/SSL with MySQL replication - Using external authentication plugins (LDAP, PAM, Kerberos) - Encrypting all your data at rest - Monitoring your database with the audit plugin - Reviewing and rejecting SQL injections - Designing application access using traditional firewall technology - Other MySQL/MariaDB security features We will end
    MariaDB, MySQL, Tools and Techniques
  • 1:20pm to 2:10pm
    Ballroom F
    • Developer
    zhai weixiang (Alibaba)
    The original MySQL provides an ability that if innodb_read_only is turned on, you can setup multiple instances based on same data file. But this approach will prevent all data changes, which is unpractical for most applications. In this session, I'll share how we change InnoDB to make it possible to scale out read only workload by sharing data files, but still provides all types of workload that a normal MySQL allows.
    MySQL, Optimization, Replication
  • 1:20pm to 2:10pm
    Ballroom G
    • Developer
    Grant Killian (Rackspace), Antonios Giannopoulos (Rackspace)
    Sitecore is a very popular .Net CMS which relies on MongoDB for several key components. This presentation will review all Sitecore elements as they relate to MongoDB, and consider the broader lessons one can learn for any application with parallel types of usage. We’ll share how to optimize MongoDB for Sitecore workload patterns, such as for managing various categories of HTTP Session state, Sitecore “xDB” write-heavy analytics, and for the more asynchronous map-reduce operations necessary when MongoDB does Sitecore data processing.
    MongoDB, Optimization, Ecommerce
  • 1:20pm to 2:10pm
    Ballroom H
    • Developer
    Peter Zaitsev (Percona), Nikita Ivanov (GridGain)
    MySQL® is an extremely popular and widely used RDBMS. Apache® Ignite™ is the leading open source in-memory computing platform which can provide speed and scale to RDBMS-based applications. Apache Ignite is inserted between existing application and data layers and works with all common RDBMS, NoSQL and Hadoop® databases. Join Nikita Ivanov, GridGain CTO and Co-Founder, and Peter Zaitsev, CEO and Co-Founder of Percona, as they discuss how you can supplement MySQL with Apache Ignite.
    Devops, Optimization, Business Intelligence
  • 1:20pm to 2:10pm
    Room 203
    • Operations
    Emanuel Calvo (Percona)
    Postgres Logical Decoding feature has been added in 9.4 and it is continuously improving thankfully to its vibrant community. In this talk we are going to walk through its concepts, usage and some of the new things up to come in further releases. Logical Decoding is one of the features that are under the BDR implementation, allowing bi directional streams of data between Postgres instances. Also, it allows to stream data outside Postgres into many other data systems.
    Other OSDB, Tools and Techniques
  • 1:20pm to 2:10pm
    Room 204
    • Developer
    Victor Tarnavsky (Yandex), Alexey Milovidov (ClickHouse)
    Yandex team has built one of the best opensource databases for analytics. It's fast, capable for storing petabytes of data and supports SQL. This talk is ClickHouse overview: features and benchmarks, plans and statuses, use cases and real users feedback. I will start with current ClickHouse status in DBMS market: how many users, what's the community size, and what's going on in contributing. In the main part I will cover up key database features and capabilities.
    Other OSDB, Optimization, Business Intelligence
  • 1:20pm to 2:10pm
    Room 209
    • Business / Case Studies
    • Developer
    Ignacio Iglesias (Percona)
    ---
    Tools and Techniques, Data in the Cloud
  • 1:20pm to 2:10pm
    Room 210
    • Operations
    Matt Ullmer (Yelp)
    Yelp has grown rapidly over the last several years, and we were forced to quickly scale our infrastructure to keep up. While every company experiences growth in a different way, there are some realities that all growing and successful organizations will face: geographical expansion will become a requirement, launching new database clusters will become vital, and all hosts will become replaceable. You must have automation and a powerful toolset, or be forced to put in a lot of hours and manual labor to accomplish these goals.
    MySQL, Optimization, Tools and Techniques

2:20pm to 3:10pm

  • 2:20pm to 3:10pm
    Ballroom A
    • Business / Case Studies
    Jeremy Tinley (Etsy.com), John Cesario (Square, INC), Ryan Lowe (Square, Inc.)
    Capacity planning is hard. It’s even harder if you want to do it right. One of the industry’s dirty little secrets is that in today’s computing climate, there are few organizations for whom the benefits of formal and rigorous capacity planning outweigh the costs. Having said that, there are a lot of easy and low-cost to get started with capacity planning that can provide significant value to any organization. Attendees of this talk will have a clear path forward to lead capacity planning efforts.
    Fundamentals, Metrics, Monitoring
  • 2:20pm to 3:10pm
    Ballroom B
    • Operations
    • Business / Case Studies
    David Peterson (Salesforce)
    We’ve all used or heard about various filesystems like ext3, ext4, or XFS but what about copy-on-write filesystems like btrfs or ZFS. With XFS being the de facto standard to host MySQL on, learn why Pardot chose ZoL (ZFS On Linux) instead to host their highly transactional MySQL database servers in production on. ZoL is a port of OpenZFS to Linux that was released in 2013. However, since then there has been a lot of FUD about its stability, performance, and just general use of it in the enterprise at a large scale.
    MySQL, Optimization, Tools and Techniques
  • 2:20pm to 3:10pm
    Ballroom C
    • Operations
    • Developer
    Sunny Bains (Oracle)
    The next version of MySQL will be a major release of new features and capabilities including a new data dictionary hosted in InnoDB. Learn all about the changes coming in the next version of InnoDB.
    MySQL, Fundamentals
  • 2:20pm to 3:10pm
    Ballroom D
    • Operations
    Jonah Berquist (GitHub)
    gh-ost is a new tool by GitHub which changes the paradigm of MySQL online schema changes, designed to overcome today's limitations and difficulties in online migrations.
    MySQL, Devops, Tools and Techniques
  • 2:20pm to 3:10pm
    Ballroom E
    • Operations
    Brian Cain (Dropbox)
    Overview: There are a number of built-in MySQL commands and third party tools out there for generating data to help tune and diagnose your servers. All that information is great, but not always easy to parse through; especially if you are in the middle of an emergency. What do you do when you need to see only the non-sleeping processlist threads, pick out the top 5 longest currently running queries, or show the oldest open transactions from show engine innodb status?
    MySQL, Devops, Tools and Techniques
  • 2:20pm to 3:10pm
    Ballroom F
    • Operations
    • Developer
    Lixun Peng (Alibaba Cloud)
    Flashback: - Makes use of the binary log to roll back an instance, database or table to a previous snapshot. - Is available as a first release is MariaDB 10.2.4/RDS MySQL 5.6. - Is implemented on the server-level, so supports all storage engines. - Makes use of full image format binary logs. - Is currently a mysqlbinlog feature (the --flashback option) The talk will discuss how Flashback is currently implemented, what it currently can and can't do, and what features are in the pipeline for future MariaDB/AliSQL releases.
    MariaDB, MySQL, Devops
  • 2:20pm to 3:10pm
    Ballroom G
    • Business / Case Studies
    • Developer
    Jon Dokulil ()
    Almost three years ago, Hudl moved from a monolithic to microservices architecture. I'll touch on why we made the move, but won't spend much time on the overall pros/cons of microservices. Instead I'll focus on our architecture, how we interpreted the "micro" in microservices. I'll talk about how we balance architecting for resilience and cost-efficiency, how we think about share-nothing in our architecture (and when to break that rule) and how we drastically improved our uptime. MongoDB powers nearly all of our services.
    MongoDB, Tools and Techniques, High Availability
  • 2:20pm to 3:10pm
    Ballroom H
    • Developer
    Mathias Herberts (Cityzen Data)
    Warp 10 is an Open Source solution for managing and analyzing time series data in multiple environments. Warp 10 offers an ingestion and storage layer based on LevelDB or HBase, and a data manipulation environment built around a custom language called WarpScript designed from the ground up for time series data analysis. WarpScript can be leveraged on data residing in Warp 10 or any other other datasource. WarpScript can also be used on massive datasets via its integration in Pig, Spark and Flink or on real time data streams through its bindings for Storm or Flink. This session will
    Tools and Techniques, Time Series, Metrics
  • 2:20pm to 3:10pm
    Room 203
    • Developer
    Lukas Fittl (Citus Data), Ozgun Erdogan (Citus Data)
    If you’re building a SaaS application, you probably already have the notion of tenancy built in your data model. Typically, most information relates to tenants / customers / accounts and your database tables capture this natural relation. With smaller amounts of data, it’s easy to throw more hardware at the problem and scale up your database.
    PostgreSQL, Tools and Techniques, Sharding
  • 2:20pm to 3:10pm
    Room 204
    • Business / Case Studies
    • Developer
    Michal Kuchta (Seznam.cz), Tomáš Komenda (Seznam.cz)
    Seznam.cz is the largest and the most visited web portal and search engine in the Czech Republic. It is one of a few search engines in the World which successfully competes with Google in the field of local full-text search. Besides the Search engine, Seznam runs over 40 different web services such as News portals, Map portal, Email service and many more. Thanks to various services we have many projects where we need different data warehouses.
    Tools and Techniques, Business Intelligence, Sharding
  • 2:20pm to 3:10pm
    Room 209
    • Business / Case Studies
    • Developer
    Ignacio Iglesias (Percona)
    ---
    Data as a Service, NoSQL, Data in the Cloud
  • 2:20pm to 3:10pm
    Room 210
    • Business / Case Studies
    • Developer
    Peter Friedenbach (Clustrix, Inc.)
    When looking for the “right” RDBMS for your application, there are many variables you need to consider to ensure you make the right choice. Not all databases are created equal, and you are inevitably going to come across some performance benchmark statistics when evaluating your options. There are a confusing variety of published benchmarks out there: YCSB, Sysbench with a variety of different versions and transaction mixes like 95:5 or 50:50, and others. What do these all mean? How do they relate to what I am trying to accomplish with my application?
    Other OSDB, Metrics, Business Intelligence

3:50pm to 4:40pm

  • 3:50pm to 4:40pm
    Ballroom A
    • Developer
    Bill Karwin (SchoolMessenger)
    We all have tasks from time to time for bulk-loading external data into MySQL. What's the best way of doing this? That's the task I faced recently when I was asked to help benchmark a multi-terabyte database. We had to find the most efficient method to reload test data repeatedly without taking days to do it each time. In my presentation, I'll show you several alternative methods for bulk data loading, and describe the practical steps to use them efficiently.
    MySQL
  • 3:50pm to 4:40pm
    Ballroom B
    • Operations
    Daniël van Eeden (Booking.com)
    If your MySQL connections travel trough your network and maybe even over the internet then there are many possibilities where someone could eavesdrop. Protecting this with SSL (or TLS actually) is the obvious solution, but there are many SSL related settings. And you need a certificate, which you could buy or generate one yourself. I'll cover a bit of authentication where it relates to SSL. I'll demonstrate how to set up SSL and why you should pick certain settings.
    MySQL
  • 3:50pm to 4:40pm
    Ballroom C
    • Developer
    Luis Soares (Oracle), Lars Thalmann (Oracle)
    The most popular High Availability (HA) techniques deployed are based on making services redundant, in particular by means of replication. This fits quite naturally in the MySQL universe as the MySQL server has provided a mature replication solution for over a decade now.
    MySQL, High Availability, Replication
  • 3:50pm to 4:40pm
    Ballroom D
    • Operations
    Susanne Lehmann (Yelp)
    How do you correctly direct traffic to your databases? How do you add new replicas or remove broken or retired replicas from the traffic flow? How do you scale elastically? Every company has their own way of how they solve the service discovery problem. Some use solely DNS, others use VIPs, Consul, …..etc. At Yelp we’ve had great success using SmartStack for service discovery in most of our infrastructure, and we wanted to use it for MySQL too.
    MySQL, Tools and Techniques, High Availability
  • 3:50pm to 4:40pm
    Ballroom E
    • Operations
    • Developer
    Josh Varner (SurveyMonkey)
    While MySQL is a very popular technology, there are still many developers who are not familiar with best practices in using MySQL, or in some cases relational databases in general.
    MySQL, Devops, Programming
  • 3:50pm to 4:40pm
    Ballroom F
    • Operations
    Garrett Plasky (Evernote), Chris Shumake (Evernote, Inc. )
    We'll be discussing using Google Cloud Platform and how it's different from a locally-controlled co-lo.
    MySQL, Devops, Data in the Cloud
  • 3:50pm to 4:40pm
    Ballroom G
    • Operations
    • Business / Case Studies
    David Murphy (Percona)
    We will dive into all the MongoDB Enterprise features, and discuss where there are open and closed source alternatives to these. The goal is for people to walk away knowing what they would give up using community or PSMDB over MongoDB Enterprise. Additionally, they should be aware if the open source version will meet their needs without requiring additional DevOps tooling being built by their team. Some of the areas we will cover: * LDAP Authentication * Audit Logs * Enterprise Backup options * Analytics and BI * Engine choices * Monitoring
    MongoDB, Fundamentals
  • 3:50pm to 4:40pm
    Ballroom H
    • Operations
    Kyle Hailey (Amazon)
    Today’s digital businesses depend more and more on databases in the cloud as a core foundation to their applications. Performance of mission-critical databases is crucial for the success of these businesses, yet businesses face challenges identifying database performance issues due to lack of visibility into database performance in the cloud. Visibility into database performance is difficult due to a wide range of incomplete tools that can be difficult to install, configure and maintain.
    Optimization, Data in the Cloud, Monitoring
  • 3:50pm to 4:40pm
    Room 203
    • Developer
    Nikolay Samokhvalov (-)
    Traditionally, many mobile and web developers consider using stored procedures as a very old and obvious anti-pattern. But time goes on, Open Source (R)DBMSs become stronger, and it is a good moment to look at stored procedures and triggers at different angle. There are certain cases when server programming right inside the DBMS might be much better rather than developing code and deploing it to the "application" servers.
    PostgreSQL, Tools and Techniques, Programming
  • 3:50pm to 4:40pm
    Room 204
    • Business / Case Studies
    • Developer
    Christopher Meiklejohn (Université catholique de Louvain)
    The CAP theorem demonstrates a spectrum of both consistency and availability tradeoffs distributed systems must make if they wish to remain available during network partitions.
    NoSQL, High Availability, Replication
  • 3:50pm to 4:40pm
    Room 209
    • Business / Case Studies
    • Developer
    Ignacio Iglesias (Percona)
    ---
    Devops, Tools and Techniques, Data in the Cloud
  • 3:50pm to 4:40pm
    Room 210
    • Developer
    MC Brown (Continuent)
    In this session we'll look at Tungsten Replicator, which enables real-time and efficient replication of data from your transactional database. We'll focus on the filtering side, for massaging your data before/during replication. During this session we will cover various solutions used by our customers, some of the complex deployment models, and how that information can be modified as part of the load into analytics targets. We will also look into how to replicate to non-transactional environments and how to customize and develop your own appliers and customizations.
    MySQL, Fundamentals

4:50pm to 5:15pm

  • 4:50pm to 5:15pm
    Ballroom A
    • Business / Case Studies
    • Developer
    Marco "The Grinch" Tusa (Percona)
    ProxySQL is a very powerful platform that allows us to manipulate and manage our connections and queries in a simple but effective way. Historically MySQL lacks in sharding capability.
    MySQL, Tools and Techniques, Sharding
  • 4:50pm to 5:15pm
    Ballroom B
    • Developer
    Mark Callaghan (Facebook)
    MyRocks is the RocksDB storage engine for MySQL. We have continued to make it better in the last 6 months. I will explain the new features and improvements to performance and efficiency. I will also explain the workloads for which it might be a great fit. Finally I will describe the differences between InnoDB and MyRocks.
    RocksDB, MySQL, Optimization
  • 4:50pm to 5:15pm
    Ballroom C
    • Operations
    Giuseppe Maxia (VMware)
    MySQL 8.0 introduced roles: a new security and administrative feature that allows DBAs to simplify user management and increases security of multi-user environments. Using roles is easy, once you have digested all the documentation.
    MySQL, Fundamentals
  • 4:50pm to 5:15pm
    Ballroom D
    • Operations
    • Developer
    Guangzhou Zhang (Alibaba)
    This session is to talk about how Alibaba built up Relational Database Service (RDS) for PostgreSQL and MySQL in its public cloud service, which has been serving thousands of users and still fast growing. The techniques used to achieve high availability and fulfill SLA will be detailed: database proxy, resource isolation, security, failover etc. The pitfalls that one should avoid while building up such service will be highlighted. Not only the architecture but also database kernel level enhancements will be introduced.
    Other OSDB, Data in the Cloud, High Availability
  • 4:50pm to 5:15pm
    Ballroom E
    • Business / Case Studies
    Charity Majors (Honeycomb)
    Do you feel like you know everything going on in the fast-moving field of monitoring, instrumentation and observability? Of course you don't: nobody does! But here's where to start. We'll do a whirlwind tour of the major categories (APM, log aggregation and metrics) and discuss their use cases, origin history and who uses them, as well as some true stories about crazy ancient design choices.
    Monitoring
  • 4:50pm to 5:15pm
    Ballroom F
    • Operations
    David Gildeh (Dataloop)
    At Dataloop.IO, we needed to adopt a new TSDB to hold billions of time series metrics for our SaaS monitoring tool. This talk will look into the performance, pros and cons of each of the top open-source TSDBs and why we decided to adopt Dalmatimer DB at Dataloop.IO.
    Devops, Tools and Techniques, Time Series
  • 4:50pm to 5:15pm
    Ballroom G
    • Operations
    Antonios Giannopoulos (Rackspace)
    With MongoDB 3.2 WiredTiger is now the default storage engine. This presentation will go through the configuration parameters when using MongoDB with WiredTiger and demonstrate performance benchmarks against different configuration values. Running WiredTiger with default settings is fine for most workloads but some deployments may require changes to the default settings and especially the WiredTiger core engine configuration variables. We will review all available configuration variables and go over benchmarks that will help us understand the impact of each variable change.
    MongoDB, Optimization
  • 4:50pm to 5:15pm
    Ballroom H
    • Operations
    David Murphy (Percona)
    We will cover what new features are in both MongoDB Community 3.4 and Percona Server for MongoDB 3.4. We will talk about the new graph functions, parallel balancing, sharding improvements, MongoDB zones, and many more new or improved areas of MongoDB. This talk is to give the community a lay of the land on what is working, and what might be good to wait to use. If you are new to MongoDB or have not yet switched to 3.4, this will be the high-level talk to give you the run down on official and unofficial changes.
    MongoDB, Fundamentals, NoSQL
  • 4:50pm to 5:15pm
    Room 203
    • Business / Case Studies
    Christophe Pettus (PostgreSQL Experts Inc.)
    Uber made waves in 2016 by publishing a paper describing their reasons for moving from PostgreSQL to MySQL. So, were they right? Mostly, no. We'll talk about the points they raise about PostgreSQL vs MySQL, and how they could have stayed on PostgreSQL while avoiding or mitigating the problems they faced... and honestly talk about how PostgreSQL can improve to meet their criticisms.
    PostgreSQL
  • 4:50pm to 5:15pm
    Room 204
    • Operations
    • Developer
    Brian Hawkins (Proofpoint)
    Time series data is everywhere and there is a lot of it. This session goes over coding and configuration challenges faced in order to setup the smallest KairosDB/Cassandra cluster that can handle 1 million metrics per second. Topics include: - An introduction to KairosDB - Coding challenges and what didn't work - Configuring Cassandra
    Optimization, Time Series, Programming
  • 4:50pm to 5:15pm
    Room 209
    • Business / Case Studies
    • Developer
    Ignacio Iglesias (Percona)
    ---
    Metrics
  • 4:50pm to 5:15pm
    Room 210
    • Business / Case Studies
    • Developer
    TBD
    Ignacio Iglesias (Percona)
    ---
    NoSQL, Business Intelligence

5:15pm to 5:40pm

  • 5:15pm to 5:40pm
    Ballroom A
    • Business / Case Studies
    • Wildcard
    Martin Arrieta (Pythian), Amanda MacDonald (Pythian)
    Clients are implementing changes at a fast rate, putting extra demand on us, the humans behind the task. How do we reduce human errors, while keeping up with the demand and still provide value? Would you get into a plane if the pilot didn’t check the aircraft? Ever had surgery? How do you know the surgeon didn’t leave a sponge behind? Checklists! While not glamorous or fancy, Human Reliability checklists will reduce human errors when DBA’s are working in high stress environments, and even when they are doing routine changes. After all, what is ever really routine when you are a DBA?
    MySQL, MongoDB, Tools and Techniques
  • 5:15pm to 5:40pm
    Ballroom B
    • Business / Case Studies
    • Developer
    Herman Lee (Facebook Inc.)
    MyRocks is a Facebook project to integrate the RocksDB LSM storage engine into MySQL. In the past, we have covered basic functionality and performance benefits of MyRocks. In this talk, we'll cover new features of MyRocks necessary to meet our scaling requirements. We have reduced data load times for the storage engine and added better diagnostic capabilities. We have found and fixed various performance and data consistency issues in RocksDB. We will also highlight the various challenges we faced deploying the MyRocks storage engine to Facebook's social graph infrastructure.
    RocksDB, MySQL, Devops
  • 5:15pm to 5:40pm
    Ballroom C
    • Developer
    Georgi Kodinov (Oracle), Mike Frank (Oracle)
    This session provides the developer centric view on MySQL 8.0 security. MySQL 8.0 is continuing in the direction set by MySQL 5.7: to be secure by default. The security development team has added quite a few interesting features to the core MySQL server, like for example SQL roles or making the ACL statements atomic.We will discuss the roles implementation and consider some of the interesting possibilities it provides.
    MySQL, Programming
  • 5:15pm to 5:40pm
    Ballroom D
    • Developer
    Yunjing Xu ()
    Smyte is building a fraud and spam detection platform that analyzes all of the traffic running through busy consumer websites and mobile apps. In this talk I'm going to describe how we build our own distributed database, SmyteDB, by integrating Kafka with RocksDB. In our design, Kafka enables us to support database replication and linearization without reinventing distributed primitives. Meanwhile, RocksDB's unique data model allows us to customize the storage layer for a wide variety of operations such as counting and list membership.
    RocksDB, Other OSDB, NoSQL
  • 5:15pm to 5:40pm
    Ballroom E
    • Business / Case Studies
    • Developer
    Baron Schwartz (VividCortex)
    In this talk you'll learn how we store and analyze time series data efficiently at VividCortex, using MySQL and Redis as a storage engine. VividCortex's time series workload presents interesting and unusual challenges that most conventional time series databases don't handle well, at a speed and volume that is also unusual. Building on MySQL and Redis enabled us to do this with low costs on relatively inexpensive EC2 instances.
    MySQL, Time Series, Metrics
  • 5:15pm to 5:40pm
    Ballroom F
    • Developer
    • Wildcard
    Christine Yen (Honeycomb)
    Honeycomb combines the raw accuracy of log aggregators, the speed of time series metrics, and the flexibility of APM (application performance metrics) to provide the world's first truly next-generation analytics service.
    Programming, Data in the Cloud, Metrics
  • 5:15pm to 5:40pm
    Ballroom G
    • Developer
    Jason Terpko (ObjectRocket by Rackspace)
    This presentation will demonstrate how you can use the aggregation pipeline with MongoDB similar to how you would use GROUP BY in SQL and the new stage operators coming 3.4. MongoDB’s Aggregation Framework has many operators that give you the ability to get more value out of your data, discover usage patterns within your data, or use the Aggregation Framework to power your application. Considerations regarding version, indexing, operators, and saving the output will be reviewed.
    MongoDB
  • 5:15pm to 5:40pm
    Ballroom H
    • Developer
    Denis Protyvenskyi (Percona)
    Percona Server for MongoDB is fully compatible with MongoDB Community Edition, and has some cool extra features (some of which are available only in MongoDB's Enterprise Edition). The features include: - Two extra storage engines: Percona Memory Engine and MongoRocks (based on RocksDB) - Audit - External authentication - HotBackup for WiredTiger and MongoRocks - Rate limiter extension The presentation is made by a Percona Server for MongoDB developer, so technical questions are welcome at the Q&A session.
    RocksDB, MongoDB
  • 5:15pm to 5:40pm
    Room 203
    • Operations
    Sean Chittenden (HashiCorp)
    PostgreSQL and ZFS were made for each other. This talk dives downstack into the internals and way that PostgreSQL consumes disk resources and tricks that are available if you run PostgreSQL on ZFS (ZFS on Linux, ZFS on FreeBSD, or ZFS on Illumos). Topics covered will include: *) Performance and sizing considerations *) Workload estimation heuristics *) Standard administrative practices that leverage ZFS *) Recovery using ZFS *) Performing database migrations using ZFS *) PITR of databases using ZFS
    PostgreSQL, Data in the Cloud, Replication
  • 5:15pm to 5:40pm
    Room 204
    • Operations
    • Developer
    Jesse White (The OpenNMS Group Inc.)
    Today's datasets are growing at an exponential rate. Collection, storage, analysis, and reporting are becoming more challenging, and the results more valued. A decade ago, RRDTool's algorithms were well suited to our requirements, but they fall short of scaling to current demands.
    Time Series, NoSQL, Metrics
  • 5:15pm to 5:40pm
    Room 209
    • Business / Case Studies
    Ignacio Iglesias (Percona)
    ---
    Data in the Cloud
  • 5:15pm to 5:40pm
    Room 210
    • Business / Case Studies
    TBD
    Ignacio Iglesias (Percona)
    ---
    Data in the Cloud