Percona's CMO, Laurie Coffin, opens the keynotes!
Geir Hoydalsvik will talk about the focus, strategy, investments and innovations evolving MySQL to power next-generation Web, mobile, Cloud and embedded applications. He will also discuss the latest and the most significant MySQL database release ever in its history, MySQL 8.0.
Amazon RDS is a fully-managed database service that allows you to launch an optimally configured, secure and highly available database with just a few clicks. It manages time-consuming database administration tasks, freeing you up to focus on your applications and business. This keynote features the latest news and announcements from RDS including the launches of Aurora Serverless, Parallel Query, Backtrack, RDS MySQL 8.0, PostgreSQL 10.0, Performance Insights, and several other recent innovations.
This keynote talk will provide an architectural overview of TiDB, how and why it's MySQL compatible, the latest features and improvements in TiDB 2.1 GA release, and how its multi-cloud fully-managed solution works.
In this keynote, Yoshi will share interesting lessons learned from Facebook's production deployment and operations of MyRocks and future MyRocks development roadmaps. Vadim will discuss MyRocks in Percona Server for MySQL and share performance benchmarks tests from on-premises and cloud deployments.
The market coined the term "enterprise grade" or "enterprise ready" to differentiate products and service offerings for licensed database software. For example: there may be a standard database software or an entry-level package that delivers the core functionality and basic features. Likewise, there may be an enterprise version, a more advanced package which goes beyond the essentials to include features and tools indispensable for running critical solutions in production. With such a differentiation found in commercial software, we may wonder whether a solution built on top of an open source database like PostgreSQL can satisfy all the enterprise requirements.
So, in this talk, we shall discuss how you can build an Enterprise Grade PostgreSQL using open source solutions.
You need to store and retain time-series data in MongoDB and HDDs can't keep up with your insert rate, but you can't afford to keep everything on SSD? Then this presentation is for you. You'll learn how to use sharding and shard tagging to keep your inserts and most recent data on the fast SSDs and your archived data on the cheap HDDs, and how to quickly and efficiently transition data from SSD to HDD. You'll also learn the best programming techniques for adding your time-series data and accessing the data as a stream without missing any data points.
Provide a look at what is Optane Technology at Intel and how some of the Software add-ons and Open Source Databases test out on this technology.
In this session, Arjen provides an overview of the shiny new in MariaDB Server 10.3, as well as putting a spotlight on the many other little improvements and their contributors, which together make the MariaDB ecosystem thrive.
Both Galera and Group Replication provide clustering solution with virtual synchronous replication. The different replication technologies used by these two products ensure that data is replicated, consistent and safe and that several failure scenarios can be handled maintaining high availability. Although the database layer is highly available, applications rarely know the status of the various nodes in the cluster, therefore sending traffic to the wrong node(s) in a degraded system can lead to a high error rate.
ProxySQL has the ability to regularly check the status of the various nodes in the cluster, and determine which nodes are healthy and where to send traffic.
This session will present how to configure ProxySQL as a middle layer between the application and a clustering solution with either Galera or Group Replication, what algorithms are used to route traffic to the right node(s) and ensure that high availability is achieved even in case of degraded cluster conditions.
ClickHouse is fast and scalable open source SQL DBMS from Yandex. In this talk, I will briefly review what's new in 2018, and concentrate on advice and case studies how ClickHouse can solve immediate problems without significant investments of time and resources. ClickHouse can handle petabytes of data, but it perfectly works for smaller projects, thanks to its performance and easy integration with MySQL, Logstash and other open source technologies.
Amazon Aurora is a fully managed relational database service that combines the speed and availability of high-end commercial databases with the simplicity and cost-effectiveness of open source databases. With Aurora, we've completely reimagined how databases are built for the cloud, providing you higher performance, availability, and durability than previously possible. We'll dive deep into architectural details of Amazon Aurora with MySQL and PostgreSQL compatibility and review recent innovations such as Parallel Query, Backtrack, Serverless, and Multi-Master, along with best practices for utilizing the power of relational databases at cloud scale.
In this day and age, maintaining privacy throughout our electronic communications is absolutely necessary. Creating user accounts and not exposing your MongoDB environment to the wider internet are basic concepts that have been missed in the past. Once that has been addressed, individuals and organizations interested in becoming PCI compliant must turn to securing their data through encryption. With MongoDB, we have two options for encryption: at rest (only available as enterprise feature with MongoDB) and transport encryption. In this session, we will review
-Why encryption is important
-What are the prerequisites to set up encryption
-Step by step for encryption at rest and in transit
-Encrypting data with volume encryption in the cloud
-Percona for MongoDB encryption features
PostgreSQL has seen increasing enterprise adoption over the last years. While it has always been considered a rock-solid and secure data base system, recent years brought native replication and partitioning, JSON support as well as increasing federation to other databases or data sources. Operational features like high availability, horizontal and vertical scalability or zero-downtime upgrades can be achieved natively and/or via external projects.
This talk will present the most important enterprise-related features currently implemented in PostgreSQL and provide examples on how use them. It will also discuss features or areas that could potentially hamper adoption.
Redundancy and high availability are the basis for all production deployments. With MongoDB this can be achieved by deploying replica set. In this talk we'll explore how MongoDB replication works and what are the components of a replica set. Using examples of wrong deployment configurations, we will highlight how to properly run replica sets in production, whether it comes to on-premise deployment or in the cloud.
-How MongoDB replication works
-Replica sets components/deployment typologies
-Practices for wrong deployment configuration
-Hidden nodes, Arbiter nodes, Priority 0 nodes
-Availability zones and HA in single region
-Monitoring replica sets status
Graph databases allow users to analyze highly interconnected datasets and find patterns within these relationships. Social networks, corporate hierarchies, fraud detection, network analytics, or building whole knowledge graphs are great use cases for graph databases. However, these datasets of nodes and connecting edges change over time. Whether you are a developer, architect or data scientist, you may want to time travel for analyzing the past or even predict tomorrow.
While your graph database may be lacking built-in support for managing the revision history of graph data, this talk will show you how to manage it in a performant manner for general classes of graphs. Best of all, this won't require any groundbreaking new ideas. We'll simply borrow a few tools and tricks from existing persistent data structure literature and adapt them for good performance within the graph database software.
How do you exclude one table access from a database with too many tables? In previous versions of MariaDB and MySQL, you have to specifically grant access to all other tables individually. This does not scale, especially if tables are created and dropped frequently.
Reverse privileges solve this problem and many other use cases. With just one SQL statement you can ensure a user can never access a resource, regardless which roles or other privileges are granted to it.
Reverse privileges do not work in a vacuum, so we will start with a broad view of the full privilege system, before diving into the specifics of denying access to resources.
MyRocks is a new storage engine from Facebook and available in Percona Server for MySQL. In what cases you will want to use it? We will check different workload and when MyRocks is most suitable for you. Also as for any new engine, it might be not trivial to set up and tune it properly, so we will review the most important settings to pay attention to.
I will talk about unusual and unique ClickHouse features:
- aggregation states as first-class citizens;
- multidimensional arrays and higher-order functions;
- approximate query execution;
- incremental data aggregation;
- query processing without the server;
I will show practical examples and use cases from Yandex and other companies.
Amazon Aurora Serverless is an on-demand, auto-scaling configuration for Aurora (MySQL-compatible edition) where the database will automatically start up, shut down, and scale up or down capacity based on your application's needs. It enables you to run your database in the cloud without managing any database instances. Aurora Serverless is a simple, cost-effective option for infrequent, intermittent, or unpredictable workloads. We'll explore these use cases, take a look under the hood, and delve into the future of serverless databases. We'll also hear a case study from a customer building new functionality on top of Aurora Serverless.
In this session get an overview of all the new security features in MySQL 8.0 and how they fit together to answer the modern security challenges. MySQL 8 takes a new step in tightening the security of MySQL installations and provides new and flexible tools including a brand new default authentication method, SQL roles, enhancements in transparent disk encryption, and modern password controls on password reuse, complexity, and brute force password guessing.
This session will provide a brief overview of container options (Docker, OpenShift, AWS) for PostgreSQL and MySQL. We'll discuss the benefits of containers and how to administer, develop and monitor your database on containers. I'll also review and answer the top questions I get on containers.
MongoDB 4.0 finally supports multi-document ACID transactions. During the presentation, we'll discuss what is a transaction and what ACID properties are.
We'll show how to use transactions in MongoDB doing some real example and we'll discuss the pros and cons and current limitations.
Vitess has many features which make it easy for it to be deployed on a container orchestration framework like Kubernetes.
+ Master fail overs without client seeing errors
+ Fast discoverability of masters and replicas
+ Excellent observability
+ Multi-cell high availability support
During this talk, I will describe these features.
Then we will switch gears I will demonstrate a new Vitess feature: VReplication.
This talk will be an overview of new features in both MariaDB Server 10.3 and MySQL 8.0, as well as a compare/contrast of how certain features are implemented. There will be a look into some of the ways in which MariaDB and MySQL have diverged.
The recent enhancements to MySQL replication make operations easier, further reduce the administrative overhead, provide a much-improved user experience, and enable running MySQL smoothly both at scale and/or in very tightly coupled network infrastructures such as in a
Come and join the engineers behind the product to get to know the latest and greatest replication features and how these enable the creation of rock solid, scalable and resilient database services able to keep up with the most demanding work and fault-loads. Take this opportunity to expend your MySQL knowledge and learn more about hot
topics such as Group Replication/InnoDB Cluster.
At MessageBird we send hundreds of millions of messages across the world every month, which generate billions of events. Providing real-time* analytics is key to being able to quickly deliver those messages to our customers. From measuring and optimising the performance of our platform, to providing insights to our customers, ClickHouse is being used far and wide inside our organisation.
In this talk, we look into how Clickhouse allows us to ingest a large amount of data and run complex analytical interactive queries. We also present the business needs that brought ClickHouse to our attention and detail the journey to its deployment. We cover the problems we faced, and how we dealt with them. We talk about our current Cloud production setup and how we deployed and use it. Last but not least, we talk about mistakes we did along the way and what we learned from running and maintaining a ClickHouse cluster by ourselves.
In recent years, MySQL has become a top database choice for new application development and migration from overpriced, restrictive commercial databases. We'll provide an overview of the MySQL and MariaDB options available on AWS, and do a deep dive on Amazon Relational Database Service (RDS), a fully-managed MySQL service, and Amazon Aurora, a MySQL-compatible database with up to 5x the performance and many additional innovations.
Security is always a challenge when we come to data but regulations like GDPR brings a new layer on top of it with rules more and more restrictive to access and manipulate data. Join us in this presentation to check security best practices, traditional and new features available for MySQL including features coming with the new MySQL 8.
On this talk DBA's and sysadmins will walk through the security features available on the OS and MySQL:
MySQL 8 features (undo, redo and binlog encryption)
We will share our experience of working with 2000 support customers and help the audience to become familiar with all the security concepts and methods, and give you the necessary knowledge to apply to your environment.
pg_chameleon is a lightweight replication system written in python. The tool can connect to the MySQL replication protocol and replicate the data changes in PostgreSQL.
Whether the user needs to set up a permanent replica between MySQL and PostgreSQL or perform an engine migration, pg_chameleon is the perfect tool for the job.
The talk will cover the history the current implementation and the future releases.
The audience will learn how to set up a replica from MySQL to PostgreSQL in a few easy steps. There will be also a coverage on the lessons learned during the tool's development cycle and a small demo.
Database performance affects organizational performance, and we tend to look for quick fixes when under stress. But how can we better understand our database workload and factors that may cause harm to it? What are the limitations in MongoDB that could potentially impact cluster performance?
In this talk, we will show you how to identify the factors that limit database performance. We will start with the free MongoDB Cloud monitoring tools. Then we will move on to log files and queries. To be able to achieve optimal use of hardware resources, we will take a look into kernel optimization and other crucial OS settings. Finally, we will look into how to examine performance of MongoDB replication.
At Facebook, we created MyRocks (RocksDB Storage Engine for MySQL) a few years ago. Since then, we have migrated from InnoDB to MyRocks in our top three largest database services, including Facebook Messenger. On the other hand, we still have many InnoDB instances, especially for general purpose use cases. At previous Percona Live conferences, I spoke about how we migrated from InnoDB to MyRocks. In this session, I am going to talk more about lessons learned after migration. Topics like what kinds of production issues we faced will be covered. I am also going to talk about how we migrated Facebook Messenger -- from HBase to MyRocks.
Database optimizers are constantly improving. During this session, we will cover the new optimizations implemented by the optimizer team for MariaDB 10.3 release and the planned projects for the next 10.4 release.
* Condition Pushdown through GROUP BY and PARTITION BY
* Split-grouping for derived tables (a guaranteed speedup where the optimization applies)
* Improved Engine-Independent Table Statistics support
* New optimizer default settings
We store data with the intention to use it: search, retrieve, group, sort... To perform these actions effectively MySQL storage engines index data and communicate statistics with the Optimizer when it compiles a query execution plan. This approach works perfectly well unless your data distribution is not even.
Last year I worked on several tickets where data follow the same pattern: millions of popular products fit into a couple of categories and rest used the rest. We had a hard time to find a solution for retrieving goods fast. Workarounds for version 5.7 were offered. However new MySQL 8.0 feature: histograms, - would work better, cleaner and faster. This is how the idea of the talk was born.
I will discuss
- how index statistics physically stored
- which data exchanged with the Optimizer
- why it is not enough to make correct index choice
In the end, I will explain which issues resolve histograms and why using index statistics is insufficient for fast retrieving of not evenly distributed data.
MySQL and PostgreSQL are the two most popular open-source relational databases. To help to choose between them, a comparison of their query optimizers has been carried out. The aim of this session is to summarize the outcome of the comparison. Specifically, to point out optimizer-related strengths and weaknesses.
As more and more people are moving to PostgreSQL from Oracle, a pattern of mistakes is emerging. They can be caused by the tools being used or just not understanding how PostgreSQL is different than Oracle. In this talk, we will discuss the top mistakes people generally make when moving to PostgreSQL from Oracle and what the correct course of action.
The purpose of the talk is to present how to enable SSL and data-at-rest encryption and to how SSL and data-at-rest encryption work in under-the-hood of Percona XtraDB Cluster. The presentation will cover a process of making Percona XtraDB Cluster (PXC) secure.
- A configuration of Client-Server Traffic encryption. Is there any difference between C-S encryption in PXC and PS?
- A configuration of replication encryption (SST, IST). How does it work under-the-hood?
- How to make things more comfortable with the pxc-encrypt-cluster-traffic variable.
- PXC data-at-rest encryption. How does it work? How to configure it?
Is there any part of data that is unencrypted? Yes.
- Temporary files encryption.
- Binlog encryption. Possible or not?
- keyring_file. Is useful? How to make it profitable?
- keyring_vault. How does it work?
- Is Mix-match keyring plugins possible? Yes, but useful only when transitioning from keyring_file.
- What is happening in SST when data-at-rest is enabled?
- Performance overhead.
PostgreSQL provides a way to communicate with external Data sources which can be another PostgreSQL instance or any other Database. The other database can be a Relational database like MySQL, Oracle or any NoSQL Database like MongoDB or Hadoop. PostgreSQL Implements ISO Standard call SQL-MED to achieve that capabilities. This Presentation will explain the advantage of this feature and the working of that feature with examples.
* What do writeConflicts indicate?
* Internal writeConflicts process
* When does writeConflicts indicate a problem?
- Small number of writeConflicts/sec
- Large number of writeConflicts/sec
* What to do if you see a large number of writeConflicts/sec
* Identify the operation causing conflicts in the mongod log file
* Improving update operation patterns
The myth that containers aren't ready for databases still persists. We're not buying it.
We have built and open-sourced the MySQL operator for Kubernetes to solve this problem as a layer in the Presslabs stack for open WordPress hosting infrastructure.
We operate with lots of small to medium-sized DBs, in a DB-per-service model, as we call it. The workloads are mostly reads, so we need to efficiently scale that. The MySQL asynchronous replication model fits the bill very well, allowing to scale horizontally from one server (with the obvious availability pitfalls) to tens of nodes.
The operator is based on Percona Server for MySQL for its operational improvements â€”like utility user and backup locksâ€”and relies on Orchestrator to do the automatic failovers.
We plan to integrate it with Percona Management & Monitoring. Community feedback on the operations it automates and to make sure that the data is safely stored is very welcome.
MariaDB 10.3 introduced system-versioned tables. This session explains what we can do with this feature, how to investigate data changes, how to emulate some data warehouse functionalities, how to recover from certain human mistakes.
MySQL replication allows you to write on one writer server and easily scale out reads by redirecting reads to reader servers. But how do we guarantee read consistency with their last write? Galera replication can guarantee that, while MySQL Group Replication and standard MySQL async replication cannot.
If you are running MySQL Server or Percona Server, version 5.7 or newer, with GTID enabled, ProxySQL 2.0 is now able to ensure read consistently with the last write. ProxySQL is able to stream GTID information from all the reader servers, and in real-time is able to determine which reader server(s) is able to execute the SELECT statement producing a resultset that is read consistently with the last write (and GTID) executed by each client.
This presentation will show the technical details that allow you to build an architecture with thousands of ProxySQL instances and MySQL servers, and how GTID information is processed in real-time with limited bandwidth footprint.
Have you ever heard of Graph databases? You should!
Graph databases are optimised to manage fast storage and traversal of nodes and their relationships. In simple terms, they make relationships (or joins in relational jargon) first-class citizens and queries of interrelated nodes blazing fast.
This talk is about the basic concepts of Graph databases and Neo4j, an open source database server build from the ground up for graphs.
We will cover these topics:
- Intro to graphs and why they matter in the database world
- Graphs modelling for relational minds: how to build graphs and how to make the most out of it
- Intro to Neo4j - architecture and use cases
- Cypher, a declarative graph query language
- Neo4j installation and use cases
Managed database services are gaining in popularity. In this session we look at how best to configure Amazon RDS for PostgreSQL and also look at common user operations of using RDS for PostgreSQL. We will also look beyond common user operations and into some specific optimizations related to upgrade, logical replication, performance, and reducing downtime.
This session describes existing encryption-at-rest features for MongoDB, with a focus on what is freely available in open source packages. You'll learn how to set it up, how to rotate your master keys, and how to achieve full encryption rotation. You'll also learn about the future direction for open source alternatives for MongoDB encryption.
Or to be better say my PostgreSQL "hello World".
In this presentation I will illustrate the ways, the accidents and the surprises I had in my journey as MySQL DBA to implement a specular solution with PostgreSQL as total probie.
I will start from the basics, covering my journey in:
- Basic configuration
- Security definition
- Create a database and tables
- The magic behind indexes
Do you ever wonder why your MongoDB databases seem to be much slower than expected? Do you experience periodical hot spots and want to know the culprit(s)? Do you want to know which database operations are the most expensive and the ones most meaningful to improve?
In this talk you will quickly understand the limits of MongoDB's built-in database profiler, especially in sharded environments. I will present an open source tool we developed to solve these issues. We created a tool that collects, stores and helps to visually analyze slow MongoDB operations. By using the tool you will get a better picture of your MongoDB clusters without the need of a deeper MongoDB expertise. The tool runs database commands simultaneously against a multitude of databases or mongod's and with just a few clicks you can easily intervene and spot performance issues.
We present a kV database named "SharedRocks" which use d master-slave replication architecture for single write and multi-reads. It has high availability feature and easy to recovery from crashing. It's built on RocksDB which is the most popular key-value storage engine and PolarStore(PolarFS). The Alibaba's shared file storage. The shared file storage engine provides file persistence with three replicas and variety of new generation hardwares like RDMA, SPDK, etc. SharedRocks implements the master-slave replication between SharedRocks data nodes that are distribution architecture. SharedRocks' read only(slave) nodes are deployed very convenient. because of both of data and status are persistent on shared file storage. SharedRocks can access them anywhere..
So, you are a developer or sysadmin and showed some abilities in dealing with databases issues. And now, you have been elected to the role of DBA. And as you start managing the databases, you wonderâ€¦
* How do I tune them to make best use of the hardware?
* How do I optimize the Operating System?
* How do I best configure MySQL or MariaDB for a specific database workload?
If you're asking yourself the following questions when it comes to optimally running your MySQL or MariaDB databases, then this talk is for you!
We will discuss some of the settings that are most often tweaked and which can bring you significant improvement in the performance of your MySQL or MariaDB database. We will also cover some of the variables which are frequently modified even though they should not.
Performance tuning is not easy, especially if you're not an experienced DBA, but you can go a surprisingly long way with a few basic guidelines.
MySQL 8.0 introduced the new MySQL Document Store, which combines the
robust relational foundation and the power of the SQL language in MySQL
with CRUD-style document access in a NoSQL database. In this session, we
runtime, and MySQL 8.0's X DevAPI to build modern applications. This
session describes the architecture of the MySQL Document Store, the X
DevAPI, the X Protocol and related new technologies.
Percona Monitoring and Management (PMM) is a free and open-source solution for managing and monitoring MySQL and MongoDB performance. It provides accurate per-second analysis for MySQL and MongoDB servers, which allows you to tune the database as efficiently as possible.
This session will also be a review of internal PMM architecture, an overview of all components, and the communications between them.
In this session, we will dive deep into HOT (Heap Only Tuple) update optimization. Utilizing this optimization can result in improved writes rates, less index bloat and reduced vacuum effort but to enable PostgreSQL to use this optimization may require changing your application design and database settings. We will examine how the number of indexes, frequency of updates, fill factor and vacuum settings can influence when HOT will be utilized and what benefits you may be able to gain.
Information security continue to be a major concern and priority of organizations, as regulations such as GDPR continue to drive compliance requirements. Databases are of particular interest, as this is where all data, including sensitive information, is stored. As the amount of data collected continues to soar, and as the types of open source databases deployed for specific applications continue to expand, business risk continues to increase. Fortunately there are some basic steps you can take to improve your security hygiene and effective solutions to help you mitigate risk.