Lin Ma (Carnegie Mellon University) delivers the talk, "Self-Driving Databases: It All Starts with Workload Forecasting", on DAY 2 of the Percona Live Open Source Database Conference 2019, 5/30, at Austin, TX.

Database management systems (DBMSs) are notoriously difficult to deploy and administer because of their long list of functionalities. If a system could optimize itself automatically, then it would remove many of the complications and costs involved with its deployment. Most of the advisory tools built by researchers and vendors are incomplete because they require humans to make the final decisions about any database change and only fix problems after they occur. Recent work has proposed "self-driving" DBMSs that optimize the system for both the application's current workload, as well as the expected workload in the future. These systems will support existing tuning techniques and capacity planning without requiring a human to determine the right way and proper time to deploy them.

The first step towards such an autonomous DBMS is the ability to model and predict the target application's workload. In this talk, I present a robust forecasting framework called "QueryBot 5000" that we designed for self-driving operations. The framework integrates with any DBMS to predict the expected arrival rate of queries in the future based on historical data. It then provides multiple prediction horizons (short- vs. long-term) with varying aggregation intervals. I also discuss our vision and progress on how a self-driving DBMSs uses these forecast models to optimize its performance.


Related Videos: Percona Live Sessions, Containers and Automation, Open Source Database

Open Source Adoption for Financial and Banking Institutions
Kubernetes: The Path to Open Source DBaaS
How to Maximize the Benefits of Using Open Source MongoDB with Percona Distribution for MongoDB
Engineering Data Reliably Using SLO Theory - Emily Gorcenski - Percona Live ONLINE 2020
Revision Control for Structured Data - Gavin Mendel Gleason - Percona Live ONLINE 2020
What If We Could Use Machine Learning Models as Tables? - Jorge Torres - Percona Live ONLINE 2020
Mastering Open Source Data Governance - Elisha Chitsenga - Percona Live ONLINE 2020