Karoly Nagy (Salesforce) delivers the talk, "From Scheduled Downtime to Self-Healing in Less Than a Year", on DAY 2 of the Percona Live Open Source Database Conference 2019, 5/30, at Austin, TX.
Infrastructure automation is not easy, especially for stateful services like MySQL (or any other database for that matter). It goes way beyond the capabilities of Ansible, Chef, SaltStack or other similar tools. In this session I'm going to show you how we went from fully manual operations to a self-healing system in less than a year at Salesforce. Having done this at several companies already, I've seen the common mistakes that can break your system and make your well intended scheduler/scripts/orchestrator a ticking bomb. I will share how to avoid these problems and build a robust and scalable automation framework that's been battle tested at companies such as Booking.com and Dropbox.
We will cover:
* Tool comparison
* Centralised vs decentralised system
* Concurrency handling
* Best practices and anti-patterns
Using PMM to Identify and Troubleshoot Problematic MySQL Queries
Google Cloud Platform: MySQL at Scale with Reliable HA
Analytical Queries in MySQL - PLO October 2020
From Containers to Kubernetes Operators - Philipp Krenn - Percona Live ONLINE 2020
MySQL Ecosystem on ARM - Krunal Bauskar - Percona Live ONLINE 2020