From Scheduled Downtime to Self-Healing in Less Than a Year

Thursday 11:55 AM - 12:45 PM

@ Hill Country A


50 minutes conference


Automation & AI

Infrastructure automation is not easy, especially for stateful services like MySQL (or any other database, for that matter). It goes well beyond the capabilities of Ansible, Chef, SaltStack, or similar tools. In this session I'm going to show you how we went from fully manual operations to a self-healing system in less than a year at Salesforce. Having done this at several companies already, I've seen the common mistakes that can break your system and turn your well-intentioned scheduler, scripts, or orchestrator into a ticking time bomb. I will share how to avoid these problems and build a robust, scalable automation framework that has been battle-tested at companies such as Dropbox.

We will cover:

* Tool comparison
* Centralised vs decentralised system
* Concurrency handling
* Best practices and anti-patterns


Karoly Nagy (Salesforce)

Lead MySQL Engineer



Percona Live Conferences

The Percona Live Open Source Database Conferences are the premier event for the diverse and active open source database community, as well as businesses that develop and use open source database software.
