Percona XtraDB Cluster: Failure Scenarios and their Recovery
Percona XtraDB Cluster (a.k.a PXC) is multi-master high-availability clustering solution. Given the multi-master aspect, there are multi-guards to protect cluster from entering an inconsistent state. Most of these guards are configurable based on user environment but if they are not configured properly it could cause the cluster to stall, fail, error-out.
In this session, we would go over some of these failure scenarios like cluster entering non-primary due to network partitioning, cluster stall due to flow control, data inconsistency causing shutdown of node, common problem during initial catch up (a.k.a State Snapshot transfer (SST)), delay in purging of transaction, blocking DDL causing complete cluster to staff, misconfigured cluster, etcâ€¦
We would also discuss how to solve some of these problems or have to safely recover from these failures.
About the Authors
Krunal BauskarKrunal joined Percona in September 2015. Before joining Percona he use to work as part of InnoDB team at MySQL/Oracle. He authored most of the temporary table revamp work besides lot of other features. In past he was associated with Yahoo! Labs researching on bigdata problems and database startup which is now part of Teradata. His interest mainly includes data-management at any scale and has been practicing it for more than decade now. He love to spend time with his family or get involved in some social work helping his society unless he is out for some near-by exploration drive. He is located out of Pune, India.
Alkin has extensive experience in enterprise relational databases working in various sectors for large corporations. With more then 20 years of industry experience he has acquired skills for managing large projects from ground up to production. For the past six years he's been focusing on e-commerce, SaaS and MySQL technologies. He managed and architected database topologies for high volume site at eBay Intl. He has several years of experience on 24X7 support and operational tasks as well as improving database systems for major companies. He has led MySQL global operations team on Tier 1/2 support for MySQL customers. Recently joined Percona's expert technical management team.