Alex has worked for the past two years on building the metrics and alerting infrastructure at Square.
27 April - 11:00 AM - 11:50 AM at Room 203
How do you surface important metrics across your infrastructure when needed most when you've got thousands of machines and each of those has thousands of metrics? Maintaining distributed collection systems with centralized storage and query layers, all with a desired uptime of 100% can be challenging. We'll discuss how Square manages its entire metrics stack from collection to the... [read more]