May 30, 2019
 Percona Live Open Source Database Conference 2019
  Alexander Zaitsev, Robert Hodges

Columnar stores like ClickHouse enable users to pull insights from big data in seconds, but only if you set things up correctly. This talk will walk through how to implement a data warehouse that contains 1.3 billion rows using the famous NY Yellow Cab ride data. We'll start with basic data implementation including clustering and table definitions, then show how to load efficiently. Next, we'll discuss important features like dictionaries and materialized views, and how they improve query efficiency. We'll end by demonstrating typical queries to illustrate the kind of inferences you can draw rapidly from a well-designed data warehouse. It should be enough to get you started--the next billion rows is up to you!



About the Authors

Alexander Zaitsev

Alexander is co-founder of Altinity. He has 20 years of engineering and engineering management experience in several international companies. Alexander is expert in high scale analytics systems design and implementation. He designed and deployed petabyte scale data warehouses, including one of earliest ClickHouse deployments outside of Yandex.
 

Robert Hodges

Robert has over 30 years of experience in database and application engineering. Prior to Continuent, he worked on object-relational databases and transaction processing at Sybase. He also held senior engineering positions at Portera and Exigen Group, where he designed and operated software-as-a-service applications. Robert is the original designer of Continuent's Tungsten solution.