ClickHouse is an open source DBMS for high-performance analytics, originally developed at Yandex for the needs of Yandex.Metrica web analytics system. It is capable of storing petabytes of data and processing billions of rows per second per server, all while ingesting new data in real-time.
I will talk about architectural decisions we made with ClickHouse, their consequences from the point of view of an application developer and how to determine if ClickHouse is a good fit for your use case.
I will cover the following topics:
* Overview of storage engine and query execution engine.
* Data distribution and distributed query processing.
* Replication and where it sits on the consistency-availability spectrum.