InfluxDB storage engine internals
Over the last year and a half we built an open source storage engine from scratch, specifically for time series data. In this presentation I take a deep dive into the storage engine inside InfluxDB. More than a single storage engine, InfluxDB is two engines in one: the first stores time series data and the second is an index for metadata. I'll delve into the optimizations for achieving high write throughput, compression, and fast reads for both the raw time series data and the metadata.
Paul Dix is cofounder and CTO of InfluxData, the company behind InfluxDB, the open source time series database. He has helped build software for startups, large companies, and organizations like Microsoft, Google, McAfee, Thomson Reuters, and Air Force Space Command. He is the series editor for Addison-Wesley's Data & Analytics book and video series. In 2010 Paul wrote the book Service-Oriented Design with Ruby and Rails for Addison-Wesley's Professional Ruby series. In 2009 he started the NYC Machine Learning Meetup, which has over 9,000 members. Paul holds a degree in computer science from Columbia University.