Percona Live 2017 Open Source Database Conference

April 24 - 27, 2017

Santa Clara, California

OpenTSDB - Time Series Schema on Schemaless NoSQL

OpenTSDB - Time Series Schema on Schemaless NoSQL

 26 April - 1:00 PM - 1:50 PM @ Room 204
Experience level: 
Intermediate
Duration: 
50 minutes conference
Tracks:
Developer
Topics:
Other OSDB
Time Series
Metrics
Monitoring

Description

This talk will cover the special case of time series data and the evolution of various schemas from RRD files to RDBMS schemas to NoSQL stores. Particularly we'll focus on why, as the amount of time series data grows and slicing the data by various dimensions becomes important, many users eschewed RDBMS for NoSQL or custom data layers. We'll look at: * RRDTool * RDBMS * Single table RDBMS * Single table RDBMS with multi-dimensions * Partitioning RDBMS by time * Partitioning RDBMS by time and dimension * Moving to Key Value stores * Introduction to distributed hash tables (HBase, Bigtable, Cassandra 1.x) * OpenTSDB's schema on top of these tables * Alternative schemas on hash tables * Pros and Cons vs an RDBMS solution * (Time permitted) newer time series specific data stores (Druid, InfluxDB, Gorilla, others)

Speakers

Chris Larsen

Sr Software Engineer, Yahoo Inc.

Biography:

Developer and manager for OpenTSDB, an open source time series database.

Share this talk