In this blog post, I’ll review some of the talks on open source database technologies at Percona Live Europe.
In the last five years, the open source database community has been flourishing. There has been an explosion of creativity and innovation. The community has created many niche (and not so niche) solutions for various problems.
As a software engineer or architect, you might be excited by the number of available database options. You might also be intimidated by having to make the right technology choice. At Percona Live Europe, we have introductory talks on the technologies that we find particularly interesting. These talks will help expand your knowledge of the available solutions and make the choice less intimidating.
I’m looking forward to the exciting technologies and talks that we’ll cover this year, such as:
- If you’re looking for an overview, do not miss the tutorial NoSQL Data Stores in Research and Practice by Felix Gessert.
- There will also be a practical overview by Kris Köhntopp, Data Diversity at Booking.com, showing how Booking.com uses MySQL, Hadoop, HBase, Cassandra, Riak, PostgreSQL, MongoDB and Kafka together.
- Kimberly Wilkins will also share her experiences in Polyglot Persistence – DIVERGENT Databases for Different Purposes.
For talks and tutorials on specific use cases, check out the following sessions:
- RocksDB is a very cool write-optimized (LSM) storage engine, and one of the few that has been integrated into more than one database. In addition to the RocksDB-based systems inside Facebook, it can be used with MongoDB as MongoRocks and with MySQL as MyRocks. It is also used inside next-generation database systems such as CockroachDB and TiDB. We have a lot of talks about RocksDB and related integrations, ranging from a MyRocks Tutorial by Yoshinori Matsunobu, to a talk about MongoRocks by Igor Canadi, and a performance-focused talk by Mark Callaghan.
- Elastic is the leading technology for open source full-text search implementations (hence the previous name, ElasticSearch), but it is much more than that. ElasticSearch, Kibana, Logstash and Beats allow you to collect data from a variety of data sources, then analyze and visualize it. Philipp Krenn will cover full-text search in general in his Full-Text Search Explained talk, and go into more detail about ElasticSearch in ElasticSearch for SQL Users.
- I am sure you’ve heard about Redis, the Swiss army knife of different data structures and operations. Redis covers many typical data tasks, from caching to maintaining counters and queues. Itamar Haber will talk about Extending Redis with Modules to make Redis an even more powerful data store.
- Apache Spark is another technology you’ve surely heard about. Apache Spark adoption has skyrocketed in recent years due to its high-performance in-memory data analysis, replacing or supplementing Hadoop installations. We will hear from Alexander Krasheninnikov about Badoo’s experience processing 11 billion events a day with Spark, and learn from Tim Vaillancourt how to use Spark with MongoDB, MySQL and Redis.
- Apache Cassandra is a database focused on high availability and high performance, even when replicating among several data centers. When you think “eventual consistency,” perhaps Cassandra is the first technology that comes to mind. Cassandra allows you to do some impressive things, and Duy Hai Doan will show us some of them in his talk 7 things in Cassandra that you cannot find in RDBMS.
- ClickHouse is the new kid on the block, but I’m very excited about this distributed column store system for high-performance analytics. Built by the Yandex team to power real-time analytics on the scale of trillions of database records, ClickHouse went open source earlier this year. Victor Tarnavsky will share more details in his talk.
- Apache Ignite is another new but very exciting technology. Described as an in-memory data fabric, it can be used for a variety of applications to supplement or replace relational databases, ranging from advanced data caching strategies to parallel in-memory processing of large quantities of data. Christos Erotocritou will talk about some of these use cases in his talk Turbocharge Your SQL Queries In-Memory with Apache Ignite.
- RethinkDB is an interesting open source NoSQL database built from the ground up for scalable real-time applications. The end-to-end real-time data streaming feature is really cool, and makes it much easier to build interactive real-time applications. Ilya Verbitskiy will talk about RethinkDB in his Agile web-development with RethinkDB talk.
- CockroachDB is a distributed database focused on survivability and high performance (borrowing some ideas from Google’s innovative Spanner database). Marc Berhault will talk database rocket science in his talk Inside CockroachDB’s Survivability Model.
- TiDB is another open source NewSQL database, inspired by Google Spanner and F1. It can use a variety of storage engines for data storage, and it supports the MySQL wire protocol to ease application migration. Max Liu explains how it came together in his talk How TiDB was built.
- ToroDB is a very interesting piece of technology. It is protocol-compatible with MongoDB, but stores data in a relational format in PostgreSQL. This can offer substantial space reduction and performance improvements for some workloads. Álvaro Hernández from 8Kdata will discuss this technology in his ToroDB: All your MongoDB data are belong to SQL talk.
As you can see, we cover a wealth of exciting open source database technologies at Percona Live Europe. Do not miss the chance to expand your database horizons and learn about new developments in the industry. There is still time to register! Use the code PZBlog for a €30 discount on your registration!
Percona Live Europe 2016 is part of Amsterdam eWeek. Amsterdam eWeek provides a platform for national and international companies that focus on online marketing, media and technology, and for the business managers and entrepreneurs who use them, whether in retail, healthcare, finance, the game industry or media. Check it out!