Starring Sakila: Building Data Warehouses and BI solutions using MySQL and Pentaho

Average: 8.8 (5 votes)

The Sakila sample database makes for a great mini-tutorial on how to design, create, load and use a data warehouse. This is the main thread of this mini tutorial. While the main thread of the turorial focuses on general business intelligence and data warehouse concepts, the tutorial also contains hands-on parts. The hands-on parts are a great way to get introduced to a variety of open source tools (mainly focused on the Pentaho) for building and developing data warehouses and business intelligence applications. To facilitate attendees in participating themselves in the hands-on parts, I will provide an archive containing all required software and materials, as well as detailed written instructions.

The following outline explains the structure of this tutorial in more detail

  • The Business Intelligence problem: Creating information out of data.
    • Different information needs and types of business intelligence
  • How data warehousing helps to solve the BI problem
    • What is a Data Warehouse?
    • Data Warehouse Requirements
    • Open source Analytic Database Systems (ADBMS)
  • Anatomy of a data warehouse
    • Data Warehouse Architectures
    • Dimensional modeling
  • Physical implementation of data warehouses
    • Star schemas and snowflakes
  • A star is born
    • Dimensional models for rentals and payments from the sakila database
    • Hands on: modeling a data warehouse with Kettle (a.k.a. Pentaho Data Integration)
  • Filling the data warehouse
    • Extraction, transformation and loading
    • Hands on: Data profiling with eobjects Data Cleaner
    • Hands on: Building ETL jobs and transformations with Kettle (a.k.a. Pentaho Data Integration)
  • Presenting the Data: BI Applications
    • Reporting, Dashboarding and Analysis
    • Hands on: Building reports with Pentaho report designer
    • Hands on: Building cubes and dynamic pivot tables with Mondrian (a.k.a. Pentaho Analysis Services)
    • Hands on: Dashboarding with the Community Dashboard Framework
Experience level: 

Schedule info

Time slot: 
10 April 13:30 - 16:30
Ballroom A

Schedule Info

10 April 13:30 - 16:30 @
Ballroom A


Roland Bouman's picture
Software Engineer, Pentaho

I'm @rolandbouman, a Web- and BI Developer and Information Analyst. I'm currently working as a Software Engineer for Pentaho, a world-leading Open Source Business Intelligence Suite. In the past, I have worked for Inter Access, MySQL AB, Sun Microsystems and Strukton Rail. I'm an active member of both the MySQL and Pentaho communities. I maintain a technical blog at and I co-authored two books about Pentaho: Pentaho Solutions and Pentaho Kettle Solutions.

Matt Casters's picture
Chief of Data Integration, Pentaho

I am the l chief architect and lead developer of the open source data integration project called Kettle or Pentaho Data Integration.
For the last 10 years I have been very busy writing this ETL tool which was open sourced in December 2005 and acquired by Pentaho (Open Source Business Intelligence) in 2006.


Sponsored By