AUTO-MDB: A FRAMEWORK FOR AUTOMATED MULTIDIMENSIONAL DATABASE DESIGN
VIA SCHEMA TRANSFORMATION
ALFREDO CUZZOCREA, ICAR-CNR & UNIV. OF CALABRIA, ITALY
RIM MOUSSA, HEJER AKAICHI, LATICE LAB., UNIV. OF TUNIS & ESTI, UNIV. OF CARTHAGE, TUNISIA

Motivations

Questions of Developers of BI Solutions

1. Advantages of On-Line Analytical Processing (OLAP):
• Presentation: visual OLAP and user interaction
• Ease of maintenance: data is stored as it is viewed
• Performance: fast computation of aggregated data
2. The BI market is booming: according to market watchers such as
Pringle & Company and Gartner, the market for BI platforms will
remain one of the fastest-growing software markets in most regions.
3. The MDB design milestone is often neglected: OLAP cubes are
defined haphazardly, without regard for performance and
maintenance cost.

Auto-MDB Framework
Simple rules for turning business queries into OLAP cubes:
• Measures definition
• Fact table definition
• Dimensions definition

Turning Business Query Q8 of TPC-H benchmark into
an OLAP Cube
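
As an illustration only, the three rules (measures, fact table, dimensions) can be sketched in Python. The poster does not spell the rules out, so the heuristics used here are assumptions: aggregate expressions become measures, the table feeding them becomes the fact table, and grouping and filtering attributes become dimension levels. The names BusinessQuery, CubeSpec and derive_cube, and the role names customer_region and supplier_nation, are hypothetical. The description of Q8 (the national market share query) is written by hand from the TPC-H query text, not produced by a SQL parser.

```python
from dataclasses import dataclass, field


@dataclass
class BusinessQuery:
    """Hand-written, simplified description of a SQL business query."""
    name: str
    aggregates: dict[str, str]  # measure name -> aggregate expression
    measure_source: str         # table whose columns feed the aggregates
    group_by: list[str]         # "table.attribute" used for grouping
    filters: list[str]          # "table.attribute" used in predicates


@dataclass
class CubeSpec:
    """Resulting cube definition: fact table, measures, dimensions."""
    name: str
    fact_table: str
    measures: dict[str, str]
    dimensions: dict[str, list[str]] = field(default_factory=dict)


def derive_cube(query: BusinessQuery) -> CubeSpec:
    """Apply the three assumed rules to one business query."""
    cube = CubeSpec(
        name="C" + query.name.lstrip("Q"),  # Q8 -> C8, as named on the poster
        fact_table=query.measure_source,    # rule: fact table definition
        measures=dict(query.aggregates),    # rule: measures definition
    )
    # Rule: dimensions definition. Every attribute used for grouping or
    # filtering becomes a level of the dimension named after its table.
    for qualified in query.group_by + query.filters:
        table, attribute = qualified.split(".", 1)
        levels = cube.dimensions.setdefault(table, [])
        if attribute not in levels:
            levels.append(attribute)
    return cube


# TPC-H Q8, described by hand. Role names (customer_region,
# supplier_nation) are hypothetical labels for the two uses of the
# region and nation tables in the query.
q8 = BusinessQuery(
    name="Q8",
    aggregates={"volume": "SUM(l_extendedprice * (1 - l_discount))"},
    measure_source="lineitem",
    group_by=["orders.o_orderdate"],
    filters=[
        "customer_region.r_name",
        "part.p_type",
        "supplier_nation.n_name",
        "orders.o_orderdate",
    ],
)

c8 = derive_cube(q8)
print(c8.fact_table, sorted(c8.dimensions))
```

The market share reported by Q8 is a ratio of two sums of volume, so in this sketch it would be a calculated member defined on top of the single stored measure rather than a second stored measure.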

How to define cubes?
Will there be a single cube or multiple cubes?
Which optimizations are the most suitable for running the workload?
Data fragmentation and parallel OLAP?
Derived data (aggregate tables, indexes, derived attributes, data synopses)?
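
To make two of these options concrete, here is a minimal Python sketch of an aggregate table (derived data) and of horizontal fragmentation by year. It is not the Auto-MDB implementation: the fact rows are made up, and the function names build_aggregate_table, fragment_by_year and volume_per_year are hypothetical.

```python
from collections import defaultdict

# Toy fact rows: (order_year, supplier_nation, volume). Values are made up.
FACTS = [
    (1995, "BRAZIL", 100.0),
    (1995, "CANADA", 300.0),
    (1996, "BRAZIL", 250.0),
    (1996, "CANADA", 250.0),
    (1996, "BRAZIL", 50.0),
]


def build_aggregate_table(facts):
    """Derived data: pre-compute SUM(volume) per (year, nation)."""
    agg = defaultdict(float)
    for year, nation, volume in facts:
        agg[(year, nation)] += volume
    return dict(agg)


def fragment_by_year(facts):
    """Data fragmentation: one horizontal fragment per order year."""
    fragments = defaultdict(list)
    for row in facts:
        fragments[row[0]].append(row)
    return dict(fragments)


def volume_per_year(agg):
    """Roll the aggregate table up to the year level, without the base facts."""
    totals = defaultdict(float)
    for (year, _nation), volume in agg.items():
        totals[year] += volume
    return dict(totals)


aggregate_table = build_aggregate_table(FACTS)

# Each fragment can be aggregated independently (for example on separate
# nodes). Fragments hold disjoint years, so the partial results can be
# merged with a plain dictionary update.
merged = {}
for fragment in fragment_by_year(FACTS).values():
    merged.update(build_aggregate_table(fragment))

print(volume_per_year(aggregate_table))
```

The per-fragment loop runs sequentially here; it only shows that the partial aggregates are independent, which is the property a parallel OLAP deployment would rely on.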

Project Goal
Full-featured solution for multidimensional database design

TPC-H*d Benchmark
• A truly OLAP variant of the TPC-H benchmark, the most prominent
decision support system benchmark
• The TPC-H SQL workload is translated into MDX (MultiDimensional
eXpressions)
• The workload is composed of 23 MDX statements for OLAP cubes and
23 MDX statements for OLAP business queries

Screenshots of C8 and Q8 Pivot Tables and corresponding MDX
Statements
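
The screenshots are not reproduced here. As a rough stand-in, the following Python sketch shows the kind of result a Q8 pivot table presents: the market share of one supplier nation per order year. The rows are made up and are assumed to be already restricted by Q8's filters (customer region, part type, date range); the function name market_share_by_year is hypothetical.

```python
from collections import defaultdict

# Made-up rows of a C8-style cube: (order_year, supplier_nation, volume).
ROWS = [
    (1995, "BRAZIL", 100.0),
    (1995, "CANADA", 300.0),
    (1996, "BRAZIL", 300.0),
    (1996, "CANADA", 300.0),
]


def market_share_by_year(rows, nation):
    """Share of `nation` in the total volume of each year."""
    total = defaultdict(float)
    national = defaultdict(float)
    for year, supplier_nation, volume in rows:
        total[year] += volume
        if supplier_nation == nation:
            national[year] += volume
    return {year: national[year] / total[year] for year in sorted(total)}


shares = market_share_by_year(ROWS, "BRAZIL")
for year, share in shares.items():
    print(year, share)
```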

Future Work
• Advanced virtual cube design
• Application to the TPC-DS benchmark
• Investigation of more derived data strategies, such as data
synopsis computation

References

E. F. Codd, S. B. Codd, and C. T. Salley. Providing OLAP to User-Analysts:
An IT Mandate. 1993.
A. Cuzzocrea and R. Moussa. Multidimensional Database Design via
Schema Transformation: Turning TPC-H into the TPC-H*d Multidimensional
Benchmark. COMAD, 2013.

Further Information
https://sites.google.com/site/rimmoussa/auto_multidimensional_dbs
rim.moussa@esti.rnu.tn

ACM CONFERENCE ON MANAGEMENT OF DATA COMAD@AHMEDABAD.INDIA 2013

