Building a Star Schema v1.1

Star Schemas using the Scalable Performance Data Engine (SAS® Software)
Patrick Cuba – Consultant

• Case Study – Need for SPDE
• SPDE Library
• Case Study – Need for SPDS
• SPDS Server
  - Clusters
  - Star Schema
  - StarJoin
• Questions
• References

• Table build takes 6 hours
• Query time is 20 minutes
• Latest table is 360 GB
• Generation tables hold 24 months
• Generation tables have grown to 1 TB each
• 300+ columns
• Four balances per credit card (Max 255)
• 20 million customers
• Growing customer base
• Keeps defaults customer balance

• At month end, the cycle-end and latest credit card data for the month are added to the SAS Generation Tables
• Accounts cycle on different days of the month
[Diagram: Cycle-end and Month end generation tables alongside the Latest table]

SAS Dataset
• SAS Datasets are flat files
libname all_users '/disk1/metadata';

• Under BASE SAS License
• Scalable Performance Data Engine (SPDE)
• On an SMP server (at least 2 CPUs)
• RAID
SAS SPD Dataset
[Diagram: an SPD dataset is split into a Meta file, multiple Data Part files, and HBX and IBX Index files]
libname all_users spde '/disk1/metadata'
   datapath=('/disk2/userdata' '/disk3/userdata')
   indexpath=('/disk4/userindexes' '/disk5/userindexes')
   partsize=128M;
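
Once the library is assigned, tables written through it are partitioned across the data paths by the engine. The following is a minimal sketch of loading and indexing a table in that library. The source table work.accounts and the column account_id are hypothetical names used only for illustration; they do not come from the case study.

```sas
/* Assumes the all_users SPDE libref shown above is already assigned. */
/* work.accounts and account_id are hypothetical names.               */

/* Copy a Base SAS table into the SPDE library. The engine splits the */
/* data into partition files of up to PARTSIZE= across the DATAPATH=  */
/* locations.                                                         */
data all_users.accounts;
   set work.accounts;
run;

/* Build an index; its files are written under INDEXPATH=. */
proc datasets library=all_users nolist;
   modify accounts;
   index create account_id;
quit;

/* Inspect the resulting table and its index. */
proc contents data=all_users.accounts;
run;
```

Spreading the data and index paths over separate disks is what lets an SMP server read the partitions in parallel, which is why the slide lists multiple CPUs and RAID as prerequisites.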

• Star Schema using StarJoin
• Clustered Cycle & Month end totalling 1 TB
• Table build is 30-40 minutes
• Query time is seconds to 5 minutes
[Diagram: a central Fact table surrounded by four Dimension tables]

• Scalable Performance Data Server
• Client/Server
• SQL Pass-thru

• Clusters
[Diagram: member tables M1 to M8 combined into one Cluster]
PROC SPDO LIBRARY=domain-name;
   SET ACLUSER user-name;
   CLUSTER CREATE cluster-table-name
      MEM = SPD-Server-table1
      MEM = SPD-Server-table2
      MAXSLOT=24;
QUIT;
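
As an illustration of how the template above might be filled in for monthly tables, here is a hedged sketch. The domain name, server address, user, password, and table names are all hypothetical, and it assumes the member tables already exist in the domain with identical columns and indexes.

```sas
/* Hypothetical domain, server, user, and table names throughout. */
libname spds sasspds 'ccdomain'
   server=spdshost.5400
   user='ccuser'
   password='XXXXXXXX';

proc spdo library=spds;
   set acluser ccuser;

   /* Bind two existing monthly tables into one cluster table. */
   cluster create month_end
      mem=month_end_201201
      mem=month_end_201202
      maxslot=24;

   /* Show which member tables the cluster now contains. */
   cluster list month_end;

   /* At the next month end, append the new monthly table. */
   cluster add month_end
      mem=month_end_201203;
quit;
```

With MAXSLOT=24 the cluster has room for 24 member tables, which lines up with the 24 months of history held in the case study. Queries then read month_end as a single table while each month is still loaded as its own member.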

• Facts and Dimensions
[Diagram: a central Fact table surrounded by four Dimension tables]
Pairwise: 7 Joins, 1 Select
StarJoin: 3 Steps

execute(reset nostarjoin=<1/0>)
• 1. Turn it
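
The RESET statement above is issued through SQL pass-through to the SPD Server. A minimal sketch of where it sits in a session follows. The connection values and the table and column names (balance_fact, customer_dim, month_dim and their keys) are hypothetical, and whether the optimizer actually chooses StarJoin depends on the table and index layout on the server.

```sas
/* Hypothetical domain, server, user, table, and column names. */
proc sql;
   connect to sasspds
      (dbq='ccdomain' server=spdshost.5400 user='ccuser' password='XXXXXXXX');

   /* NOSTARJOIN=0 leaves the StarJoin optimizer enabled;       */
   /* NOSTARJOIN=1 would force ordinary pairwise joins instead. */
   execute (reset nostarjoin=0) by sasspds;

   /* Fact table joined to two dimensions on equality keys, with the */
   /* filter placed on a dimension table.                            */
   select * from connection to sasspds
      (select c.segment,
              sum(f.balance) as total_balance
         from balance_fact f, customer_dim c, month_dim m
        where f.customer_key = c.customer_key
          and f.month_key    = m.month_key
          and m.month_name   = 'JUN2012'
        group by c.segment);

   disconnect from sasspds;
quit;
```

Running the same query once with NOSTARJOIN=1 and once with NOSTARJOIN=0 is a simple way to compare pairwise and StarJoin timings on the same data.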

• 2. No
[Diagram: one Fact table and six Dim tables]

• 3. Single
[Diagram: one Fact table and four Dim tables]
• 4. Single
Fact
• 5. Fact & Dimension

Email: patrickcuba@live.co.za
Mobile: 0458 91 2634
LinkedIn: http://www.linkedin.com/in/patrickcuba

STARJOIN
http://support.sas.com/documentation/cdl/en/spdsug/63088/HTML/default/viewer.htm#n0mlj75x9c4dtzn1ves84e1op3jt.htm
SAS® 9.1 Scalable Performance Data Engine
http://support.sas.com/documentation/onlinedoc/91pdf/sasdoc_91/base_dataeng_6996.pdf
SAS® 9.2 Scalable Performance Data Engine
http://support.sas.com/documentation/cdl/en/engspde/61887/PDF/default/engspde.pdf
When should you use the SPDE engine
http://support.sas.com/rnd/scalability/spde/when.html
