SlideShare a Scribd company logo
The Briefing Room
Thinking Outside the Cube: How In-Memory Bolsters Analytics
Twitter Tag: #briefr The Briefing Room
Welcome
Host:
Eric Kavanagh
eric.kavanagh@bloorgroup.com
Twitter Tag: #briefr The Briefing Room
!   Reveal the essential characteristics of enterprise software,
good and bad
!   Provide a forum for detailed analysis of today s innovative
technologies
!   Give vendors a chance to explain their product to savvy
analysts
!   Allow audience members to pose serious questions... and get
answers!
Mission
Twitter Tag: #briefr The Briefing Room
Topics
This Month: ANALYTIC PLATFORMS
September: ANALYTICS
October: DATA PROCESSING
Twitter Tag: #briefr The Briefing Room
Analytic Platforms
~Albert Einstein
If	
  you	
  always	
  do	
  what	
  you	
  
always	
  did,	
  you	
  will	
  always	
  
get	
  what	
  you	
  always	
  got.	
  
“ “
Twitter Tag: #briefr The Briefing Room
Analyst: Mark Madsen
 Mark Madsen is president
of Third Nature, Inc.
Twitter Tag: #briefr The Briefing Room
!   IBM Cognos Business Intelligence is an enterprise BI platform
with an open-data access strategy
!   The platform includes IBM Cognos Dynamic Cubes, an in-
memory relational OLAP component that complements the
existing query engine
!   Dynamic Cubes can enable users to perform interactive
analysis and reporting over terabytes of data
IBM
Twitter Tag: #briefr The Briefing Room
Guest: Chris McPherson
Chris McPherson is a Senior Product Manager
on the IBM Business Analytics Platform team
in the IBM Canada Ottawa Lab. His current
area of responsibility is IBM Cognos Dynamic
Cubes but prior to that, he was product
owner for Modelling, Metadata and EII for
the Cognos suite of tools. He has more than
nine years of experience within the IBM
Business Analytics organization.
© 2012 IBM Corporation
IBM Cognos Dynamic Cubes
Chris McPherson – Senior Product Manager
IBM Business Analytics
© 2012 IBM Corporation10
High performance analytics over
growing data volumes
Aggregate awareness
Aggregate optimization
In-memory caching of
members, data, expressions,
results, and aggregates
Dynamic Cubes
Feature mission
© 2012 IBM Corporation11
Extensive caching
–  Shared caches
for maximum
reuse
–  All caches are
security aware
Data Cache
In-Memory Aggregate Cache
Expression Cache
Result Set Cache
Member
Cache
Security
MDX
Engine
Security
Data Warehouse
Security
© 2012 IBM Corporation12
Security
Security
Security
Data Cache
In-memory Aggregates
Expression Cache
Member
Cache
MDX
Engine
Result Set
Cache
Query Processor
DQM
Dynamic
Cube
DQM
© 2012 IBM Corporation13
Security
Security
Security
Data Cache
In-memory Aggregates
Expression Cache
Member
Cache
MDX
Engine
SQL queries to obtain
member information
Result Set
Cache
Query Processor
DQM
Dynamic
Cube
DQM
© 2012 IBM Corporation14
Security
Security
Security
Data Cache
In-memory Aggregates
Expression Cache
Member
Cache
MDX
Engine
SQL queries to obtain
member information
SQL queries
to obtain
aggregate
data
Result Set
Cache
Query Processor
DQM
Dynamic
Cube
DQM
© 2012 IBM Corporation15
Security
Security
Security
Initial Query
Data Cache
In-memory Aggregates
Expression Cache
Member
Cache
MDX
Engine
SQL queries to obtain
member information
SQL queries
to obtain
fact and
summary data
SQL queries
to obtain
aggregate
data
Search aggregate
cache for data
Result Set
Cache
Query Processor
DQM
Dynamic
Cube
DQM
© 2012 IBM Corporation16
Security
Security
Security
Initial Query
Data Cache
In-memory Aggregates
Expression Cache
Member
Cache
MDX
Engine
SQL queries to obtain
member information
SQL queries
to obtain
fact and
summary data
SQL queries
to obtain
aggregate
data
Search aggregate
cache for data
Result Set
Cache
Query Processor
DQM
Dynamic
Cube
DQM
© 2012 IBM Corporation17
Dynamic Cube Lifecycle
1. Model & publish
The image cannot be displayed. Your computer may not have enough
memory to open the image, or the image may have been corrupted.
Restart your computer, and then open the file again. If the red x still
appears, you may have to delete the image and then insert it again.
2. Deploy, manage
3. Reporting & analytics
4. Optimize
Dynamic Cube Server
Dynamic
Cube
Logs
CM
Warehouse
© 2012 IBM Corporation18
1. Launch Aggregate Advisor Wizard
2. Run with or without workload
Optimize per report, package, user, or
time
3. Advisor returns with in-memory and/or in-
database recommendations
4. Save recommendations
§ In-memory aggregates created on re-start
à No re-modeling or re-authoring required
§ DBA creates in-database aggregate tables,
and modeler updates model and redeploys
Aggregate Advisor for in-memory aggregates
Easy performance improvements
© 2012 IBM Corporation19
Virtual Cubes
Virtual cube used as
source for another
virtual cube
Combines cubes with
common Time dimension
Virtual cubes
combine two
cubes
Combines cubes
with nearly identical
dimensions
Inventory
Sales
SalesInventory
Store
Sales
Web
Sales
© 2012 IBM Corporation20
Time
Current Month
All Sales cube
All
Sales
Current
Month
Sales
Historic
Sales
Virtual Cubes
Low latency & faster cube refresh
© 2012 IBM Corporation21
Cognos Dynamic Cubes - Summary
High Performance
•  80x improvement with aggregates
•  80% queries under 3 seconds
•  Over 50% queries sub-second
Growing Data Volumes
•  Scalable to terabytes of fact data
Flexible and Optimized
•  You choose where to take advantage of in-
memory capabilities
•  Aggregate Advisor to easily create
optimized aggregates
Maximize Value of Data Warehouse
•  Aggregate awareness to balance load
across app and DB tiers
•  Reduce load on database through use of
application tier caching
21
© 2012 IBM Corporation22
Twitter Tag: #briefr The Briefing Room
Perceptions & Questions
Analyst:
Mark Madsen
Commentary	
  on	
  
analysis	
  and	
  
performance,	
  
IBM	
  Business	
  Analy8cs	
  
Briefing	
  Room	
  	
  	
  
August, 27 2013
Mark Madsen
www.ThirdNature.net
@markmadsen	
  
Terminology	
  Disambigufica8on	
  
Analysis:	
  
a.	
  The	
  separa7on	
  of	
  an	
  intellectual	
  or	
  material	
  whole	
  into	
  
its	
  cons7tuent	
  parts	
  for	
  individual	
  study.	
  
b.	
  The	
  study	
  of	
  such	
  cons7tuent	
  parts	
  and	
  their	
  
interrela7onships	
  in	
  making	
  up	
  a	
  whole.	
  
	
  
Analy8cs:	
  the	
  mathy	
  stuff,	
  like	
  sta7s7cs,	
  machine	
  
learning,	
  numerical	
  methods,	
  data	
  mining*	
  (so	
  I	
  won’t	
  
use	
  the	
  term	
  as	
  a	
  synonym	
  for	
  OLAP)	
  
	
  
In-­‐memory:	
  a	
  vague	
  term	
  mainly	
  implying	
  not	
  using	
  disks	
  
for	
  immediate	
  data	
  access	
  
BI	
  is	
  using	
  broken	
  metaphors	
  
We	
  think	
  of	
  BI	
  as	
  publishing,	
  which	
  is	
  only	
  one	
  part.	
  
Most	
  BI	
  is	
  built	
  on	
  an	
  outdated	
  interac8on	
  model	
  
Result	
  of	
  a	
  poor	
  interac8on	
  model	
  
Delayed	
  interac<on	
  disrupts	
  work	
  
"...each second of system response
degradation leads to a similar degradation
added to the user's time for the following
[command]. This phenomenon seems to be
related to an individual's attention span.
The traditional model of a person thinking
after each system response appears to be
inaccurate. Instead, people seem to have a
sequence of actions in mind, contained in a
short-term mental memory buffer.
Increases in SRT [system response time]
seem to disrupt the thought processes, and
this may result in having to rethink the
sequence of actions to be continued.“
Note nonlinearity in graph, an indication
that something important is happening.
“The Economic Value of Rapid Response Time “, IBM 1982
 Tradi8onal	
  BI	
  fails	
  to	
  put	
  users	
  into	
  the	
  flow	
  zone	
  
Flow	
  (Csíkszentmihályi)	
  
▪  Concept	
  of	
  engagement	
  and	
  
immersion	
  in	
  a	
  task	
  	
  
▪  The	
  appropriate	
  applica7on	
  
of	
  tools	
  and	
  knowledge	
  to	
  
analy7cal	
  problems	
  enables	
  
produc7vity.	
  
▪  The	
  s7lted	
  interac7on	
  of	
  BI	
  
disrupts	
  flow.	
  
	
  
Interac8on	
  8mescale	
  for	
  analysis	
  problems	
  
Un7l	
  you	
  resolve	
  this	
  task	
  performance	
  gap,	
  real	
  analysis	
  
work	
  is	
  a	
  challenge	
  (and	
  a	
  reason	
  why	
  Excel	
  remains	
  
popular).	
  
Days
Hours
Minutes
Seconds
Instantaneous
come back tomorrow
go to lunch
take a break
get some coffee
check email/FB
take a sip of coffee
immerse yourself in work
Flow is possible only in the
“less than 3 second” range
Future-­‐proofing	
  
The	
  tool	
  market	
  is	
  shiIing,	
  
driven	
  by	
  new	
  architectures	
  
that	
  are	
  enabled	
  by	
  new	
  
technologies.	
  	
  
Front-­‐end	
  tools	
  are	
  evolving	
  
away	
  from	
  BI-­‐as-­‐publishing,	
  
which	
  is	
  going	
  to	
  increase	
  
the	
  burden	
  on	
  the	
  back	
  end	
  
data	
  stores	
  and	
  cause	
  
interac7on	
  problems.	
  
You	
  need	
  to	
  evaluate	
  tools	
  
based	
  on	
  more	
  detailed	
  
usage	
  scenarios	
  and	
  
interac7ve	
  capabili7es,	
  less	
  
on	
  report-­‐building	
  features.	
  
BI	
  should	
  support	
  two	
  sets	
  of	
  ac8ons.	
  One	
  is	
  monitoring	
  
the	
  known,	
  one	
  is	
  analyzing	
  the	
  unknown.	
  
Collect
new data
Monitor
Analyze
Exceptions
Analyze
Causes
Decide Act
No problem No idea Do nothing
Act on the process
Usually days/longer timeframe
Act within the process
Usually real-time to daily
The real BI design point: context and point of use
Information use is diverse
and varies based on context:
▪  Get a quick answer
▪  Solve a one-off problem
▪  Make repetitive decisions
▪  Monitor routine processes
▪  Make complex decisions
▪  Choose a course of action
▪  Convince others to take
action
Different problems require
different response times in order
to be effective.
How	
  expensive	
  was	
  performance?	
  500	
  GB	
  DW…	
  
Maximum
Capacities
•  2 to 30 100MHz Intel Pentium
processors
•  Up to 3.5GB system memory
•  Up to 1.7TB of on-line storage
Base
Configuration
•  18 slot Sequent bus chassis
•  1 Proc card - dual 100MHz Pentium
CPUs
•  1 2.1GB SCSI boot disk
•  1 CD-ROM/QIC-525 1/4” Tape
•  1 Memory controller (64MB, 256MB)
•  1 Integrated Ethernet
•  5-slot VMEbus chassis
•  Room for 3 additional 5.25” devices
Expansion
Options
•  Up to 400 SCSI-2 disks
•  Up to 29 VMEbus slots
•  Up to 8 QCIC I/O controllers
•  Token Ring, FDDI LAN adapters
•  Sync or Async communications
ports
Price: $1.6 million in 1993
OLAP	
  was	
  a	
  response-­‐8me	
  answer	
  
The	
  Codd	
  OLAP	
  paper	
  wriPen	
  for	
  a	
  vendor	
  in	
  
1993:	
  state	
  of	
  the	
  art	
  client	
  technology	
  was	
  the	
  60	
  
Mhz	
  Intel	
  Pen7um,	
  Windows	
  version	
  3.1;	
  server	
  
tech	
  was	
  the	
  $1M+	
  database	
  server	
  
	
  
It’s	
  s7ll	
  hard	
  to	
  get	
  less	
  than	
  3	
  second	
  response	
  
7mes	
  from	
  a	
  round-­‐trip	
  to	
  a	
  DB	
  
	
  
It’s	
  s7ll	
  hard	
  to	
  get	
  interac7on	
  right	
  when	
  the	
  BI	
  
model	
  is	
  mainly	
  compose-­‐compile-­‐execute.	
  
	
  
You lied about it
being in-memory I didn’t say it
would all fit in at
the same time…
Differen8a8ng	
  in-­‐memory	
  claims	
  
Tool	
  vs	
  PlaEorm:	
  OLAP	
  is	
  (generally)	
  in-­‐memory	
  
technology;	
  there	
  are	
  tradeoffs	
  in	
  the	
  choice	
  
PlaEorm:	
  
a)  Conven<onal:	
  use	
  a	
  large	
  buffer	
  pool	
  and	
  cache	
  or	
  pin	
  
everything	
  in	
  memory.	
  Speeds	
  up	
  a	
  DB,	
  but	
  not	
  really	
  
“in-­‐memory”.	
  
b)  Memory	
  op<mized:	
  designed	
  assuming	
  all	
  or	
  mostly	
  in	
  
memory;	
  map	
  the	
  data	
  needed	
  for	
  opera7ons	
  to	
  
memory	
  and/or	
  add	
  features	
  to	
  recognize	
  and	
  use	
  
large-­‐memory	
  configura7ons.	
  
c)  In-­‐memory:	
  purpose-­‐built,	
  the	
  en7re	
  database	
  is	
  
resident	
  in	
  main	
  memory;	
  the	
  only	
  disk	
  access	
  is	
  
loading	
  on	
  a	
  cold	
  start	
  or	
  logging	
  changes.	
  
Some	
  ques8ons	
  to	
  start	
  discussion	
  
1.  Will	
  this	
  work	
  with	
  any	
  database	
  back-­‐end?	
  
2.  Who	
  are	
  these	
  features	
  aimed	
  at:	
  end	
  users	
  or	
  the	
  
people	
  who	
  define	
  structures	
  and	
  manage	
  data	
  for	
  the	
  
end	
  users?	
  
3.  Are	
  cube	
  defini7ons	
  sta7c	
  in	
  this	
  model?	
  
4.  Can	
  cubes	
  be	
  populated	
  in	
  slices	
  or	
  layers	
  based	
  on	
  what	
  
a	
  person	
  is	
  looking	
  at?	
  
5.  How	
  do	
  the	
  caching	
  improvements	
  address	
  cube-­‐
building	
  7mes?	
  
6.  Is	
  this	
  addressing	
  sta7c	
  performance	
  management	
  or	
  
dynamic?	
  
7.  Are	
  virtual	
  cubes	
  defined	
  by	
  the	
  user	
  or	
  admin	
  or	
  can	
  
they	
  be	
  automa7c?	
  
About	
  the	
  Presenter	
  
Mark	
  Madsen	
  is	
  president	
  of	
  Third	
  
Nature,	
  a	
  technology	
  research	
  and	
  
consul7ng	
  firm	
  focused	
  on	
  business	
  
intelligence,	
  data	
  integra7on	
  and	
  data	
  
management.	
  Mark	
  is	
  an	
  award-­‐winning	
  
author,	
  architect	
  and	
  CTO	
  whose	
  work	
  
has	
  been	
  featured	
  in	
  numerous	
  industry	
  
publica7ons.	
  Over	
  the	
  past	
  ten	
  years	
  
Mark	
  received	
  awards	
  for	
  his	
  work	
  from	
  
the	
  American	
  Produc7vity	
  &	
  Quality	
  
Center,	
  TDWI,	
  and	
  the	
  Smithsonian	
  
Ins7tute.	
  He	
  is	
  an	
  interna7onal	
  speaker,	
  
a	
  contributor	
  at	
  Forbes	
  Online	
  and	
  
Informa7on	
  Management.	
  For	
  more	
  
informa7on	
  or	
  to	
  contact	
  Mark,	
  follow	
  
@markmadsen	
  on	
  TwiPer	
  or	
  visit	
  	
  
hPp://ThirdNature.net	
  	
  
About	
  Third	
  Nature	
  
Third Nature is a research and consulting firm focused on new and
emerging technology and practices in analytics, business intelligence, and
performance management. If your question is related to data, analytics,
information strategy and technology infrastructure then you‘re at the right
place.
Our goal is to help companies take advantage of information-driven
management practices and applications. We offer education, consulting
and research services to support business and IT organizations as well as
technology vendors.
We fill the gap between what the industry analyst firms cover and what IT
needs. We specialize in product and technology analysis, so we look at
emerging technologies and markets, evaluating technology and hw it is
applied rather than vendor market positions.
CC	
  Image	
  AWribu8ons	
  
Thanks	
  to	
  the	
  people	
  who	
  supplied	
  the	
  crea7ve	
  commons	
  licensed	
  images	
  used	
  in	
  this	
  presenta7on:	
  
train_to_sea.jpg	
  -­‐	
  hPp://www.flickr.com/photos/innoxiuss/457069767/	
  
well	
  town	
  hall.jpg	
  -­‐	
  hPp://flickr.com/photos/tuinkabouter/1135560976/	
  
	
  
	
  
	
  
	
  
	
  
	
  
Twitter Tag: #briefr The Briefing Room
Twitter Tag: #briefr The Briefing Room
September: ANALYTICS
October: DATA PROCESSING
Upcoming Topics
www.insideanalysis.com
Twitter Tag: #briefr The Briefing Room
Thank You
for Your
Attention

More Related Content

PPTX
DCIM Software Five Years Later: What I Wish I Had Known When I Started (Case ...
PPTX
How to Manage Hybrid Data Center Environments
PPTX
North Devon Farms - Getting to know the Cloud 14th Oct 2015
PDF
Cloud Explained
PDF
10 Key Considerations for a Successful DCIM Project
PPTX
IDC DCIM Webinar - How to Take Control of Chaos in a Lights-Out Data Center
PDF
Accenture hana-in-memory-pov
PDF
5 technologies that shorten the document lifecycle & increase productivity
DCIM Software Five Years Later: What I Wish I Had Known When I Started (Case ...
How to Manage Hybrid Data Center Environments
North Devon Farms - Getting to know the Cloud 14th Oct 2015
Cloud Explained
10 Key Considerations for a Successful DCIM Project
IDC DCIM Webinar - How to Take Control of Chaos in a Lights-Out Data Center
Accenture hana-in-memory-pov
5 technologies that shorten the document lifecycle & increase productivity

Viewers also liked (7)

PPT
Creative Briefing
PDF
Debriefing nln tech conf poster -fc
PPTX
Effective Briefing Techniques
PPTX
Effective Briefing Techniques Day Three
PPT
The Brief For The Sistine Chapel
PPT
A practical guide to Creative Briefs and Briefings
PPTX
3 hard facts shaping higher education thinking and behavior
Creative Briefing
Debriefing nln tech conf poster -fc
Effective Briefing Techniques
Effective Briefing Techniques Day Three
The Brief For The Sistine Chapel
A practical guide to Creative Briefs and Briefings
3 hard facts shaping higher education thinking and behavior

Similar to Thinking Outside the Cube: How In-Memory Bolsters Analytics (20)

PDF
Data Engineer's Lunch #85: Designing a Modern Data Stack
PPTX
The Evolution of a Scrappy Startup to a Successful Web Service
PDF
Accounting Information Systems The Crossroads of Accounting and IT 2nd Editio...
PDF
IMS02 autonomics for ims with the ibm management console for ims and db2 fo...
PDF
Accounting Information Systems The Crossroads of Accounting and IT 2nd Editio...
PDF
Doing Analytics Right - Building the Analytics Environment
PPTX
engage 2015 - - 2015 - Infrastructure Assessment - Analyze, Visualize and Op...
PPTX
Big Data & Information Management Channel Manager
PDF
Take Action: The New Reality of Data-Driven Business
DOCX
PDF
ICS UserGroup - 2015 - Infrastructure Assessment - Analyze, Visualize and Opt...
PDF
An In-Depth Look at Pinpointing and Addressing Sources of Performance Problem...
PDF
Does it only have to be ML + AI?
PPTX
Building an Effective Data Warehouse Architecture
PDF
OSA Con 2022 - Scaling your Pandas Analytics with Modin - Doris Lee - Ponder.pdf
PPT
Nw2008 tips tricks_edw_v10
PPTX
Creating a Single Source of Truth: Leverage all of your data with powerful an...
PDF
The Growth Of Data Centers
PDF
Lean Enterprise, Microservices and Big Data
Data Engineer's Lunch #85: Designing a Modern Data Stack
The Evolution of a Scrappy Startup to a Successful Web Service
Accounting Information Systems The Crossroads of Accounting and IT 2nd Editio...
IMS02 autonomics for ims with the ibm management console for ims and db2 fo...
Accounting Information Systems The Crossroads of Accounting and IT 2nd Editio...
Doing Analytics Right - Building the Analytics Environment
engage 2015 - - 2015 - Infrastructure Assessment - Analyze, Visualize and Op...
Big Data & Information Management Channel Manager
Take Action: The New Reality of Data-Driven Business
ICS UserGroup - 2015 - Infrastructure Assessment - Analyze, Visualize and Opt...
An In-Depth Look at Pinpointing and Addressing Sources of Performance Problem...
Does it only have to be ML + AI?
Building an Effective Data Warehouse Architecture
OSA Con 2022 - Scaling your Pandas Analytics with Modin - Doris Lee - Ponder.pdf
Nw2008 tips tricks_edw_v10
Creating a Single Source of Truth: Leverage all of your data with powerful an...
The Growth Of Data Centers
Lean Enterprise, Microservices and Big Data

More from Inside Analysis (20)

PDF
An Ounce of Prevention: Forging Healthy BI
PDF
Agile, Automated, Aware: How to Model for Success
PDF
First in Class: Optimizing the Data Lake for Tighter Integration
PDF
Fit For Purpose: Preventing a Big Data Letdown
PDF
To Serve and Protect: Making Sense of Hadoop Security
PDF
The Hadoop Guarantee: Keeping Analytics Running On Time
PDF
Introducing: A Complete Algebra of Data
PDF
The Role of Data Wrangling in Driving Hadoop Adoption
PDF
Ahead of the Stream: How to Future-Proof Real-Time Analytics
PDF
All Together Now: Connected Analytics for the Internet of Everything
PDF
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
PDF
The Biggest Picture: Situational Awareness on a Global Level
PDF
Structurally Sound: How to Tame Your Architecture
PDF
SQL In Hadoop: Big Data Innovation Without the Risk
PDF
The Perfect Fit: Scalable Graph for Big Data
PDF
A Revolutionary Approach to Modernizing the Data Warehouse
PDF
The Maturity Model: Taking the Growing Pains Out of Hadoop
PDF
Rethinking Data Availability and Governance in a Mobile World
PDF
DisrupTech - Dave Duggal
PPTX
Modus Operandi
An Ounce of Prevention: Forging Healthy BI
Agile, Automated, Aware: How to Model for Success
First in Class: Optimizing the Data Lake for Tighter Integration
Fit For Purpose: Preventing a Big Data Letdown
To Serve and Protect: Making Sense of Hadoop Security
The Hadoop Guarantee: Keeping Analytics Running On Time
Introducing: A Complete Algebra of Data
The Role of Data Wrangling in Driving Hadoop Adoption
Ahead of the Stream: How to Future-Proof Real-Time Analytics
All Together Now: Connected Analytics for the Internet of Everything
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
The Biggest Picture: Situational Awareness on a Global Level
Structurally Sound: How to Tame Your Architecture
SQL In Hadoop: Big Data Innovation Without the Risk
The Perfect Fit: Scalable Graph for Big Data
A Revolutionary Approach to Modernizing the Data Warehouse
The Maturity Model: Taking the Growing Pains Out of Hadoop
Rethinking Data Availability and Governance in a Mobile World
DisrupTech - Dave Duggal
Modus Operandi

Recently uploaded (20)

PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Empathic Computing: Creating Shared Understanding
PDF
Machine learning based COVID-19 study performance prediction
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Encapsulation_ Review paper, used for researhc scholars
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
KodekX | Application Modernization Development
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Approach and Philosophy of On baking technology
PDF
cuic standard and advanced reporting.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Electronic commerce courselecture one. Pdf
NewMind AI Weekly Chronicles - August'25 Week I
Empathic Computing: Creating Shared Understanding
Machine learning based COVID-19 study performance prediction
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Encapsulation_ Review paper, used for researhc scholars
The AUB Centre for AI in Media Proposal.docx
Review of recent advances in non-invasive hemoglobin estimation
Understanding_Digital_Forensics_Presentation.pptx
MIND Revenue Release Quarter 2 2025 Press Release
KodekX | Application Modernization Development
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Approach and Philosophy of On baking technology
cuic standard and advanced reporting.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Diabetes mellitus diagnosis method based random forest with bat algorithm
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Electronic commerce courselecture one. Pdf

Thinking Outside the Cube: How In-Memory Bolsters Analytics

  • 1. The Briefing Room Thinking Outside the Cube: How In-Memory Bolsters Analytics
  • 2. Twitter Tag: #briefr The Briefing Room Welcome Host: Eric Kavanagh eric.kavanagh@bloorgroup.com
  • 3. Twitter Tag: #briefr The Briefing Room !   Reveal the essential characteristics of enterprise software, good and bad !   Provide a forum for detailed analysis of today s innovative technologies !   Give vendors a chance to explain their product to savvy analysts !   Allow audience members to pose serious questions... and get answers! Mission
  • 4. Twitter Tag: #briefr The Briefing Room Topics This Month: ANALYTIC PLATFORMS September: ANALYTICS October: DATA PROCESSING
  • 5. Twitter Tag: #briefr The Briefing Room Analytic Platforms ~Albert Einstein If  you  always  do  what  you   always  did,  you  will  always   get  what  you  always  got.   “ “
  • 6. Twitter Tag: #briefr The Briefing Room Analyst: Mark Madsen  Mark Madsen is president of Third Nature, Inc.
  • 7. Twitter Tag: #briefr The Briefing Room !   IBM Cognos Business Intelligence is an enterprise BI platform with an open-data access strategy !   The platform includes IBM Cognos Dynamic Cubes, an in- memory relational OLAP component that complements the existing query engine !   Dynamic Cubes can enable users to perform interactive analysis and reporting over terabytes of data IBM
  • 8. Twitter Tag: #briefr The Briefing Room Guest: Chris McPherson Chris McPherson is a Senior Product Manager on the IBM Business Analytics Platform team in the IBM Canada Ottawa Lab. His current area of responsibility is IBM Cognos Dynamic Cubes but prior to that, he was product owner for Modelling, Metadata and EII for the Cognos suite of tools. He has more than nine years of experience within the IBM Business Analytics organization.
  • 9. © 2012 IBM Corporation IBM Cognos Dynamic Cubes Chris McPherson – Senior Product Manager IBM Business Analytics
  • 10. © 2012 IBM Corporation10 High performance analytics over growing data volumes Aggregate awareness Aggregate optimization In-memory caching of members, data, expressions, results, and aggregates Dynamic Cubes Feature mission
  • 11. © 2012 IBM Corporation11 Extensive caching –  Shared caches for maximum reuse –  All caches are security aware Data Cache In-Memory Aggregate Cache Expression Cache Result Set Cache Member Cache Security MDX Engine Security Data Warehouse Security
  • 12. © 2012 IBM Corporation12 Security Security Security Data Cache In-memory Aggregates Expression Cache Member Cache MDX Engine Result Set Cache Query Processor DQM Dynamic Cube DQM
  • 13. © 2012 IBM Corporation13 Security Security Security Data Cache In-memory Aggregates Expression Cache Member Cache MDX Engine SQL queries to obtain member information Result Set Cache Query Processor DQM Dynamic Cube DQM
  • 14. © 2012 IBM Corporation14 Security Security Security Data Cache In-memory Aggregates Expression Cache Member Cache MDX Engine SQL queries to obtain member information SQL queries to obtain aggregate data Result Set Cache Query Processor DQM Dynamic Cube DQM
  • 15. © 2012 IBM Corporation15 Security Security Security Initial Query Data Cache In-memory Aggregates Expression Cache Member Cache MDX Engine SQL queries to obtain member information SQL queries to obtain fact and summary data SQL queries to obtain aggregate data Search aggregate cache for data Result Set Cache Query Processor DQM Dynamic Cube DQM
  • 16. © 2012 IBM Corporation16 Security Security Security Initial Query Data Cache In-memory Aggregates Expression Cache Member Cache MDX Engine SQL queries to obtain member information SQL queries to obtain fact and summary data SQL queries to obtain aggregate data Search aggregate cache for data Result Set Cache Query Processor DQM Dynamic Cube DQM
  • 17. © 2012 IBM Corporation17 Dynamic Cube Lifecycle 1. Model & publish The image cannot be displayed. Your computer may not have enough memory to open the image, or the image may have been corrupted. Restart your computer, and then open the file again. If the red x still appears, you may have to delete the image and then insert it again. 2. Deploy, manage 3. Reporting & analytics 4. Optimize Dynamic Cube Server Dynamic Cube Logs CM Warehouse
  • 18. © 2012 IBM Corporation18 1. Launch Aggregate Advisor Wizard 2. Run with or without workload Optimize per report, package, user, or time 3. Advisor returns with in-memory and/or in- database recommendations 4. Save recommendations § In-memory aggregates created on re-start à No re-modeling or re-authoring required § DBA creates in-database aggregate tables, and modeler updates model and redeploys Aggregate Advisor for in-memory aggregates Easy performance improvements
  • 19. © 2012 IBM Corporation19 Virtual Cubes Virtual cube used as source for another virtual cube Combines cubes with common Time dimension Virtual cubes combine two cubes Combines cubes with nearly identical dimensions Inventory Sales SalesInventory Store Sales Web Sales
  • 20. © 2012 IBM Corporation20 Time Current Month All Sales cube All Sales Current Month Sales Historic Sales Virtual Cubes Low latency & faster cube refresh
  • 21. © 2012 IBM Corporation21 Cognos Dynamic Cubes - Summary High Performance •  80x improvement with aggregates •  80% queries under 3 seconds •  Over 50% queries sub-second Growing Data Volumes •  Scalable to terabytes of fact data Flexible and Optimized •  You choose where to take advantage of in- memory capabilities •  Aggregate Advisor to easily create optimized aggregates Maximize Value of Data Warehouse •  Aggregate awareness to balance load across app and DB tiers •  Reduce load on database through use of application tier caching 21
  • 22. © 2012 IBM Corporation22
  • 23. Twitter Tag: #briefr The Briefing Room Perceptions & Questions Analyst: Mark Madsen
  • 24. Commentary  on   analysis  and   performance,   IBM  Business  Analy8cs   Briefing  Room       August, 27 2013 Mark Madsen www.ThirdNature.net @markmadsen  
  • 25. Terminology  Disambigufica8on   Analysis:   a.  The  separa7on  of  an  intellectual  or  material  whole  into   its  cons7tuent  parts  for  individual  study.   b.  The  study  of  such  cons7tuent  parts  and  their   interrela7onships  in  making  up  a  whole.     Analy8cs:  the  mathy  stuff,  like  sta7s7cs,  machine   learning,  numerical  methods,  data  mining*  (so  I  won’t   use  the  term  as  a  synonym  for  OLAP)     In-­‐memory:  a  vague  term  mainly  implying  not  using  disks   for  immediate  data  access  
  • 26. BI  is  using  broken  metaphors   We  think  of  BI  as  publishing,  which  is  only  one  part.  
  • 27. Most  BI  is  built  on  an  outdated  interac8on  model  
  • 28. Result  of  a  poor  interac8on  model   Delayed  interac<on  disrupts  work   "...each second of system response degradation leads to a similar degradation added to the user's time for the following [command]. This phenomenon seems to be related to an individual's attention span. The traditional model of a person thinking after each system response appears to be inaccurate. Instead, people seem to have a sequence of actions in mind, contained in a short-term mental memory buffer. Increases in SRT [system response time] seem to disrupt the thought processes, and this may result in having to rethink the sequence of actions to be continued.“ Note nonlinearity in graph, an indication that something important is happening. “The Economic Value of Rapid Response Time “, IBM 1982
  • 29.  Tradi8onal  BI  fails  to  put  users  into  the  flow  zone   Flow  (Csíkszentmihályi)   ▪  Concept  of  engagement  and   immersion  in  a  task     ▪  The  appropriate  applica7on   of  tools  and  knowledge  to   analy7cal  problems  enables   produc7vity.   ▪  The  s7lted  interac7on  of  BI   disrupts  flow.    
  • 30. Interac8on  8mescale  for  analysis  problems   Un7l  you  resolve  this  task  performance  gap,  real  analysis   work  is  a  challenge  (and  a  reason  why  Excel  remains   popular).   Days Hours Minutes Seconds Instantaneous come back tomorrow go to lunch take a break get some coffee check email/FB take a sip of coffee immerse yourself in work Flow is possible only in the “less than 3 second” range
  • 31. Future-­‐proofing   The  tool  market  is  shiIing,   driven  by  new  architectures   that  are  enabled  by  new   technologies.     Front-­‐end  tools  are  evolving   away  from  BI-­‐as-­‐publishing,   which  is  going  to  increase   the  burden  on  the  back  end   data  stores  and  cause   interac7on  problems.   You  need  to  evaluate  tools   based  on  more  detailed   usage  scenarios  and   interac7ve  capabili7es,  less   on  report-­‐building  features.  
  • 32. BI  should  support  two  sets  of  ac8ons.  One  is  monitoring   the  known,  one  is  analyzing  the  unknown.   Collect new data Monitor Analyze Exceptions Analyze Causes Decide Act No problem No idea Do nothing Act on the process Usually days/longer timeframe Act within the process Usually real-time to daily
  • 33. The real BI design point: context and point of use Information use is diverse and varies based on context: ▪  Get a quick answer ▪  Solve a one-off problem ▪  Make repetitive decisions ▪  Monitor routine processes ▪  Make complex decisions ▪  Choose a course of action ▪  Convince others to take action Different problems require different response times in order to be effective.
  • 34. How  expensive  was  performance?  500  GB  DW…   Maximum Capacities •  2 to 30 100MHz Intel Pentium processors •  Up to 3.5GB system memory •  Up to 1.7TB of on-line storage Base Configuration •  18 slot Sequent bus chassis •  1 Proc card - dual 100MHz Pentium CPUs •  1 2.1GB SCSI boot disk •  1 CD-ROM/QIC-525 1/4” Tape •  1 Memory controller (64MB, 256MB) •  1 Integrated Ethernet •  5-slot VMEbus chassis •  Room for 3 additional 5.25” devices Expansion Options •  Up to 400 SCSI-2 disks •  Up to 29 VMEbus slots •  Up to 8 QCIC I/O controllers •  Token Ring, FDDI LAN adapters •  Sync or Async communications ports Price: $1.6 million in 1993
  • 35. OLAP  was  a  response-­‐8me  answer   The  Codd  OLAP  paper  wriPen  for  a  vendor  in   1993:  state  of  the  art  client  technology  was  the  60   Mhz  Intel  Pen7um,  Windows  version  3.1;  server   tech  was  the  $1M+  database  server     It’s  s7ll  hard  to  get  less  than  3  second  response   7mes  from  a  round-­‐trip  to  a  DB     It’s  s7ll  hard  to  get  interac7on  right  when  the  BI   model  is  mainly  compose-­‐compile-­‐execute.    
  • 36. You lied about it being in-memory I didn’t say it would all fit in at the same time…
  • 37. Differen8a8ng  in-­‐memory  claims   Tool  vs  PlaEorm:  OLAP  is  (generally)  in-­‐memory   technology;  there  are  tradeoffs  in  the  choice   PlaEorm:   a)  Conven<onal:  use  a  large  buffer  pool  and  cache  or  pin   everything  in  memory.  Speeds  up  a  DB,  but  not  really   “in-­‐memory”.   b)  Memory  op<mized:  designed  assuming  all  or  mostly  in   memory;  map  the  data  needed  for  opera7ons  to   memory  and/or  add  features  to  recognize  and  use   large-­‐memory  configura7ons.   c)  In-­‐memory:  purpose-­‐built,  the  en7re  database  is   resident  in  main  memory;  the  only  disk  access  is   loading  on  a  cold  start  or  logging  changes.  
  • 38. Some  ques8ons  to  start  discussion   1.  Will  this  work  with  any  database  back-­‐end?   2.  Who  are  these  features  aimed  at:  end  users  or  the   people  who  define  structures  and  manage  data  for  the   end  users?   3.  Are  cube  defini7ons  sta7c  in  this  model?   4.  Can  cubes  be  populated  in  slices  or  layers  based  on  what   a  person  is  looking  at?   5.  How  do  the  caching  improvements  address  cube-­‐ building  7mes?   6.  Is  this  addressing  sta7c  performance  management  or   dynamic?   7.  Are  virtual  cubes  defined  by  the  user  or  admin  or  can   they  be  automa7c?  
  • 39. About  the  Presenter   Mark  Madsen  is  president  of  Third   Nature,  a  technology  research  and   consul7ng  firm  focused  on  business   intelligence,  data  integra7on  and  data   management.  Mark  is  an  award-­‐winning   author,  architect  and  CTO  whose  work   has  been  featured  in  numerous  industry   publica7ons.  Over  the  past  ten  years   Mark  received  awards  for  his  work  from   the  American  Produc7vity  &  Quality   Center,  TDWI,  and  the  Smithsonian   Ins7tute.  He  is  an  interna7onal  speaker,   a  contributor  at  Forbes  Online  and   Informa7on  Management.  For  more   informa7on  or  to  contact  Mark,  follow   @markmadsen  on  TwiPer  or  visit     hPp://ThirdNature.net    
  • 40. About  Third  Nature   Third Nature is a research and consulting firm focused on new and emerging technology and practices in analytics, business intelligence, and performance management. If your question is related to data, analytics, information strategy and technology infrastructure then you‘re at the right place. Our goal is to help companies take advantage of information-driven management practices and applications. We offer education, consulting and research services to support business and IT organizations as well as technology vendors. We fill the gap between what the industry analyst firms cover and what IT needs. We specialize in product and technology analysis, so we look at emerging technologies and markets, evaluating technology and hw it is applied rather than vendor market positions.
  • 41. CC  Image  AWribu8ons   Thanks  to  the  people  who  supplied  the  crea7ve  commons  licensed  images  used  in  this  presenta7on:   train_to_sea.jpg  -­‐  hPp://www.flickr.com/photos/innoxiuss/457069767/   well  town  hall.jpg  -­‐  hPp://flickr.com/photos/tuinkabouter/1135560976/              
  • 42. Twitter Tag: #briefr The Briefing Room
  • 43. Twitter Tag: #briefr The Briefing Room September: ANALYTICS October: DATA PROCESSING Upcoming Topics www.insideanalysis.com
  • 44. Twitter Tag: #briefr The Briefing Room Thank You for Your Attention