SlideShare a Scribd company logo
How Real Time Data
Requirements Change the Data
Warehouse Environment
Mark Madsen – September 17, 2008
www.ThirdNature.net




                     Attribution-NonCommercial-No Derivative
                     http://creativecommons.org/licenses/by-nc-nd/3.0/us/
Outline
 What’s real-time about?
 Impacts on the data
 warehouse architecture
        Delivering data to users
        Extracting the data
        Storing the data
        Operations
 Getting started


Third Nature, January 2008    Mark Madsen   Slide 2
Speeding Up the Data Warehouse



Why?
Faster reaction time

Reduced decision time

New process capabilities




Third Nature, January 2008   Mark Madsen   Slide 3
Which Decisions Benefit?
                             Strategic                 Operational
    Decision time            flexible, long cycle
                                         constrained, short
                                         cycle
    Decision scope broad, organizational narrow, departmental
                                         or process
    Decision model Complex                             Simple

    Data latency             High, history is core     Low, recent data is
                             to decisions              core to decisions

    Data scope               Many sources, many Few sources,
                             types, aggregated  structured, detailed
      Most real time needs will be driven by operational decision
      making, not strategic decisions.
Third Nature, January 2008               Mark Madsen                         Slide 4
Strategy, Decisions and Data Latency

Goal                          Increase share of low to mid market customers


Strategy               Reduce cost of products sold             Improve promotional performance



Tactics                      Efficient sourcing                     Decrease Out of Stocks




        Consolidate suppliers           Improve delivery compliance          Catch out of stocks
                                                                              before they occur


BI Needs
                    Reports &                Dashboards, alerts               Real time alerts &
                  spreadsheets                 & scorecards                  embedded analytics
Third Nature, January 2008                        Mark Madsen                                 Slide 5
What People Are Doing Today
                              Monthly      W eekly   Daily       Multiple times per day      On demand


  2002                  32%                    34%                               69%                      15%         6%




  2004              27%                  29%                         65%                       30%              19%




  2006       3                24                                  44                                 29



         0%            10%         20%         30%   40%          50%      60%         70%     80%        90%         100%
                                                                                                Sources: TDWI, Gartner


         At the same time, data volumes are rising for most data
         warehouses at 50% to 100% per year.

Third Nature, January 2008                                   Mark Madsen                                              Slide 6
BI Efforts Involving Real Time Data Access
                             Terms you may hear from the
                             BI market that imply real time:
                                   Operational BI
                                   Embedded analytics
                                   Decision automation
                                   Complex event processing
                                   Event-driven BI
                                   Process-driven BI

                             They are all similar in
                             requiring some level of low
                             latency data access.
Third Nature, January 2008   Mark Madsen                      Slide 7
Impacts on the DW Architecture
     Databases Dashboards           OLAP     Productivity   BAM/BPM         Reporting   Analytics Applications




                                                Data Consumers

                                                   Delivery

                             DW Platforms                                                        Adding current
                                                                                                 data to the system
                                     Warehouse                      Mart                         requires effort at all
                                     Database                                                    three layers
                                                                 Content
                                           ODS                    Store


                                  ETL                   EDR                    EII

      Databases              Documents     Flat Files       XML             Queues        ERP      Applications



                                            Source Environments
Third Nature, January 2008                                    Mark Madsen                                         Slide 8
One Architecture or Two?
   In-line with process:
                                                                   RT BI
      • Real time data flows separately
        from the warehouse data
      • May include a low-latency data
        store in the real time environment                         Process

      • This model be needed for
        extremely low latency data
                                                                   BI
      • More applicable for event-driven
                                                      Batch   DW



   Out of band:
      • Data to the consumer first flows                           Process
        through the DW
      • Unified architecture for both low
        and high latency data                                      BI &
                                                                   RT BI
      • More applicable for on-demand                  DW


Third Nature, January 2008              Mark Madsen                     Slide 9
User Interface: Two BI Usage Models
                                Demand driven
                                  • Users ask for current data
                                  • Most BI tools work this way
                                  • Harder to adapt these tools to
                                    event-driven models


                                Event driven
                                  • System takes action based on
                                    data, e.g. alerts, rule engines
                                  • May not have (or need) an end
                                    user interface
                                  • Need understanding of decision
                                    & action process for this model
Third Nature, January 2008   Mark Madsen                          Slide 10
BI Tools Need New Capabilities
    Embedding BI within
    applications
        • UI embedding
        • Full embedding
    Event-based integration
    Feeding BI data to
    applications: services, not
    SQL, may be desired

    Custom UI code may be
    preferable to a BI tool

Third Nature, January 2008    Mark Madsen   Slide 11
The Data Integration Layer
    • Integration is the most complex
      element of adding real time data.
    • Inline vs. out of band, demand vs.
      event-driven BI usage create
      different DI requirements.
    • You may not have exactly the
      same metrics, attributes or data
      extract logic.
    • Don’t count on replacing the ETL
      batch; more likely you are
      augmenting it.
    • You probably need to add new DI
      technologies to your portfolio.
    • Batch performance design isn’t
      like real time design.
Third Nature, January 2008       Mark Madsen   Slide 12
Speeding Up Data Integration Methods


      Single batch

                             Frequent batch

                                       Mini-batch

                                                            Continuous load

                                                                      Streaming



          Hourly+                                                        Immediate


Third Nature, January 2008                    Mark Madsen                         Slide 13
The Platform Layer: Data and Database
                                   • Schemas will need changes.
                                   • You don’t need to convert the
                                     entire database to a real time
                                     schema.
                                   • One schema or two?
                                   • Event-driven BI creates
                                     different query patterns and
                                     workloads.
                                   • Configuration and tuning may
                                     be different than what you are
                                     used to with traditional BI.
                                   • Application developers want
                                     services or ORMs, not SQL.


Third Nature, January 2008   Mark Madsen                       Slide 14
Different Platform Workloads
        Databases Dashboards        OLAP     Productivity   BAM/BPM   Reporting   Analytics Applications




                                                Data Consumers

                                                   Delivery

                             DW Platforms                                                 Three workloads:
                                                                                            Data loading +
                                     Warehouse                    Mart                      Normal BI +
                                     Database                                               Real time BI
                                                               Content                    = complications
                                           ODS                  Store


                                  ETL                   EDR              EII

         Databases           Documents     Flat Files       XML       Queues        ERP      Applications




Third Nature, January 2008
                                            Source Environments
                                                     Mark Madsen                                            Slide 15
Development, Maintenance & Operations
                                    • Real time decisions on real
                                      time data mean data
                                      quality plays a larger role,
                                      and it’s harder to address.
                                    • Warehouse availability
                                      becomes much more
                                      important to the business,
                                      and it isn’t just the
                                      database – it’s everything.
                                    • Performance and meeting
                                      strict BI SLAs will rise in
                                      importance since you are
                                      now tied in to business
                                      operations.

Third Nature, January 2008   Mark Madsen                      Slide 16
A Prescription for Getting Started
    1. Star with a decision
       process
    2. Define data needs for the
       process
    3. Ensure that data is
       available at the right
       latency
    4. Determine appropriate
       data integration
       technologies.
    5. Design and initiate
       upstream work
    6. Build
Third Nature, January 2008   Mark Madsen   Slide 17
Thanks




Third Nature, January 2008   Mark Madsen   Slide 18
CC Image Attributions
    Thanks to the people who supplied the creative commons licensed images used in this presentation:
    • Divers - http://flickr.com/photos/raveller/
    • Fast dog - http://flickr.com/photos/marinacvinhal/379111290/
    • Febo - http://flickr.com/photos/igor/419425754/
    • Subway - http://flickr.com/photos/neilsphotoalbum/504517855/
    • Cadillac ranch - http://flickr.com/photos/whatknot/179655095/




Third Nature, January 2008                          Mark Madsen                                         Slide 19
About the Presenter
                            Mark Madsen is president of Third
                            Nature, a technology research and
                            consulting firm focused on business
                            intelligence, data integration and
                            data management. Mark is an
                            award-winning author, architect and
                            CTO whose work has been featured
                            in numerous industry publications.
                            Over the past ten years Mark
                            received awards for his work from
                            the American Productivity & Quality
                            Center, TDWI, and the Smithsonian
                            Institute. He is an international
                            speaker, a contributing editor at
                            Intelligent Enterprise, and manages
                            the open source channel at the
                            Business Intelligence Network. For
                            more information or to contact Mark,
                            visit http://ThirdNature.net.
                      Page 20

More Related Content

PPSX
Data warehouse
PDF
Data warehouse architecture
PDF
Next Generation BI: current state and changing product assumptions
PPTX
DATA WAREHOUSING
PPT
Data warehousing
PDF
The technology of the business data lake
PPTX
Introduction to data warehousing
PPTX
Data warehouse
Data warehouse architecture
Next Generation BI: current state and changing product assumptions
DATA WAREHOUSING
Data warehousing
The technology of the business data lake
Introduction to data warehousing

What's hot (20)

PDF
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
PPTX
Data warehousing Demo PPTS | Over View | Introduction
PPT
Data warehouse 101-fundamentals-
PPTX
Enterprise resource planning system & data warehousing implementation
PPT
Warehousing dimension star-snowflake_schemas
PPT
DW 101
PDF
IOUG93 - Technical Architecture for the Data Warehouse - Presentation
PPT
Data Warehouse Basic Guide
PPT
Introduction to data warehousing
PPT
Data warehouse
PDF
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
PPT
Introduction to Data Warehouse
PDF
Rev_3 Components of a Data Warehouse
PDF
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
PPTX
Data warehouse design
PPT
Date warehousing concepts
PPTX
DATA WAREHOUSING
PPTX
Traditional Data-warehousing / BI overview
PPT
Bi presentation to bkk
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
Data warehousing Demo PPTS | Over View | Introduction
Data warehouse 101-fundamentals-
Enterprise resource planning system & data warehousing implementation
Warehousing dimension star-snowflake_schemas
DW 101
IOUG93 - Technical Architecture for the Data Warehouse - Presentation
Data Warehouse Basic Guide
Introduction to data warehousing
Data warehouse
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Introduction to Data Warehouse
Rev_3 Components of a Data Warehouse
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Data warehouse design
Date warehousing concepts
DATA WAREHOUSING
Traditional Data-warehousing / BI overview
Bi presentation to bkk
Ad

Viewers also liked (20)

PPTX
Traditional data warehouse vs data lake
PDF
Big Data Smart City processes and tools, Real Time data processing tools
PDF
Smart Cities Datasets and Applications Ecosystem: An Exploratory Analysis
PPTX
Realtime streaming architecture in INFINARIO
PDF
Telco Big Data Workshop Sample
PDF
Sharing & Sustaining Ecosystem Data
PDF
Real-time information analysis: social networks and open data
PDF
Banking & Smart City Ecosystem
PPTX
Oracle BI publisher intro
PDF
Big Data for Smart City
PDF
Km4city Smart City Ecosystem Urban Platform
PPTX
Smart Cities and the Value of Ecosystem Services
PDF
Big data real time architectures
PPS
Oracle XML Publisher / BI Publisher
PDF
Smart City Framework
PPTX
Database vs Data Warehouse: A Comparative Review
PPT
lecture 1 information systems and business strategy
PDF
Introduction to ETL and Data Integration
PPTX
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
PDF
An Introduction into the design of business using business architecture
Traditional data warehouse vs data lake
Big Data Smart City processes and tools, Real Time data processing tools
Smart Cities Datasets and Applications Ecosystem: An Exploratory Analysis
Realtime streaming architecture in INFINARIO
Telco Big Data Workshop Sample
Sharing & Sustaining Ecosystem Data
Real-time information analysis: social networks and open data
Banking & Smart City Ecosystem
Oracle BI publisher intro
Big Data for Smart City
Km4city Smart City Ecosystem Urban Platform
Smart Cities and the Value of Ecosystem Services
Big data real time architectures
Oracle XML Publisher / BI Publisher
Smart City Framework
Database vs Data Warehouse: A Comparative Review
lecture 1 information systems and business strategy
Introduction to ETL and Data Integration
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
An Introduction into the design of business using business architecture
Ad

Similar to How Real TIme Data Changes the Data Warehouse (20)

PDF
Intel Cloud summit: Big Data by Nick Knupffer
PDF
Building a business intelligence architecture fit for the 21st century by Jon...
PDF
White Paper Data Quality Process Design For Ad Hoc Reporting
PPTX
Big data? No. Big Decisions are What You Want
PDF
Customer summit - big data (final)
PDF
Intel Cloud Summit: Big Data
PPTX
IBM Smarter Business 2012 - PureSystems - PureData
PPTX
Introduction To Data Vault - DAMA Oregon 2012
PPTX
The New Enterprise Data Platform
PDF
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
PPTX
Outboard Feel Good NLS
PPTX
Anexinet Big Data Solutions
PDF
Big Data Beyond Hadoop*: Research Directions for the Future
PDF
Introduction to Modern Data Virtualization 2021 (APAC)
PDF
Bridging the Last Mile: Getting Data to the People Who Need It
PDF
OSC2012: Big Data Using Open Source: Netapp Project - Technical
PDF
Big data appliances for BI on Cloud
PDF
Talk IT_ Oracle_김태완_110831
PDF
Barak regev
PDF
Analyze This! Best Practices For Big And Fast Data
 
Intel Cloud summit: Big Data by Nick Knupffer
Building a business intelligence architecture fit for the 21st century by Jon...
White Paper Data Quality Process Design For Ad Hoc Reporting
Big data? No. Big Decisions are What You Want
Customer summit - big data (final)
Intel Cloud Summit: Big Data
IBM Smarter Business 2012 - PureSystems - PureData
Introduction To Data Vault - DAMA Oregon 2012
The New Enterprise Data Platform
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Outboard Feel Good NLS
Anexinet Big Data Solutions
Big Data Beyond Hadoop*: Research Directions for the Future
Introduction to Modern Data Virtualization 2021 (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It
OSC2012: Big Data Using Open Source: Netapp Project - Technical
Big data appliances for BI on Cloud
Talk IT_ Oracle_김태완_110831
Barak regev
Analyze This! Best Practices For Big And Fast Data
 

More from mark madsen (20)

PDF
Data Architecture: OMG It’s Made of People
PDF
Solve User Problems: Data Architecture for Humans
PDF
The Black Box: Interpretability, Reproducibility, and Data Management
PDF
Operationalizing Machine Learning in the Enterprise
PDF
Building a Data Platform Strata SF 2019
PDF
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
PDF
Architecting a Platform for Enterprise Use - Strata London 2018
PDF
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
PDF
How to understand trends in the data & software market
PDF
Pay no attention to the man behind the curtain - the unseen work behind data ...
PDF
Assumptions about Data and Analysis: Briefing room webcast slides
PDF
Everything Has Changed Except Us: Modernizing the Data Warehouse
PDF
A Pragmatic Approach to Analyzing Customers
PDF
Disruptive Innovation: how do you use these theories to manage your IT?
PDF
Briefing room: An alternative for streaming data collection
PDF
Building the Enterprise Data Lake: A look at architecture
PDF
Briefing Room analyst comments - streaming analytics
PDF
Everything has changed except us
PDF
Bi isn't big data and big data isn't BI (updated)
PDF
On the edge: analytics for the modern enterprise (analyst comments)
Data Architecture: OMG It’s Made of People
Solve User Problems: Data Architecture for Humans
The Black Box: Interpretability, Reproducibility, and Data Management
Operationalizing Machine Learning in the Enterprise
Building a Data Platform Strata SF 2019
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Platform for Enterprise Use - Strata London 2018
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
How to understand trends in the data & software market
Pay no attention to the man behind the curtain - the unseen work behind data ...
Assumptions about Data and Analysis: Briefing room webcast slides
Everything Has Changed Except Us: Modernizing the Data Warehouse
A Pragmatic Approach to Analyzing Customers
Disruptive Innovation: how do you use these theories to manage your IT?
Briefing room: An alternative for streaming data collection
Building the Enterprise Data Lake: A look at architecture
Briefing Room analyst comments - streaming analytics
Everything has changed except us
Bi isn't big data and big data isn't BI (updated)
On the edge: analytics for the modern enterprise (analyst comments)

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPT
Teaching material agriculture food technology
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Cloud computing and distributed systems.
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
A Presentation on Artificial Intelligence
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Spectral efficient network and resource selection model in 5G networks
Reach Out and Touch Someone: Haptics and Empathic Computing
Diabetes mellitus diagnosis method based random forest with bat algorithm
Building Integrated photovoltaic BIPV_UPV.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Teaching material agriculture food technology
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Cloud computing and distributed systems.
Dropbox Q2 2025 Financial Results & Investor Presentation
A Presentation on Artificial Intelligence
Per capita expenditure prediction using model stacking based on satellite ima...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
NewMind AI Monthly Chronicles - July 2025
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Review of recent advances in non-invasive hemoglobin estimation
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy

How Real TIme Data Changes the Data Warehouse

  • 1. How Real Time Data Requirements Change the Data Warehouse Environment Mark Madsen – September 17, 2008 www.ThirdNature.net Attribution-NonCommercial-No Derivative http://creativecommons.org/licenses/by-nc-nd/3.0/us/
  • 2. Outline What’s real-time about? Impacts on the data warehouse architecture Delivering data to users Extracting the data Storing the data Operations Getting started Third Nature, January 2008 Mark Madsen Slide 2
  • 3. Speeding Up the Data Warehouse Why? Faster reaction time Reduced decision time New process capabilities Third Nature, January 2008 Mark Madsen Slide 3
  • 4. Which Decisions Benefit? Strategic Operational Decision time flexible, long cycle constrained, short cycle Decision scope broad, organizational narrow, departmental or process Decision model Complex Simple Data latency High, history is core Low, recent data is to decisions core to decisions Data scope Many sources, many Few sources, types, aggregated structured, detailed Most real time needs will be driven by operational decision making, not strategic decisions. Third Nature, January 2008 Mark Madsen Slide 4
  • 5. Strategy, Decisions and Data Latency Goal Increase share of low to mid market customers Strategy Reduce cost of products sold Improve promotional performance Tactics Efficient sourcing Decrease Out of Stocks Consolidate suppliers Improve delivery compliance Catch out of stocks before they occur BI Needs Reports & Dashboards, alerts Real time alerts & spreadsheets & scorecards embedded analytics Third Nature, January 2008 Mark Madsen Slide 5
  • 6. What People Are Doing Today Monthly W eekly Daily Multiple times per day On demand 2002 32% 34% 69% 15% 6% 2004 27% 29% 65% 30% 19% 2006 3 24 44 29 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Sources: TDWI, Gartner At the same time, data volumes are rising for most data warehouses at 50% to 100% per year. Third Nature, January 2008 Mark Madsen Slide 6
  • 7. BI Efforts Involving Real Time Data Access Terms you may hear from the BI market that imply real time: Operational BI Embedded analytics Decision automation Complex event processing Event-driven BI Process-driven BI They are all similar in requiring some level of low latency data access. Third Nature, January 2008 Mark Madsen Slide 7
  • 8. Impacts on the DW Architecture Databases Dashboards OLAP Productivity BAM/BPM Reporting Analytics Applications Data Consumers Delivery DW Platforms Adding current data to the system Warehouse Mart requires effort at all Database three layers Content ODS Store ETL EDR EII Databases Documents Flat Files XML Queues ERP Applications Source Environments Third Nature, January 2008 Mark Madsen Slide 8
  • 9. One Architecture or Two? In-line with process: RT BI • Real time data flows separately from the warehouse data • May include a low-latency data store in the real time environment Process • This model be needed for extremely low latency data BI • More applicable for event-driven Batch DW Out of band: • Data to the consumer first flows Process through the DW • Unified architecture for both low and high latency data BI & RT BI • More applicable for on-demand DW Third Nature, January 2008 Mark Madsen Slide 9
  • 10. User Interface: Two BI Usage Models Demand driven • Users ask for current data • Most BI tools work this way • Harder to adapt these tools to event-driven models Event driven • System takes action based on data, e.g. alerts, rule engines • May not have (or need) an end user interface • Need understanding of decision & action process for this model Third Nature, January 2008 Mark Madsen Slide 10
  • 11. BI Tools Need New Capabilities Embedding BI within applications • UI embedding • Full embedding Event-based integration Feeding BI data to applications: services, not SQL, may be desired Custom UI code may be preferable to a BI tool Third Nature, January 2008 Mark Madsen Slide 11
  • 12. The Data Integration Layer • Integration is the most complex element of adding real time data. • Inline vs. out of band, demand vs. event-driven BI usage create different DI requirements. • You may not have exactly the same metrics, attributes or data extract logic. • Don’t count on replacing the ETL batch; more likely you are augmenting it. • You probably need to add new DI technologies to your portfolio. • Batch performance design isn’t like real time design. Third Nature, January 2008 Mark Madsen Slide 12
  • 13. Speeding Up Data Integration Methods Single batch Frequent batch Mini-batch Continuous load Streaming Hourly+ Immediate Third Nature, January 2008 Mark Madsen Slide 13
  • 14. The Platform Layer: Data and Database • Schemas will need changes. • You don’t need to convert the entire database to a real time schema. • One schema or two? • Event-driven BI creates different query patterns and workloads. • Configuration and tuning may be different than what you are used to with traditional BI. • Application developers want services or ORMs, not SQL. Third Nature, January 2008 Mark Madsen Slide 14
  • 15. Different Platform Workloads Databases Dashboards OLAP Productivity BAM/BPM Reporting Analytics Applications Data Consumers Delivery DW Platforms Three workloads: Data loading + Warehouse Mart Normal BI + Database Real time BI Content = complications ODS Store ETL EDR EII Databases Documents Flat Files XML Queues ERP Applications Third Nature, January 2008 Source Environments Mark Madsen Slide 15
  • 16. Development, Maintenance & Operations • Real time decisions on real time data mean data quality plays a larger role, and it’s harder to address. • Warehouse availability becomes much more important to the business, and it isn’t just the database – it’s everything. • Performance and meeting strict BI SLAs will rise in importance since you are now tied in to business operations. Third Nature, January 2008 Mark Madsen Slide 16
  • 17. A Prescription for Getting Started 1. Star with a decision process 2. Define data needs for the process 3. Ensure that data is available at the right latency 4. Determine appropriate data integration technologies. 5. Design and initiate upstream work 6. Build Third Nature, January 2008 Mark Madsen Slide 17
  • 18. Thanks Third Nature, January 2008 Mark Madsen Slide 18
  • 19. CC Image Attributions Thanks to the people who supplied the creative commons licensed images used in this presentation: • Divers - http://flickr.com/photos/raveller/ • Fast dog - http://flickr.com/photos/marinacvinhal/379111290/ • Febo - http://flickr.com/photos/igor/419425754/ • Subway - http://flickr.com/photos/neilsphotoalbum/504517855/ • Cadillac ranch - http://flickr.com/photos/whatknot/179655095/ Third Nature, January 2008 Mark Madsen Slide 19
  • 20. About the Presenter Mark Madsen is president of Third Nature, a technology research and consulting firm focused on business intelligence, data integration and data management. Mark is an award-winning author, architect and CTO whose work has been featured in numerous industry publications. Over the past ten years Mark received awards for his work from the American Productivity & Quality Center, TDWI, and the Smithsonian Institute. He is an international speaker, a contributing editor at Intelligent Enterprise, and manages the open source channel at the Business Intelligence Network. For more information or to contact Mark, visit http://ThirdNature.net. Page 20