SlideShare a Scribd company logo
The First Step in Information Management
looker.com
Produced by:
MONTHLY SERIES
In partnership with:
Simplifying Data Lake and Modern BI Architecture
February 1, 2018
Sponsored by:
Data Insights and Analytics: Simplifying Data Lake and Modern BI Architecture
2
Produc
t
Market
ing
Manag
er,
Looker
Kenny Cunanan
Product Marketing Manager & Database Expert
Some things get better automatically
3
Some things don’t
4
Some things don’t
5
Some things don’t
6
Some things don’t
7
Today
8
The Future
9
10
11
12
13
14
15
Data Insights and Analytics: Simplifying Data Lake and Modern BI Architecture
 What “Simplifying” Really Means
 Processes for Modern Business Intelligence (BI) Architecture
 Deployment Process Requirements
 Bridging the Gap: Traditional BI to Contemporary BI and Data Lakes
 Key Takeaways
 Q&A
Let’s Keep Things Simple Today …
pg 2First San Francisco Partners www.firstsanfranciscopartners.com© 2018
www.firstsanfranciscopartners.com
What “Simplifying” Really Means
Everything should be made as simple as possible, but not simpler.
– Albert Einstein
“Architecture” Defined
 The art and discipline of designing buildings and structures, from the
macro-level of urban planning to the micro-level of creating furniture
and machine parts.
 The design of any complex object or system. It may refer to the implied architecture of
abstract things such as music or mathematics, the apparent architecture of natural
things such as geological formations or living things, or explicitly planned architecture
of human-made things such as buildings, machines, organizations, processes, software
and databases.
 The organized arrangement of component elements to optimize the function,
performance, feasibility, cost and/or aesthetics of an overall structure.
pg 4© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
From The DAMA Guide to the Data Management Body of Knowledge
Simply put, it’s
about the
architecture.
Simplifying Doesn’t Mean It’s Simple
pg 5First San Francisco Partners www.firstsanfranciscopartners.com© 2018
“Simplified” architecture:
 Easier to use
 More flexible
 More consensus on its structure
 Easier to support “realistic” self-service capabilities
 Fewer modifications are required
 Easier to manage and govern
 Isn’t easily disrupted (broken)
 Simplicity can come from lessons learned
www.firstsanfranciscopartners.com
Processes Needed to Derive
Modern BI Architecture
The architect should strive continually to simplify;
the ensemble of the rooms should then be carefully considered
that comfort and utility may go hand in hand with beauty.
– Architect Frank Lloyd Wright
Two Lenses to Derive an Effective Architecture
pg 7© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
Form
Developing the
architecture so all
stakeholders can
actually understand
and develop it
Progression
Develop architectures
that are best fit for
purpose and effective,
no matter how simple
or complex
Simple can be harder than complex.
You have to work hard to get your thinking clean to make it simple.
But it's worth it in the end, because once you get there you can move mountains.
– Steve Jobs
 Business needs
 Organizational culture
 Data characteristics
(latency, volumes and quality)
 Understand the data landscape
 Understand what you have now
Understand Architecture Characteristics
pg 8First San Francisco Partners www.firstsanfranciscopartners.com© 2018
Simply put,
use your
own data.
Granularity
Fact Volatility
Dimensional Complexity
Dimensional Volatility
“Historicity”
Latency
Cross Functionality/Distribution
Size
Source Complexity
Frequency
Response Time
Follow-up Time
Data Quality
Availability
Persistency
Access type
Algorithm Complexity
Content Variety
 Input funnel/what comes out is what you need
 Define the decision-making process
 Possible scenarios:
− What you have to use (existing department
database and tools, even Excel)
− What you could use (gap analysis)
 Set policies on what’s allowed and not allowed
 Manage what goes in the Data Lake
Use the Required Characteristics to Stay Simple
pg 9First San Francisco Partners www.firstsanfranciscopartners.com© 2018
Simply put, be aligned
with your business.
Your architecture
Landscape
Current
State
Business
Needs
Reality of the Data Lake
pg 10© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
 The Data Lake has changed due to storage availability, data management tools
and ease of which data can be managed.
 Today’s Data Lake is comprised of:
‒ Landing Zone
‒ Standardization Zone
‒ Analytics Sandbox
Reality of the Data Lake
pg 11© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
LANDING ZONE STANDARDIZATION ZONE ANALYTICS SANDBOX
DATA GOVERNANCE
DATA CONSUMERS
DATA OPERATIONS
DATA SOURCES
DATA SCIENTISTS
DATA MANAGEMENT
Data Consumers
pg 12© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
Data Access Layer
Portals
Report, BI,
Query
Workbenches Labs
Data Services, Data Virtualization, ETL
Mobile
Data Logistics
DATA CONSUMERS
www.firstsanfranciscopartners.com
Deployment Process Requirements
My goal is to simplify complexity.
I just want to build stuff that really simplifies our base human interaction.
– Twitter Co-Founder Jack Dorsey
Have a Methodology
 Establish (but with a defined architecture)
a sandbox or proof of concept
 Define the vision of value and return
 Perform alignment
 Assess culture and organizational readiness
 Define long-term requirements for use
 Define operating models
 Design the BI/analytics architecture
 Develop a realistic roadmap
 Transition to a sustainable architecture
pg 14© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
Copyright: First San Francisco Partners, 2017
REQUIREMENTS ROADMAP
OPERATING
MODEL
MEASUREMENT
AND SUSTAINMENT
ARCHITECTURE
AND DESIGN
IMPLEMENTATION
AND OPERATION
STRATEGIZE ACT
ENVISION
AND ALIGN
ASSESS
DISCOVER
INITIATE
Data Operations
Data Design
Data Requirements and Discovery
Data Capability Development
Data-Centric
Development
Life Cycle
(High Level)
Source Data
Discovery
Target Data
Architecture
Target Data
Modeling
Target Database
Build
Quality Assurance
Production
Migration
Production Data
Quality Monitoring
Information
Requirements
Map Source Data
to Information
Requirements
Source Data
Analysis
ETL Development
Report
Development
Architecture and
Design
Roadmap
Operating
Model
Strategy
Simply put,
you have to
execute.
Deployment Process
Technology
Rationalization
pg 15
www.firstsanfranciscopartners.com
Bridging the Gap:
Traditional BI to Contemporary BI
and Data Lakes
Simplicity is the ultimate sophistication.
– The Original Renaissance Man, Leonardo da Vinci
Traditional EDW blended with Data Lake
Bridging the Gap
pg 17© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
 Lack of agility
 Performance
 Hard to extend
 Structured data only
 Missed expectations
 Enables experimentation
 Satisfies timing and
turnaround issues
 Allows unstructured data
 Mature and useful
technology advances
Bridging the Gap
pg 18© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
Data Lake technology to leap frog Data Warehouse
Traditional EDW blended with Data Lake
 Lack of agility
 Performance
 Hard to extend
 Structured data only
 Missed expectations
 Enables experimentation
 Satisfies timing and
turnaround issues
 Allows unstructured data
 Mature and useful
technology advances
Organizations without
Data Warehouse simply
start with a Data Lake
Or organizations that need to
evolve their warehouse
CAREFULLY replace it with a
Data Lake
− Gather the data on your characteristics
− Align how you will use it with business needs
− Remember 30 years of lessons learned
Replacing an Enterprise Data Warehouse
pg 19First San Francisco Partners www.firstsanfranciscopartners.com© 2018
Simply put, form
follows function.
− Assume the characteristics are the same
− Blindly follow a reference architecture
− Just lift tables over to the lake
− Build it and they will come (they still won’t)
FSFP Reference Architecture
 Like an I-beam, the
data architecture
needs to take the
load of meeting
business objectives,
and distribute that
load to supportive
structures
pg 20© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
DATA INSIGHT ARCHITECTURE
Wrangling
Layer
Management Layer
Data Access Layer
Business Strategy
FSFP Reference Architecture
DATA INSIGHT ARCHITECTURE
pg 21© 2018 First San Francisco Partners www.firstsanfranciscopartners.com
1
Data Life
Cycles
Management
Data Usage
Vintage Area Contemporary Area
Business Strategy
Legacy BI and Reporting
Data Warehouse, ODS, Mart
ETL, EAI, Replication
Data Lake, Pond
NoSQL (HDFS, Graph)
Advanced Analytics
RDBMS, SQL, In-Memory
Appliance
Metadata Lineage Reference Data
Alignment
Data Monetization
Visualization DataWranglingMobile Logical DW
Unstructured Data
www.firstsanfranciscopartners.com
Key Takeaways and Q&A
KISS.
(Keep it simple, sweetheart.)
– Author Unknown
 Simplifying means being able to use and adapt your
BI/Data Lake architecture without a lot of trauma.
 If your BI/Data Lake architecture reflects your
business environment, it will be easier to
understand and use.
 Blindly adapting an external reference architecture is a formula for
confusion, i.e., complexity.
 Leverage what you have – i.e., the knowledge, expertise and opportunities
in your organization.
Key Takeaways
pg 23First San Francisco Partners www.firstsanfranciscopartners.com© 2018
Simply put, don’t
completely reinvent
the wheel.
Questions?
MONTHLY SERIES
Thank you for joining – thanks, also, to
Looker.com for sponsoring the webinar.
Please join our next webinar on Thursday, March 1,
The Importance of Effective Communications in Analytics.
John Ladley @jladley
john@firstsanfranciscopartners.com
Kelle O’Neal @kellezoneal
kelle@firstsanfranciscopartners.com

More Related Content

PDF
Data Management vs Data Strategy
PDF
Governing Big Data, Smart Data, Data Lakes, and the Internet of Things
PDF
DAS Slides: Graph Databases — Practical Use Cases
PDF
The future of bi isn't a bi tool
PDF
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
PDF
Trends in Enterprise Advanced Analytics
PDF
Everybody is a Data Steward – Get Over It!
PDF
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
Data Management vs Data Strategy
Governing Big Data, Smart Data, Data Lakes, and the Internet of Things
DAS Slides: Graph Databases — Practical Use Cases
The future of bi isn't a bi tool
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
Trends in Enterprise Advanced Analytics
Everybody is a Data Steward – Get Over It!
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics

What's hot (20)

PDF
DataEd Slides: Data Architecture vs. Data Modeling – Compare and Contrast
PDF
A Modern Approach to DI & MDM
PDF
DAS Slides: Data Modeling at the Environment Agency of England – Case Study
PDF
Big data as a gateway to knowledge management
PDF
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
PDF
Slides: How AI Makes Analytics More Human
PPTX
How Data is Driving AI Innovation
PDF
DataEd Slides: Approaching Data Management Technologies
PPTX
ADV Slides: Strategies for Transitioning to a Cloud-First Enterprise
PDF
Why Your Data Management Strategy Isn't Working (and How to Fix It)
PDF
DataEd Slides: Exorcising the Seven Deadly Data Sins
PDF
Data Systems Integration & Business Value PT. 3: Warehousing
PDF
ADV Slides: Modern Analytic Data Architecture Maturity Modeling
PDF
Big Data Strategies – Organizational Structure and Technology
PDF
RWDG: Measuring Data Governance Performance
PDF
Data Modeling Fundamentals
PDF
ADV Slides: The World in 2045 – What Has Artificial Intelligence Created?
PDF
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
PDF
Speed Matters - Intelligent Strategies to Accelerate Data-Driven Decisions
PDF
Metadata Strategies - Data Squared
DataEd Slides: Data Architecture vs. Data Modeling – Compare and Contrast
A Modern Approach to DI & MDM
DAS Slides: Data Modeling at the Environment Agency of England – Case Study
Big data as a gateway to knowledge management
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
Slides: How AI Makes Analytics More Human
How Data is Driving AI Innovation
DataEd Slides: Approaching Data Management Technologies
ADV Slides: Strategies for Transitioning to a Cloud-First Enterprise
Why Your Data Management Strategy Isn't Working (and How to Fix It)
DataEd Slides: Exorcising the Seven Deadly Data Sins
Data Systems Integration & Business Value PT. 3: Warehousing
ADV Slides: Modern Analytic Data Architecture Maturity Modeling
Big Data Strategies – Organizational Structure and Technology
RWDG: Measuring Data Governance Performance
Data Modeling Fundamentals
ADV Slides: The World in 2045 – What Has Artificial Intelligence Created?
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
Speed Matters - Intelligent Strategies to Accelerate Data-Driven Decisions
Metadata Strategies - Data Squared
Ad

Similar to Data Insights and Analytics: Simplifying Data Lake and Modern BI Architecture (20)

PPTX
Advanced Databases and Knowledge Management
PDF
Data Lake Architecture
PPTX
MTWO Complete Construction Cloud for Contractors
PDF
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
PDF
DI&A Webinar: Big Data Analytics
PPTX
TechEvent DWH Modernization
PDF
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
PDF
Data Architecture Strategies
PDF
Abhishek jaiswal
PDF
Data-Centric Business Transformation Using Knowledge Graphs
PDF
DAS Slides: Enterprise Architecture vs. Data Architecture
PDF
Analytics, Business Intelligence, and Data Science - What's the Progression?
PDF
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
PDF
Moving EA - from where we are to where we should be
PDF
DI&A Webinar: Building a Flexible and Scalable Analytics Architecture
PDF
How to make your data scientists happy
PDF
Accelerate Self-Service Analytics with Data Virtualization and Visualization
PDF
DI&A Slides: Data-Centric Development
PDF
Integrating Semantic Web in the Real World: A Journey between Two Cities
PDF
Data-Ed Online Webinar: Data Architecture Requirements
Advanced Databases and Knowledge Management
Data Lake Architecture
MTWO Complete Construction Cloud for Contractors
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DI&A Webinar: Big Data Analytics
TechEvent DWH Modernization
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture Strategies
Abhishek jaiswal
Data-Centric Business Transformation Using Knowledge Graphs
DAS Slides: Enterprise Architecture vs. Data Architecture
Analytics, Business Intelligence, and Data Science - What's the Progression?
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
Moving EA - from where we are to where we should be
DI&A Webinar: Building a Flexible and Scalable Analytics Architecture
How to make your data scientists happy
Accelerate Self-Service Analytics with Data Virtualization and Visualization
DI&A Slides: Data-Centric Development
Integrating Semantic Web in the Real World: A Journey between Two Cities
Data-Ed Online Webinar: Data Architecture Requirements
Ad

More from DATAVERSITY (20)

PDF
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
PDF
Data at the Speed of Business with Data Mastering and Governance
PDF
Exploring Levels of Data Literacy
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PDF
Make Data Work for You
PDF
Data Catalogs Are the Answer – What is the Question?
PDF
Data Catalogs Are the Answer – What Is the Question?
PDF
Data Modeling Fundamentals
PDF
Showing ROI for Your Analytic Project
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
PDF
Is Enterprise Data Literacy Possible?
PDF
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing?
PDF
Data Governance Trends - A Look Backwards and Forwards
PDF
Data Governance Trends and Best Practices To Implement Today
PDF
2023 Trends in Enterprise Analytics
PDF
Data Strategy Best Practices
PDF
Who Should Own Data Governance – IT or Business?
PDF
Data Management Best Practices
PDF
MLOps – Applying DevOps to Competitive Advantage
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Data at the Speed of Business with Data Mastering and Governance
Exploring Levels of Data Literacy
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Make Data Work for You
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What Is the Question?
Data Modeling Fundamentals
Showing ROI for Your Analytic Project
How a Semantic Layer Makes Data Mesh Work at Scale
Is Enterprise Data Literacy Possible?
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends and Best Practices To Implement Today
2023 Trends in Enterprise Analytics
Data Strategy Best Practices
Who Should Own Data Governance – IT or Business?
Data Management Best Practices
MLOps – Applying DevOps to Competitive Advantage

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Machine learning based COVID-19 study performance prediction
20250228 LYD VKU AI Blended-Learning.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
sap open course for s4hana steps from ECC to s4
Encapsulation_ Review paper, used for researhc scholars
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
“AI and Expert System Decision Support & Business Intelligence Systems”
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Programs and apps: productivity, graphics, security and other tools
Reach Out and Touch Someone: Haptics and Empathic Computing
Digital-Transformation-Roadmap-for-Companies.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
NewMind AI Weekly Chronicles - August'25 Week I
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Network Security Unit 5.pdf for BCA BBA.
Empathic Computing: Creating Shared Understanding
Understanding_Digital_Forensics_Presentation.pptx
Machine learning based COVID-19 study performance prediction

Data Insights and Analytics: Simplifying Data Lake and Modern BI Architecture

  • 1. The First Step in Information Management looker.com Produced by: MONTHLY SERIES In partnership with: Simplifying Data Lake and Modern BI Architecture February 1, 2018 Sponsored by:
  • 4. Some things get better automatically 3
  • 11. 10
  • 12. 11
  • 13. 12
  • 14. 13
  • 15. 14
  • 16. 15
  • 18.  What “Simplifying” Really Means  Processes for Modern Business Intelligence (BI) Architecture  Deployment Process Requirements  Bridging the Gap: Traditional BI to Contemporary BI and Data Lakes  Key Takeaways  Q&A Let’s Keep Things Simple Today … pg 2First San Francisco Partners www.firstsanfranciscopartners.com© 2018
  • 19. www.firstsanfranciscopartners.com What “Simplifying” Really Means Everything should be made as simple as possible, but not simpler. – Albert Einstein
  • 20. “Architecture” Defined  The art and discipline of designing buildings and structures, from the macro-level of urban planning to the micro-level of creating furniture and machine parts.  The design of any complex object or system. It may refer to the implied architecture of abstract things such as music or mathematics, the apparent architecture of natural things such as geological formations or living things, or explicitly planned architecture of human-made things such as buildings, machines, organizations, processes, software and databases.  The organized arrangement of component elements to optimize the function, performance, feasibility, cost and/or aesthetics of an overall structure. pg 4© 2018 First San Francisco Partners www.firstsanfranciscopartners.com From The DAMA Guide to the Data Management Body of Knowledge
  • 21. Simply put, it’s about the architecture. Simplifying Doesn’t Mean It’s Simple pg 5First San Francisco Partners www.firstsanfranciscopartners.com© 2018 “Simplified” architecture:  Easier to use  More flexible  More consensus on its structure  Easier to support “realistic” self-service capabilities  Fewer modifications are required  Easier to manage and govern  Isn’t easily disrupted (broken)  Simplicity can come from lessons learned
  • 22. www.firstsanfranciscopartners.com Processes Needed to Derive Modern BI Architecture The architect should strive continually to simplify; the ensemble of the rooms should then be carefully considered that comfort and utility may go hand in hand with beauty. – Architect Frank Lloyd Wright
  • 23. Two Lenses to Derive an Effective Architecture pg 7© 2018 First San Francisco Partners www.firstsanfranciscopartners.com Form Developing the architecture so all stakeholders can actually understand and develop it Progression Develop architectures that are best fit for purpose and effective, no matter how simple or complex Simple can be harder than complex. You have to work hard to get your thinking clean to make it simple. But it's worth it in the end, because once you get there you can move mountains. – Steve Jobs
  • 24.  Business needs  Organizational culture  Data characteristics (latency, volumes and quality)  Understand the data landscape  Understand what you have now Understand Architecture Characteristics pg 8First San Francisco Partners www.firstsanfranciscopartners.com© 2018 Simply put, use your own data. Granularity Fact Volatility Dimensional Complexity Dimensional Volatility “Historicity” Latency Cross Functionality/Distribution Size Source Complexity Frequency Response Time Follow-up Time Data Quality Availability Persistency Access type Algorithm Complexity Content Variety
  • 25.  Input funnel/what comes out is what you need  Define the decision-making process  Possible scenarios: − What you have to use (existing department database and tools, even Excel) − What you could use (gap analysis)  Set policies on what’s allowed and not allowed  Manage what goes in the Data Lake Use the Required Characteristics to Stay Simple pg 9First San Francisco Partners www.firstsanfranciscopartners.com© 2018 Simply put, be aligned with your business. Your architecture Landscape Current State Business Needs
  • 26. Reality of the Data Lake pg 10© 2018 First San Francisco Partners www.firstsanfranciscopartners.com  The Data Lake has changed due to storage availability, data management tools and ease of which data can be managed.  Today’s Data Lake is comprised of: ‒ Landing Zone ‒ Standardization Zone ‒ Analytics Sandbox
  • 27. Reality of the Data Lake pg 11© 2018 First San Francisco Partners www.firstsanfranciscopartners.com LANDING ZONE STANDARDIZATION ZONE ANALYTICS SANDBOX DATA GOVERNANCE DATA CONSUMERS DATA OPERATIONS DATA SOURCES DATA SCIENTISTS DATA MANAGEMENT
  • 28. Data Consumers pg 12© 2018 First San Francisco Partners www.firstsanfranciscopartners.com Data Access Layer Portals Report, BI, Query Workbenches Labs Data Services, Data Virtualization, ETL Mobile Data Logistics DATA CONSUMERS
  • 29. www.firstsanfranciscopartners.com Deployment Process Requirements My goal is to simplify complexity. I just want to build stuff that really simplifies our base human interaction. – Twitter Co-Founder Jack Dorsey
  • 30. Have a Methodology  Establish (but with a defined architecture) a sandbox or proof of concept  Define the vision of value and return  Perform alignment  Assess culture and organizational readiness  Define long-term requirements for use  Define operating models  Design the BI/analytics architecture  Develop a realistic roadmap  Transition to a sustainable architecture pg 14© 2018 First San Francisco Partners www.firstsanfranciscopartners.com Copyright: First San Francisco Partners, 2017 REQUIREMENTS ROADMAP OPERATING MODEL MEASUREMENT AND SUSTAINMENT ARCHITECTURE AND DESIGN IMPLEMENTATION AND OPERATION STRATEGIZE ACT ENVISION AND ALIGN ASSESS DISCOVER INITIATE
  • 31. Data Operations Data Design Data Requirements and Discovery Data Capability Development Data-Centric Development Life Cycle (High Level) Source Data Discovery Target Data Architecture Target Data Modeling Target Database Build Quality Assurance Production Migration Production Data Quality Monitoring Information Requirements Map Source Data to Information Requirements Source Data Analysis ETL Development Report Development Architecture and Design Roadmap Operating Model Strategy Simply put, you have to execute. Deployment Process Technology Rationalization pg 15
  • 32. www.firstsanfranciscopartners.com Bridging the Gap: Traditional BI to Contemporary BI and Data Lakes Simplicity is the ultimate sophistication. – The Original Renaissance Man, Leonardo da Vinci
  • 33. Traditional EDW blended with Data Lake Bridging the Gap pg 17© 2018 First San Francisco Partners www.firstsanfranciscopartners.com  Lack of agility  Performance  Hard to extend  Structured data only  Missed expectations  Enables experimentation  Satisfies timing and turnaround issues  Allows unstructured data  Mature and useful technology advances
  • 34. Bridging the Gap pg 18© 2018 First San Francisco Partners www.firstsanfranciscopartners.com Data Lake technology to leap frog Data Warehouse Traditional EDW blended with Data Lake  Lack of agility  Performance  Hard to extend  Structured data only  Missed expectations  Enables experimentation  Satisfies timing and turnaround issues  Allows unstructured data  Mature and useful technology advances Organizations without Data Warehouse simply start with a Data Lake Or organizations that need to evolve their warehouse CAREFULLY replace it with a Data Lake
  • 35. − Gather the data on your characteristics − Align how you will use it with business needs − Remember 30 years of lessons learned Replacing an Enterprise Data Warehouse pg 19First San Francisco Partners www.firstsanfranciscopartners.com© 2018 Simply put, form follows function. − Assume the characteristics are the same − Blindly follow a reference architecture − Just lift tables over to the lake − Build it and they will come (they still won’t)
  • 36. FSFP Reference Architecture  Like an I-beam, the data architecture needs to take the load of meeting business objectives, and distribute that load to supportive structures pg 20© 2018 First San Francisco Partners www.firstsanfranciscopartners.com DATA INSIGHT ARCHITECTURE Wrangling Layer Management Layer Data Access Layer Business Strategy
  • 37. FSFP Reference Architecture DATA INSIGHT ARCHITECTURE pg 21© 2018 First San Francisco Partners www.firstsanfranciscopartners.com 1 Data Life Cycles Management Data Usage Vintage Area Contemporary Area Business Strategy Legacy BI and Reporting Data Warehouse, ODS, Mart ETL, EAI, Replication Data Lake, Pond NoSQL (HDFS, Graph) Advanced Analytics RDBMS, SQL, In-Memory Appliance Metadata Lineage Reference Data Alignment Data Monetization Visualization DataWranglingMobile Logical DW Unstructured Data
  • 38. www.firstsanfranciscopartners.com Key Takeaways and Q&A KISS. (Keep it simple, sweetheart.) – Author Unknown
  • 39.  Simplifying means being able to use and adapt your BI/Data Lake architecture without a lot of trauma.  If your BI/Data Lake architecture reflects your business environment, it will be easier to understand and use.  Blindly adapting an external reference architecture is a formula for confusion, i.e., complexity.  Leverage what you have – i.e., the knowledge, expertise and opportunities in your organization. Key Takeaways pg 23First San Francisco Partners www.firstsanfranciscopartners.com© 2018 Simply put, don’t completely reinvent the wheel.
  • 41. Thank you for joining – thanks, also, to Looker.com for sponsoring the webinar. Please join our next webinar on Thursday, March 1, The Importance of Effective Communications in Analytics. John Ladley @jladley john@firstsanfranciscopartners.com Kelle O’Neal @kellezoneal kelle@firstsanfranciscopartners.com