SlideShare a Scribd company logo
www.edureka.co/data-warehousing-and-bi
Mastering In DataWarehousing & Business Intelligence
Slide 2 www.edureka.co/data-warehousing-and-bi
At the end of this module, you will be able to Know :
Objective
 What is Data Warehousing & Business Intelligence ?
 What is Data Warehousing Architecture ?
 What is Data Modelling / Introduction to ER Win r9 Tool ?
 What is ETL / An Open Source ETL Tool – Talend 5.x ?
 What is Business Intelligence / An Open Source Tableau Public 9.x ?
Slide 3Slide 3Slide 3 www.edureka.co/data-warehousing-and-bi
What is Data Warehouse
A data warehouse refers to a database that is maintained
separately from an organization’s operational database
“A data warehouse is a subject-oriented, integrated, time-
variant, and nonvolatile collection of data in support of
management’s decision-making process.”
Loosely Speaking
Officially Speaking
W.H. Inmon
Slide 4Slide 4Slide 4 www.edureka.co/data-warehousing-and-bi
Subject Oriented
Data Warehouse Properties
Non Volatile
Integrated
Time Variant
Data Warehousing
Slide 5Slide 5Slide 5 www.edureka.co/data-warehousing-and-bi
Subject-Oriented
Data is categorized and stored by business subject rather than by application
Equity
Plans
Shares
Customer
financial
information
Savings
Insurance
Loans
OLTP Applications Data Warehouse Subject
Slide 6Slide 6Slide 6 www.edureka.co/data-warehousing-and-bi
Integrated
OLTP Applications
Savings
Current
accounts
Loans
Data Warehouse
Data on a given subject is defined and stored once.
Customer
Slide 7Slide 7Slide 7 www.edureka.co/data-warehousing-and-bi
Time-Variant
Data is stored as a series of snapshots, each representing a period of time…
TIME DATA
Jan-97 January
Feb-97 February
Mar-97 March
Jan-97 January
Slide 8Slide 8Slide 8 www.edureka.co/data-warehousing-and-bi
Non-Volatile
Typically data in the data warehouse is not updated or deleted.
Insert Update Delete Read Read
Operational Warehouse
Load
Slide 9Slide 9Slide 9 www.edureka.co/data-warehousing-and-bi
Changing Data
Warehouse Database
First time load
Refresh
Refresh
Operational Database
Slide 10Slide 10Slide 10 www.edureka.co/data-warehousing-and-bi
What is Data Warehouse
“Data warehouse is the conglomerate of all data marts
within the enterprise. Information is always stored in the
dimensional model.”
Officially Speaking
Ralph Kimball
Slide 11Slide 11Slide 11 www.edureka.co/data-warehousing-and-bi
InmonVsRalphKimball
Characteristics Inmon Kimball
Business Decision Support
Requirements Strategic Tactical
Data Integration Requirement Enterprise Wise Integrator Individual Business Requirement
Data Structure Requirement
Data that meet multiple and varied
information needs and non-metric data
KPI, Business Performance Measure &
Scorecards Requirements
Persistency of Data in Source System Source Systems have high rate of
change
Source Systems are quite stable
Skill Sets Bigger Team of Specialists Small Team of Generalists
Time Constraint Longer Time Required Urgent delivery as this is subjective
Cost to Build High Start-up Costs Low Start-Up Costs
Slide 12Slide 12Slide 12 www.edureka.co/data-warehousing-and-bi
Data Ware Housing Products
Entity Relationship / Schema Modeling
Data Integration / ETL
Slide 13Slide 13Slide 13 www.edureka.co/data-warehousing-and-bi
What is Business Intelligence?
Business intelligence (BI) is the set of techniques and tools for the transformation of raw / production and
operational data into meaningful and useful information for business analysis purposes for various level.
Business intelligence (BI) talks about how traditional data which transform into the BI which have multiple
initiatives to measure, manage, and improve on the performance of individuals, processes, teams, and business
units for the specific business area.
During the operation of business, the following questions must be asked. The functions of monitoring,
analyzing, and planning delve into these questions as follows :
• What has happened?
• What is happening?
• Why?
• What will happen?
• What do we want to have happen?
Slide 14Slide 14Slide 14 www.edureka.co/data-warehousing-and-bi
What is Business Intelligence Categories?
Strategic Business Intelligence: Management collaborates and agrees on a strategy and a method in which they
would like to see information presented, for example, in maps, scorecards, reports, or dashboards.
Now that you have your strategy defined, it is imperative that you do something with the data that you have collected
Analytical Business Intelligence: Once Strategic BI sets the foundation in the form of key performance metrics,
then Analytical BI is employed to identify the source of an issue once it has been uncovered.
Tools like analytic dashboards, OLAP, predictive analytics, and ad hoc queries are utilized to determine the location or
the cause of a major problem.
Slide 15 www.edureka.co/data-warehousing-and-bi
Data Ware House Architecture
Slide 16Slide 16Slide 16 www.edureka.co/data-warehousing-and-bi
- CA ERwin Introduction
• ERwin is a popular data modeling
tool used by a number of major
companies throughout the world.
The product is currently owned,
developed, and marketed by
Computer Associates, a leading
software company.
• The product supports a variety of
aspects of database design,
including data modeling, forward
engineering (the creation of a
database schema and physical
database on the basis of a data
model), and reverse engineering
(the creation of a data model on the
basis of an existing database) for a
wide variety of relational DBMS,
including Microsoft Access, Oracle,
DB2, Sybase, and others.
Slide 17Slide 17Slide 17 www.edureka.co/data-warehousing-and-bi
Physical ER Diagram
Slide 18Slide 18Slide 18 www.edureka.co/data-warehousing-and-bi
Open Source VS Commercial ETL Tools
Open Source ETL Commercial ETL
License Details
Licensing Model Open-Core Model. No charges for extra
CPU cycles
Core Model. Extra usage charges can be
applied for over-the-limit CPU cycles
Licensing Channel via GNU public license/Apache license Vendor specific
Cost Differentiation
License Costs 30-60% < 1X times Commercial license
(Mostly)
1X times
Support and Maintenance Costs Close to Commercial vendor costs
(Mostly)
Close to Open-Source
Advanced Features Offered through "Commercial" Open-
Source
Add-ons
Type of Support and Maintenance Tiered pricing model. E.g.
Community(Free, Forums), Silver, Gold
and Platinum Editions
Not offered in tiers (Mostly). Add-on to
licensing costs
Slide 19Slide 19Slide 19 www.edureka.co/data-warehousing-and-bi
Talend Open DI Studio 5.x - Overview
• Talend Open Studio for Data Integration operates as a code generator, producing data-transformation scripts and
underlying programs in Java. Its GUI gives access to a metadata repository and to a graphical designer. The
metadata repository contains the definitions and configuration for each job - but not the actual data being
transformed or moved. All of the components of Talend Open Studio for Data Integration use the information in
the metadata repository.
• The product is based on Eclipse.
• Talend Open DI Studio typically use for:
• synchronization or replication of databases
• right-time or batch exchanges of data
• ETL (Extract/Transform/Load) for analytics
• data migration
• complex data transformation and loading
• data quality exercises
• big data
• Talend Open Studio for Data Integration primarily differs from Talend Enterprise Data Integration in that the
Enterprise version has a Subversion plug-in built in, as well as support for joblets. Using Talend Enterprise Data
Integration, ETL and ELT jobs can have a dynamic schema.
Slide 20Slide 20Slide 20 www.edureka.co/data-warehousing-and-bi
Talend - History
Slide 21Slide 21Slide 21 www.edureka.co/data-warehousing-and-bi
Talend - Partners
Slide 22Slide 22Slide 22 www.edureka.co/data-warehousing-and-bi
Talend - Customers
Slide 23Slide 23Slide 23 www.edureka.co/data-warehousing-and-bi
Talend - Market Position
Slide 24Slide 24Slide 24 www.edureka.co/data-warehousing-and-bi
Talend - Product Offering
Slide 25Slide 25Slide 25 www.edureka.co/data-warehousing-and-bi
Talend DI Studio - Welcome Page
Slide 26Slide 26Slide 26 www.edureka.co/data-warehousing-and-bi
Data visualization With Tableau
• Data visualization tools allow anyone to organize and
present information intuitively. This is becoming more
vital as data proliferates in every field from bar codes
in retail stores to player behavior in online games.
• All of this data is meaningless without a way to
organize and present important findings within it.
People comprehend data better through pictures than
by reading numbers in rows and columns.
• So by visualizing data, you are able to more effectively
ask and answer important questions such as “Where
are sales growing,” “What is driving growth” and “What
are the characteristics of my customers using different
services?” By using Tableau visualizations, you gain the
ability to quickly answer questions; your data becomes
a competitive advantage instead of an underutilized
asset.
Slide 27Slide 27Slide 27 www.edureka.co/data-warehousing-and-bi
Tableau Software, Inc.
Company and Leadership
• The company was founded in Mountain View, California in January, 2003 by
Chris Stolte, Christian Chabot and Pat Hanrahan.
• Based on a breakthrough from Stanford University, Tableau makes visual
analytics and business intelligence software that delivers:
• 10-100X productivity improvements
• amazing multi-dimensional discoveries
• web analytics at 1/10th the cost of a “BI Platform”
The company is headquartered in Seattle, WA.
Customers
+ Google
+ Allstate
+ Cornell
+ Harvard
+ Apple
+ NSA
+ Microsoft
+ 1,000’s more
Key Partners
+ Oracle OEM
+ Microsoft Gold Certified
+ Teradata Partner
Slide 28Slide 28Slide 28 www.edureka.co/data-warehousing-and-bi
Customers: 4,000 Strong Companies
Advertising and Marketing
Avenue A | Razorfish
DoubleClick
Draft FCB
Olgilvy
Predicta Brazil
The Martin Agency
Banking and Finance
Bank of America
Barclays Global
Citigroup
Draft FCB Group
EverBank Direct
Fifth Third Bank
Greater Iowa Credit Union
Wells Fargo
Charities
British Red Cross
Christian Relief Services Charities
DC Children & Youth Inv. Trust Corp.
Goodwill Industries
United Way of Rock River Valley
World Vision
Communications
Bell Canada
Bell South
FiberTower
Lucent
Mitel Networks Corporation
Motorola
Sprint
T-Mobile
Telstra
Verizon Communications
Consulting and Legal
Baker & McKenzie
Booz Allen Hamilton
Clorox Company
Cornerstone Research
Deloitte & Touche LLP
Electronic Data Systems Corporation
Ernst & Young
McKinsey & Company
Mercer
Norbridge
Energy & Utilities
Atomic Energy of Canada
Duke Energy Corporation
Louisville Water Company
Omaha Public Power District
Saudi Aramco
TXU
Williams Midstream
WindLogics
Engineering and Construction
Bechtel Corporation
Beezer Homes
Kiewit Corporation
McGraw-Hill Construction
St. Onge Company
Financial Information
Dunn & Bradstreet
Equifax
Fannie Mae
HTM Corporation
Moodys Investors Service
Scottish Re
Standard and Poor's
Food and Beverage
Chicken of the Sea
Nestlé
Prairie Berry Winery
Sierra Nevada Brewing Company
Starbucks Coffee
The Coca-Cola Company
Turkey Hill Dairy
Healthcare
Barnes Jewish Hospital
Blue Cross Blue Shield of Alabama
Caremark
Good Shepherd Medical Center
Harvard Medical School
Johnson & Johnson
Kaleida Health
LSU Health Sciences
Roche Diagnostics
St. Jude Children's Research
University of Miami Medical Center
Wake Forest Health Sciences
Insurance
AAA Allied Group
Amica
Esurance
Marsh & McLennan
Mutual of Omaha
Nationwide Insurance
Progressive
The Regence Group
Investment and Brokerage
Bridgewater Associates
Charles Schwab
Dundee Securities
Merrill Lynch
National Financial Partners
New Enterprise Associates
RBC Dain Rauscher
Rosenblatt Securities
Stone Castle Partners
Manufacturing
Air Products
Alcoa
Boeing
Dow Chemical
Hitachi
Honda
Jabil Circuit
KLA Tencor
Lockheed Martin
Pratt & Whitney
Ricoh
Sony
Steelcase
Toyota
Media and Entertainment
CNET Networks
Discovery Communications
Dow Jones and Company
Epic Games
Microsoft Xbox
New York Times
O'Reilly Media
Sony
TiVo
Univision
Pharamceutical
Alza Pharmaceutical
Cephalon
Eli Lilly
Johnson & Johnson
McKesson
Merck
Novo Nordisk
Pfizer
Sanofi Aventis
Government and Public Sector
Australia Attorney General
DC Government
Federal Aviation Administration (FAA)
Government of Canada
National Science Foundation
NSA - National Security Agency
NYC Department of Education
Pacific Northwest National Lab (PNNL)
SOCOM - US Special Ops Command
US Air Force
US Army
US Bureau of Land Management
US Department of Agriculture (USDA)
US Department of Justice
US Department of the Navy
Veteran's Benefit Association (VBA)
Research & Development
Bayer CropScience
Boeing Phantomworks
General Electric Global Research
Lawrence Livermore Labs
MITRE
National Institute of Health
National Reconnaissance Office
National Visualization Analytics Center
Pacific Northwest National Labs
Quest Diagnostics
Retail
Amazon.com
Barnes & Noble
Borders
Caremark CVS
Lowe's
MAPCO
Pilot Travel LLC
Safeway
Walmart
Wet Seal
Service and Outsourcing
ADP
Computer Information Concepts
EDS
Hmetrix
Madrona Solutions Group
Oco
StrategicOne
Wolters Kluwer
Technology
Adobe
AOL
BEA Systems
CNET Networks
eBay
Electronic Arts
ESRI
Google
HP
IBM
Microsoft
MySQL
Novell
Pay Pal
VMWare
TechNexxus
Travel & Leisure
Alaska Airlines
Bourne Leisure
Celebrity Cruise Lines
Expedia
Royal Caribbean Cruise Lines
Sandiego.com
Universities and Colleges
Appalachian State University
Chemetka Community College
Cornell University
DePaul University
Duke University
Georgetown University
Johns Hopkins University
London School of Economics
Ohio State University
Pitzer College
Providence College
San Diego State University
University College- Dublin
University of North Carolina – Chapel Hill
Slide 29Slide 29Slide 29 www.edureka.co/data-warehousing-and-bi
Tableau Products
Tableau Desktop
Create
Tableau Server
Share - Web
Tableau Reader
Share - Local
+ business intelligence
solution scales to
organizations of all sizes
+ share visual analytics with
anyone with a web
browser
+ publish interactive
analytics or dashboards
+ secure information and
manage metadata
+ collaborate with others
+ share visualizations &
dashboards on the
desktop
+ filter, sort, and page
through the views
+ “Acrobat for Data”
+ free download
+ ad hoc analytics,
dashboards, reports, graphs
+ explore, visualize, and
analyze your data
+ create dashboards to
consolidate multiple views
+ deliver interactive data
experiences
Tableau Public
Share - Everyone
+ create and publish
interactive visualizations
and dashboards
+ embed in websites and
blogs
+ free download and free
hosting service
Slide 30Slide 30Slide 30 www.edureka.co/data-warehousing-and-bi
How Do People Work With Tableau
Slide 31 www.edureka.co/data-warehousing-and-bi
Tableau Public - Start Page
Questions
Slide 32 www.edureka.co/data-warehousing-and-bi
Mastering in data warehousing & BusinessIintelligence

More Related Content

PDF
Agile Data Warehouse Design for Big Data Presentation
PPTX
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
PDF
Data-Ed Online Presents: Data Warehouse Strategies
PPT
Data warehouse 101-fundamentals-
PPT
Bi presentation to bkk
PDF
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
PDF
The opportunity of the business data lake
PPT
Data Warehouse Methodology
Agile Data Warehouse Design for Big Data Presentation
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Data-Ed Online Presents: Data Warehouse Strategies
Data warehouse 101-fundamentals-
Bi presentation to bkk
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
The opportunity of the business data lake
Data Warehouse Methodology

What's hot (20)

PDF
Data Vault Introduction
PDF
The technology of the business data lake
PPTX
Improving the customer experience using big data customer-centric measurement...
PPTX
Microsoft Business Intelligence - Practical Approach & Overview
PDF
Agile BI via Data Vault and Modelstorming
PPT
Dw & etl concepts
PPT
Microsoft business intelligence
PPTX
Bi Architecture And Conceptual Framework
PPSX
Data warehouse
PPTX
SAP Data Services
PPT
Datawarehouse & bi introduction
PPTX
Traditional Data-warehousing / BI overview
PDF
Introduction to Data Warehousing
PDF
Data Warehouse Design and Best Practices
PPTX
Enterprise Services Solutions
PDF
Project+team+1 slides (2)
PDF
Rev_3 Components of a Data Warehouse
PPT
Bi presentation Designing and Implementing Business Intelligence Systems
PPT
Data Provisioning & Optimization
Data Vault Introduction
The technology of the business data lake
Improving the customer experience using big data customer-centric measurement...
Microsoft Business Intelligence - Practical Approach & Overview
Agile BI via Data Vault and Modelstorming
Dw & etl concepts
Microsoft business intelligence
Bi Architecture And Conceptual Framework
Data warehouse
SAP Data Services
Datawarehouse & bi introduction
Traditional Data-warehousing / BI overview
Introduction to Data Warehousing
Data Warehouse Design and Best Practices
Enterprise Services Solutions
Project+team+1 slides (2)
Rev_3 Components of a Data Warehouse
Bi presentation Designing and Implementing Business Intelligence Systems
Data Provisioning & Optimization
Ad

Viewers also liked (12)

PDF
Is Data Scientist still the sexiest job of 21st century? Find Out!
PDF
Big Data Processing with Spark and Scala
PPTX
R and Visualization: A match made in Heaven
PDF
Power of Python with Big Data
PPTX
Big Data Analytics for Non-Programmers
PPTX
Spark for big data analytics
PPTX
Top 5 algorithms used in Data Science
PDF
Clare Corthell: Learning Data Science Online
PPTX
Health care and big data with hadoop – Beacuse prevention is better than cure
PPTX
Python for Big Data Analytics
PDF
Machine Learning In Python | Python Machine Learning Tutorial | Deep Learning...
PDF
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Is Data Scientist still the sexiest job of 21st century? Find Out!
Big Data Processing with Spark and Scala
R and Visualization: A match made in Heaven
Power of Python with Big Data
Big Data Analytics for Non-Programmers
Spark for big data analytics
Top 5 algorithms used in Data Science
Clare Corthell: Learning Data Science Online
Health care and big data with hadoop – Beacuse prevention is better than cure
Python for Big Data Analytics
Machine Learning In Python | Python Machine Learning Tutorial | Deep Learning...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Ad

Similar to Mastering in data warehousing & BusinessIintelligence (20)

PPTX
What is a Data Warehouse and How Do I Test It?
PPTX
How to Optimize Sales Analytics Using 10x the Data at 1/10th the Cost
PPTX
Data warehousing Concepts and Design.pptx
PPTX
introduction & conceptsdatawarehousing.pptx
PDF
Big Data Discovery
PDF
Data warehousing
PDF
MT101 Dell OCIO: Delivering data and analytics in real time
DOCX
Abdul ETL Resume
PDF
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
PDF
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
PDF
Accelerate Self-Service Analytics with Data Virtualization and Visualization
PPTX
Big Data's Impact on the Enterprise
PPTX
Building a strong Data Management capability with TOGAF and ArchiMate
PDF
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
PDF
TRANSFORM DATA WITH INSIGHTFUL ANALYTICS - BUSINESS INTELLIGENCE SOLUTIONS
PDF
Building the Artificially Intelligent Enterprise
PPT
DataWarehousingandAbInitioConcepts.ppt
PDF
Analytic Excellence - Saying Goodbye to Old Constraints
PDF
Management Information Systems for the Information Age 9th Edition Haag Solut...
What is a Data Warehouse and How Do I Test It?
How to Optimize Sales Analytics Using 10x the Data at 1/10th the Cost
Data warehousing Concepts and Design.pptx
introduction & conceptsdatawarehousing.pptx
Big Data Discovery
Data warehousing
MT101 Dell OCIO: Delivering data and analytics in real time
Abdul ETL Resume
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Big Data's Impact on the Enterprise
Building a strong Data Management capability with TOGAF and ArchiMate
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
TRANSFORM DATA WITH INSIGHTFUL ANALYTICS - BUSINESS INTELLIGENCE SOLUTIONS
Building the Artificially Intelligent Enterprise
DataWarehousingandAbInitioConcepts.ppt
Analytic Excellence - Saying Goodbye to Old Constraints
Management Information Systems for the Information Age 9th Edition Haag Solut...

More from Edureka! (20)

PDF
What to learn during the 21 days Lockdown | Edureka
PDF
Top 10 Dying Programming Languages in 2020 | Edureka
PDF
Top 5 Trending Business Intelligence Tools | Edureka
PDF
Tableau Tutorial for Data Science | Edureka
PDF
Python Programming Tutorial | Edureka
PDF
Top 5 PMP Certifications | Edureka
PDF
Top Maven Interview Questions in 2020 | Edureka
PDF
Linux Mint Tutorial | Edureka
PDF
How to Deploy Java Web App in AWS| Edureka
PDF
Importance of Digital Marketing | Edureka
PDF
RPA in 2020 | Edureka
PDF
Email Notifications in Jenkins | Edureka
PDF
EA Algorithm in Machine Learning | Edureka
PDF
Cognitive AI Tutorial | Edureka
PDF
AWS Cloud Practitioner Tutorial | Edureka
PDF
Blue Prism Top Interview Questions | Edureka
PDF
Big Data on AWS Tutorial | Edureka
PDF
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
PDF
Kubernetes Installation on Ubuntu | Edureka
PDF
Introduction to DevOps | Edureka
What to learn during the 21 days Lockdown | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
Tableau Tutorial for Data Science | Edureka
Python Programming Tutorial | Edureka
Top 5 PMP Certifications | Edureka
Top Maven Interview Questions in 2020 | Edureka
Linux Mint Tutorial | Edureka
How to Deploy Java Web App in AWS| Edureka
Importance of Digital Marketing | Edureka
RPA in 2020 | Edureka
Email Notifications in Jenkins | Edureka
EA Algorithm in Machine Learning | Edureka
Cognitive AI Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
Blue Prism Top Interview Questions | Edureka
Big Data on AWS Tutorial | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
Kubernetes Installation on Ubuntu | Edureka
Introduction to DevOps | Edureka

Recently uploaded (20)

PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
Mushroom cultivation and it's methods.pdf
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation theory and applications.pdf
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Approach and Philosophy of On baking technology
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
OMC Textile Division Presentation 2021.pptx
PPTX
Tartificialntelligence_presentation.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Enhancing emotion recognition model for a student engagement use case through...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
project resource management chapter-09.pdf
WOOl fibre morphology and structure.pdf for textiles
Encapsulation_ Review paper, used for researhc scholars
Zenith AI: Advanced Artificial Intelligence
Mushroom cultivation and it's methods.pdf
Hindi spoken digit analysis for native and non-native speakers
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation theory and applications.pdf
A comparative study of natural language inference in Swahili using monolingua...
Chapter 5: Probability Theory and Statistics
Approach and Philosophy of On baking technology
A novel scalable deep ensemble learning framework for big data classification...
Programs and apps: productivity, graphics, security and other tools
OMC Textile Division Presentation 2021.pptx
Tartificialntelligence_presentation.pptx
A Presentation on Artificial Intelligence
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
cloud_computing_Infrastucture_as_cloud_p
Enhancing emotion recognition model for a student engagement use case through...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
project resource management chapter-09.pdf

Mastering in data warehousing & BusinessIintelligence

  • 2. Slide 2 www.edureka.co/data-warehousing-and-bi At the end of this module, you will be able to Know : Objective  What is Data Warehousing & Business Intelligence ?  What is Data Warehousing Architecture ?  What is Data Modelling / Introduction to ER Win r9 Tool ?  What is ETL / An Open Source ETL Tool – Talend 5.x ?  What is Business Intelligence / An Open Source Tableau Public 9.x ?
  • 3. Slide 3Slide 3Slide 3 www.edureka.co/data-warehousing-and-bi What is Data Warehouse A data warehouse refers to a database that is maintained separately from an organization’s operational database “A data warehouse is a subject-oriented, integrated, time- variant, and nonvolatile collection of data in support of management’s decision-making process.” Loosely Speaking Officially Speaking W.H. Inmon
  • 4. Slide 4Slide 4Slide 4 www.edureka.co/data-warehousing-and-bi Subject Oriented Data Warehouse Properties Non Volatile Integrated Time Variant Data Warehousing
  • 5. Slide 5Slide 5Slide 5 www.edureka.co/data-warehousing-and-bi Subject-Oriented Data is categorized and stored by business subject rather than by application Equity Plans Shares Customer financial information Savings Insurance Loans OLTP Applications Data Warehouse Subject
  • 6. Slide 6Slide 6Slide 6 www.edureka.co/data-warehousing-and-bi Integrated OLTP Applications Savings Current accounts Loans Data Warehouse Data on a given subject is defined and stored once. Customer
  • 7. Slide 7Slide 7Slide 7 www.edureka.co/data-warehousing-and-bi Time-Variant Data is stored as a series of snapshots, each representing a period of time… TIME DATA Jan-97 January Feb-97 February Mar-97 March Jan-97 January
  • 8. Slide 8Slide 8Slide 8 www.edureka.co/data-warehousing-and-bi Non-Volatile Typically data in the data warehouse is not updated or deleted. Insert Update Delete Read Read Operational Warehouse Load
  • 9. Slide 9Slide 9Slide 9 www.edureka.co/data-warehousing-and-bi Changing Data Warehouse Database First time load Refresh Refresh Operational Database
  • 10. Slide 10Slide 10Slide 10 www.edureka.co/data-warehousing-and-bi What is Data Warehouse “Data warehouse is the conglomerate of all data marts within the enterprise. Information is always stored in the dimensional model.” Officially Speaking Ralph Kimball
  • 11. Slide 11Slide 11Slide 11 www.edureka.co/data-warehousing-and-bi InmonVsRalphKimball Characteristics Inmon Kimball Business Decision Support Requirements Strategic Tactical Data Integration Requirement Enterprise Wise Integrator Individual Business Requirement Data Structure Requirement Data that meet multiple and varied information needs and non-metric data KPI, Business Performance Measure & Scorecards Requirements Persistency of Data in Source System Source Systems have high rate of change Source Systems are quite stable Skill Sets Bigger Team of Specialists Small Team of Generalists Time Constraint Longer Time Required Urgent delivery as this is subjective Cost to Build High Start-up Costs Low Start-Up Costs
  • 12. Slide 12Slide 12Slide 12 www.edureka.co/data-warehousing-and-bi Data Ware Housing Products Entity Relationship / Schema Modeling Data Integration / ETL
  • 13. Slide 13Slide 13Slide 13 www.edureka.co/data-warehousing-and-bi What is Business Intelligence? Business intelligence (BI) is the set of techniques and tools for the transformation of raw / production and operational data into meaningful and useful information for business analysis purposes for various level. Business intelligence (BI) talks about how traditional data which transform into the BI which have multiple initiatives to measure, manage, and improve on the performance of individuals, processes, teams, and business units for the specific business area. During the operation of business, the following questions must be asked. The functions of monitoring, analyzing, and planning delve into these questions as follows : • What has happened? • What is happening? • Why? • What will happen? • What do we want to have happen?
  • 14. Slide 14Slide 14Slide 14 www.edureka.co/data-warehousing-and-bi What is Business Intelligence Categories? Strategic Business Intelligence: Management collaborates and agrees on a strategy and a method in which they would like to see information presented, for example, in maps, scorecards, reports, or dashboards. Now that you have your strategy defined, it is imperative that you do something with the data that you have collected Analytical Business Intelligence: Once Strategic BI sets the foundation in the form of key performance metrics, then Analytical BI is employed to identify the source of an issue once it has been uncovered. Tools like analytic dashboards, OLAP, predictive analytics, and ad hoc queries are utilized to determine the location or the cause of a major problem.
  • 16. Slide 16Slide 16Slide 16 www.edureka.co/data-warehousing-and-bi - CA ERwin Introduction • ERwin is a popular data modeling tool used by a number of major companies throughout the world. The product is currently owned, developed, and marketed by Computer Associates, a leading software company. • The product supports a variety of aspects of database design, including data modeling, forward engineering (the creation of a database schema and physical database on the basis of a data model), and reverse engineering (the creation of a data model on the basis of an existing database) for a wide variety of relational DBMS, including Microsoft Access, Oracle, DB2, Sybase, and others.
  • 17. Slide 17Slide 17Slide 17 www.edureka.co/data-warehousing-and-bi Physical ER Diagram
  • 18. Slide 18Slide 18Slide 18 www.edureka.co/data-warehousing-and-bi Open Source VS Commercial ETL Tools Open Source ETL Commercial ETL License Details Licensing Model Open-Core Model. No charges for extra CPU cycles Core Model. Extra usage charges can be applied for over-the-limit CPU cycles Licensing Channel via GNU public license/Apache license Vendor specific Cost Differentiation License Costs 30-60% < 1X times Commercial license (Mostly) 1X times Support and Maintenance Costs Close to Commercial vendor costs (Mostly) Close to Open-Source Advanced Features Offered through "Commercial" Open- Source Add-ons Type of Support and Maintenance Tiered pricing model. E.g. Community(Free, Forums), Silver, Gold and Platinum Editions Not offered in tiers (Mostly). Add-on to licensing costs
  • 19. Slide 19Slide 19Slide 19 www.edureka.co/data-warehousing-and-bi Talend Open DI Studio 5.x - Overview • Talend Open Studio for Data Integration operates as a code generator, producing data-transformation scripts and underlying programs in Java. Its GUI gives access to a metadata repository and to a graphical designer. The metadata repository contains the definitions and configuration for each job - but not the actual data being transformed or moved. All of the components of Talend Open Studio for Data Integration use the information in the metadata repository. • The product is based on Eclipse. • Talend Open DI Studio typically use for: • synchronization or replication of databases • right-time or batch exchanges of data • ETL (Extract/Transform/Load) for analytics • data migration • complex data transformation and loading • data quality exercises • big data • Talend Open Studio for Data Integration primarily differs from Talend Enterprise Data Integration in that the Enterprise version has a Subversion plug-in built in, as well as support for joblets. Using Talend Enterprise Data Integration, ETL and ELT jobs can have a dynamic schema.
  • 20. Slide 20Slide 20Slide 20 www.edureka.co/data-warehousing-and-bi Talend - History
  • 21. Slide 21Slide 21Slide 21 www.edureka.co/data-warehousing-and-bi Talend - Partners
  • 22. Slide 22Slide 22Slide 22 www.edureka.co/data-warehousing-and-bi Talend - Customers
  • 23. Slide 23Slide 23Slide 23 www.edureka.co/data-warehousing-and-bi Talend - Market Position
  • 24. Slide 24Slide 24Slide 24 www.edureka.co/data-warehousing-and-bi Talend - Product Offering
  • 25. Slide 25Slide 25Slide 25 www.edureka.co/data-warehousing-and-bi Talend DI Studio - Welcome Page
  • 26. Slide 26Slide 26Slide 26 www.edureka.co/data-warehousing-and-bi Data visualization With Tableau • Data visualization tools allow anyone to organize and present information intuitively. This is becoming more vital as data proliferates in every field from bar codes in retail stores to player behavior in online games. • All of this data is meaningless without a way to organize and present important findings within it. People comprehend data better through pictures than by reading numbers in rows and columns. • So by visualizing data, you are able to more effectively ask and answer important questions such as “Where are sales growing,” “What is driving growth” and “What are the characteristics of my customers using different services?” By using Tableau visualizations, you gain the ability to quickly answer questions; your data becomes a competitive advantage instead of an underutilized asset.
  • 27. Slide 27Slide 27Slide 27 www.edureka.co/data-warehousing-and-bi Tableau Software, Inc. Company and Leadership • The company was founded in Mountain View, California in January, 2003 by Chris Stolte, Christian Chabot and Pat Hanrahan. • Based on a breakthrough from Stanford University, Tableau makes visual analytics and business intelligence software that delivers: • 10-100X productivity improvements • amazing multi-dimensional discoveries • web analytics at 1/10th the cost of a “BI Platform” The company is headquartered in Seattle, WA. Customers + Google + Allstate + Cornell + Harvard + Apple + NSA + Microsoft + 1,000’s more Key Partners + Oracle OEM + Microsoft Gold Certified + Teradata Partner
  • 28. Slide 28Slide 28Slide 28 www.edureka.co/data-warehousing-and-bi Customers: 4,000 Strong Companies Advertising and Marketing Avenue A | Razorfish DoubleClick Draft FCB Olgilvy Predicta Brazil The Martin Agency Banking and Finance Bank of America Barclays Global Citigroup Draft FCB Group EverBank Direct Fifth Third Bank Greater Iowa Credit Union Wells Fargo Charities British Red Cross Christian Relief Services Charities DC Children & Youth Inv. Trust Corp. Goodwill Industries United Way of Rock River Valley World Vision Communications Bell Canada Bell South FiberTower Lucent Mitel Networks Corporation Motorola Sprint T-Mobile Telstra Verizon Communications Consulting and Legal Baker & McKenzie Booz Allen Hamilton Clorox Company Cornerstone Research Deloitte & Touche LLP Electronic Data Systems Corporation Ernst & Young McKinsey & Company Mercer Norbridge Energy & Utilities Atomic Energy of Canada Duke Energy Corporation Louisville Water Company Omaha Public Power District Saudi Aramco TXU Williams Midstream WindLogics Engineering and Construction Bechtel Corporation Beezer Homes Kiewit Corporation McGraw-Hill Construction St. Onge Company Financial Information Dunn & Bradstreet Equifax Fannie Mae HTM Corporation Moodys Investors Service Scottish Re Standard and Poor's Food and Beverage Chicken of the Sea Nestlé Prairie Berry Winery Sierra Nevada Brewing Company Starbucks Coffee The Coca-Cola Company Turkey Hill Dairy Healthcare Barnes Jewish Hospital Blue Cross Blue Shield of Alabama Caremark Good Shepherd Medical Center Harvard Medical School Johnson & Johnson Kaleida Health LSU Health Sciences Roche Diagnostics St. Jude Children's Research University of Miami Medical Center Wake Forest Health Sciences Insurance AAA Allied Group Amica Esurance Marsh & McLennan Mutual of Omaha Nationwide Insurance Progressive The Regence Group Investment and Brokerage Bridgewater Associates Charles Schwab Dundee Securities Merrill Lynch National Financial Partners New Enterprise Associates RBC Dain Rauscher Rosenblatt Securities Stone Castle Partners Manufacturing Air Products Alcoa Boeing Dow Chemical Hitachi Honda Jabil Circuit KLA Tencor Lockheed Martin Pratt & Whitney Ricoh Sony Steelcase Toyota Media and Entertainment CNET Networks Discovery Communications Dow Jones and Company Epic Games Microsoft Xbox New York Times O'Reilly Media Sony TiVo Univision Pharamceutical Alza Pharmaceutical Cephalon Eli Lilly Johnson & Johnson McKesson Merck Novo Nordisk Pfizer Sanofi Aventis Government and Public Sector Australia Attorney General DC Government Federal Aviation Administration (FAA) Government of Canada National Science Foundation NSA - National Security Agency NYC Department of Education Pacific Northwest National Lab (PNNL) SOCOM - US Special Ops Command US Air Force US Army US Bureau of Land Management US Department of Agriculture (USDA) US Department of Justice US Department of the Navy Veteran's Benefit Association (VBA) Research & Development Bayer CropScience Boeing Phantomworks General Electric Global Research Lawrence Livermore Labs MITRE National Institute of Health National Reconnaissance Office National Visualization Analytics Center Pacific Northwest National Labs Quest Diagnostics Retail Amazon.com Barnes & Noble Borders Caremark CVS Lowe's MAPCO Pilot Travel LLC Safeway Walmart Wet Seal Service and Outsourcing ADP Computer Information Concepts EDS Hmetrix Madrona Solutions Group Oco StrategicOne Wolters Kluwer Technology Adobe AOL BEA Systems CNET Networks eBay Electronic Arts ESRI Google HP IBM Microsoft MySQL Novell Pay Pal VMWare TechNexxus Travel & Leisure Alaska Airlines Bourne Leisure Celebrity Cruise Lines Expedia Royal Caribbean Cruise Lines Sandiego.com Universities and Colleges Appalachian State University Chemetka Community College Cornell University DePaul University Duke University Georgetown University Johns Hopkins University London School of Economics Ohio State University Pitzer College Providence College San Diego State University University College- Dublin University of North Carolina – Chapel Hill
  • 29. Slide 29Slide 29Slide 29 www.edureka.co/data-warehousing-and-bi Tableau Products Tableau Desktop Create Tableau Server Share - Web Tableau Reader Share - Local + business intelligence solution scales to organizations of all sizes + share visual analytics with anyone with a web browser + publish interactive analytics or dashboards + secure information and manage metadata + collaborate with others + share visualizations & dashboards on the desktop + filter, sort, and page through the views + “Acrobat for Data” + free download + ad hoc analytics, dashboards, reports, graphs + explore, visualize, and analyze your data + create dashboards to consolidate multiple views + deliver interactive data experiences Tableau Public Share - Everyone + create and publish interactive visualizations and dashboards + embed in websites and blogs + free download and free hosting service
  • 30. Slide 30Slide 30Slide 30 www.edureka.co/data-warehousing-and-bi How Do People Work With Tableau