Understanding
Big Data and
Data Analytics
Seta A. Wicaksana
Founder and CEO of
www.humanikaconsulting.com
Seta A. Wicaksana
0811 19 53 43
wicaksana@humanikaconsulting.com
• Managing Director of Humanika Amanah Indonesia – Humanika Consulting
• Managing Director of Humanika Bisnis Digital – hipotest.com
• Vice Chair of Asosiasi Psikologi Forensik Indonesia (Indonesian Forensic Psychology Association), DKI region
• Business Psychologist
• Certified Talent Management Assessor
• Certified Human Resources Business Partner
• Certified Risk Professional
• Certified in HR Audit
• Certified I/O Psychologist
• Permanent Lecturer, Faculty of Psychology, Universitas Pancasila
• Trustee of Yayasan Humanika Edukasi Indonesia
• Author of: “SOBAT WAY: Mengubah Potensi menjadi kompetensi”, Elexmedia
Gramedia, 2016; Industri dan Organisasi: Pendekatan Integratif menghadapi perubahan,
DD Publishing, 2020; Human Factor Engineering: Manusia dan Lingkungan Kerja, DD
Publishing, 2021; Psikologi Industri dan Organisasi, DD Publishing, 2021
• Organizational Development Expertise
• Currently pursuing a doctorate (S3) in Human Resource Management at the Faculty of
Economics and Business, Universitas Pancasila; dissertation on the central role of
organizational culture in organizational agility at non-ministerial government agency XYZ
• Bachelor's and master's degrees (S1 and S2), Faculty of Psychology, Universitas Indonesia
• Mathematics: Cryptology, Akademi Sandi Negara (government service academy)
CONTENTS
• Introduction
• Module 1: Big Data
• Module 2: Business
Intelligence/Analytics
• Module 3: Visualization
• Module 4: Data Mining
Learning
Objectives
Upon successful completion of this
chapter, you will be able to:
• Explain the difference between BI,
Analytics, Data Marts and Big Data.
• Define the characteristics of data
for good decision making.
• Describe what Data Mining is.
• Explain market basket and
cluster analysis.
Introduction
Business Analytics, BI,
Big Data, Data Mining -
What’s the difference?
• Business Analytics – Tools to explore past
data to gain insight into future business
decisions.
• BI – Tools and techniques to turn data into
meaningful information.
• Big Data – Data sets that are so large or
complex that traditional data processing
applications are inadequate.
• Data Mining - Tools for discovering
patterns in large data sets.
Businesses Need Support for
Decision Making
• Uncertain economics
• Rapidly changing environments
• Global competition
• Demanding customers
• Taking advantage of information
acquired by companies is a
Critical Success Factor.
Characteristics
of Data for Good
Decision Making
• Source: speakingdata blog
The Information Gap
• The shortfall between gathering information and using it
for decision making.
• Firms have inadequate data warehouses.
• Business Analysts spend 2 days a week gathering and
formatting data, instead of performing analysis. (Data
Warehousing Institute).
• Business Intelligence (BI) seeks to bridge the
information gap.
What is Big Data?
MODULE 1
What is Big Data?
• Massive sets of unstructured/semi-structured data from Web traffic,
social media, sensors, etc
• Petabytes, exabytes of data
• Volumes too great for typical DBMS
• Information from multiple internal and external sources:
• Transactions
• Social media
• Enterprise content
• Sensors
• Mobile devices
• In the last minute there were …….
• 204 million emails sent
• 61,000 hours of music
listened to on Pandora
• 20 million photo views
• 100,000 tweets
• 6 million Facebook views and 277,000 Facebook logins
• 2+ million Google searches
• 3 million uploads on Flickr
What is Big Data? continued
• Companies leverage data to adapt products
and services to:
• Meet customer needs
• Optimize operations
• Optimize infrastructure
• Find new sources of revenue
• Can reveal more patterns and
anomalies
• IBM estimates that by 2015, 4.4 million jobs
will be created globally to support big data
• 1.9 million of these jobs will be in the
United States
Where does Big Data come from?
[Diagram labels] Enterprise “Dark Data”; Partner, Employee, Customer, Supplier; Public; Commercial; Social Media; Transactions; Monitoring; Sensor; Economic; Population; Sentiment; Email; Contracts; Network; Industry; Credit; Weather
Types of Data
• When collecting or gathering
data, we collect data from
individual cases on particular
variables.
• A variable is a unit of data
collection whose value can vary.
• Variables can be classified into
types according to the level of
mathematical scaling that can be
carried out on the data.
• There are four types of data
or levels of measurement:
nominal, ordinal, interval and ratio.
Categorical
(Nominal) data
• Nominal or categorical data comprises
categories that cannot be rank ordered – each category is
just different.
• The categories available cannot be placed in any order
and no judgement can be made about the relative size or
distance from one category to another.
• Categories bear no quantitative relationship to one
another.
• Examples:
  - customer’s location (America, Europe, Asia)
  - employee classification (manager, supervisor, associate)
• What does this mean? No arithmetic operations
can be performed on the categories; they can only be counted and compared for equality.
• Therefore, nominal data reflect qualitative differences
rather than quantitative ones.
Nominal data
• Systems for measuring nominal data must ensure that each
category is mutually exclusive and that the system of measurement
is exhaustive.
• Exhaustive: the system of categories should have enough
categories for all the observations.
• Variables that have only two responses, e.g. Yes or No, are
known as dichotomies.
Examples:
What is your gender?
(please tick)
Male
Female
Did you enjoy the
film? (please tick)
Yes
No
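As a rough illustration of what nominal data allows, the sketch below counts categories, finds the mode, and checks that the category system is exhaustive. The list of responses is invented for illustration; only the region names come from the slide.

```python
from collections import Counter

# Category system taken from the slide's customer-location example.
ALLOWED_LOCATIONS = {"America", "Europe", "Asia"}

# Hypothetical observations, invented for this sketch.
customer_locations = ["America", "Europe", "Asia", "Europe", "America", "Europe"]

# Counting and the mode are meaningful for nominal data.
location_counts = Counter(customer_locations)
most_common_location, most_common_count = location_counts.most_common(1)[0]

# Equality checks are meaningful; ordering or averaging the labels is not.
same_region = customer_locations[0] == customer_locations[4]

# Exhaustiveness check: every observation must fit one of the categories.
unclassified = [loc for loc in customer_locations if loc not in ALLOWED_LOCATIONS]

print(location_counts)
print("Mode:", most_common_location, "with", most_common_count, "observations")
print("Unclassified observations:", unclassified)
```

Because each observation is stored as a single label, mutual exclusivity holds by construction in this sketch.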
Ordinal data
• Ordinal data comprises categories that can be rank ordered.
• As with nominal data, the distance between categories cannot be
calculated, but the categories can be ranked above or below each other.
• No fixed units of measurement
• Examples:
  - college football rankings
  - survey responses (poor, average, good, very good, excellent)
• What does this mean? We can make statistical judgements and perform limited
maths, such as ranking and finding the median.
Example:
How satisfied are you with the level of
service you have received? (please tick)
Very satisfied
Somewhat satisfied
Neutral
Somewhat dissatisfied
Very dissatisfied
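The "limited maths" available for ordinal data can be sketched as follows: responses are mapped to rank positions so they can be sorted and a median category found, without ever averaging the labels. The rank order follows the satisfaction question above; the list of responses is hypothetical.

```python
from statistics import median_low

# Rank order from the satisfaction question, lowest to highest.
SATISFACTION_ORDER = [
    "Very dissatisfied",
    "Somewhat dissatisfied",
    "Neutral",
    "Somewhat satisfied",
    "Very satisfied",
]
RANK = {label: position for position, label in enumerate(SATISFACTION_ORDER)}

# Hypothetical survey responses, invented for this sketch.
responses = [
    "Neutral",
    "Very satisfied",
    "Somewhat satisfied",
    "Somewhat dissatisfied",
    "Very satisfied",
]

# Ranking is meaningful: sort responses from least to most satisfied.
ranked = sorted(responses, key=RANK.__getitem__)

# The median is meaningful; median_low always returns an actual category rank.
median_label = SATISFACTION_ORDER[median_low(RANK[r] for r in responses)]

# Comparisons are meaningful, but the gap between two ranks has no fixed size.
is_higher = RANK["Very satisfied"] > RANK["Neutral"]

print(ranked)
print("Median response:", median_label)
```

The integer ranks are only positions in the ordering; subtracting or averaging them would assume equal distances between categories, which ordinal data does not guarantee.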
Interval and
ratio data
• Both interval and ratio data are
examples of scale data.
• Scale data:
• data is in numeric format ($50,
$100, $150)
• data that can be measured on
a continuous scale
• the distance between values can
be observed and, as a result,
measured
• the data can be placed in rank
order.
Interval data
• Ordinal data but with constant
differences between observations
• Ratios are not meaningful
• Examples:
• Time – clock or calendar time moves along a continuous
measure of seconds, minutes and so
on, and has no true zero point.
• Temperature – in degrees Celsius or Fahrenheit, moves along a
continuous measure of degrees and is
without a true zero.
• SAT scores
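A small worked example may help show why ratios are not meaningful on an interval scale. The two temperature readings are invented; the Kelvin conversion is included only as a contrast, since Kelvin has a true zero.

```python
# Hypothetical readings in degrees Celsius (an interval scale).
monday_c = 10.0
tuesday_c = 20.0

# Differences are meaningful: Tuesday was 10 degrees warmer.
difference_c = tuesday_c - monday_c

# The ratio can be computed, but "twice as hot" is not a valid conclusion,
# because 0 degrees Celsius is an arbitrary point, not the absence of heat.
naive_ratio = tuesday_c / monday_c


def celsius_to_kelvin(celsius):
    """Convert to Kelvin, which has a true zero."""
    return celsius + 273.15


# On the Kelvin scale the same two days differ by only a few percent.
kelvin_ratio = celsius_to_kelvin(tuesday_c) / celsius_to_kelvin(monday_c)

print("Difference in Celsius:", difference_c)
print("Naive Celsius ratio:", naive_ratio)
print("Kelvin ratio:", round(kelvin_ratio, 4))
```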
Ratio data
• Ratio data are measured on a continuous
scale and have a natural zero point.
• Ratios are meaningful
• Examples:
• monthly sales
• delivery times
• Weight
• Height
• Age
Data for Business Analytics (continued)
Figure 1.2: Classifying Data Elements in a Purchasing Database
If there were a field (column) for Supplier Rating (Excellent, Good, Acceptable, Bad), that data
would be classified as ordinal.
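One way to make the classification concrete is to record a measurement level for each column and look up which statistics are meaningful for it. This is a minimal sketch: apart from Supplier Rating, the column names are hypothetical and are not taken from the figure, and the statistic names are labels chosen for illustration.

```python
# Levels of measurement, from weakest to strongest.
LEVELS = ["nominal", "ordinal", "interval", "ratio"]

# Statistics that become meaningful at each level; higher levels inherit
# everything permitted at the levels below them.
NEW_STATISTICS = {
    "nominal": {"count", "mode"},
    "ordinal": {"median", "rank"},
    "interval": {"mean", "difference"},
    "ratio": {"ratio"},
}

# Hypothetical purchasing-database columns; only "Supplier Rating"
# appears in the slide text.
COLUMN_LEVELS = {
    "Supplier": "nominal",
    "Supplier Rating": "ordinal",
    "Order Date": "interval",
    "Item Cost": "ratio",
}


def permitted_statistics(level):
    """Return every statistic that is meaningful at the given level."""
    allowed = set()
    for name in LEVELS[: LEVELS.index(level) + 1]:
        allowed |= NEW_STATISTICS[name]
    return allowed


def can_compute(column, statistic):
    """Check whether a statistic is meaningful for a column."""
    return statistic in permitted_statistics(COLUMN_LEVELS[column])


print(can_compute("Supplier Rating", "median"))
print(can_compute("Supplier Rating", "mean"))
```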
Big Data Characteristics
VELOCITY: Quickening speed of data
e.g. smart meters, process monitoring
VOLUME: Growing quantity of data
e.g. social media, behavioral, video
VARIETY: Increase in types of data
e.g. app data, unstructured data
Source: Gartner, Feb 2001
Which Big Data characteristic is the biggest
issue for your organization?
• Variety of data: 48%
• Volume of data: 35%
• Velocity of data: 16%
Source: Getting Value from Big Data, Gartner Webinar, May 2012
Volume
• Petabytes, exabytes of data
• Volumes too great for typical DBMS
Volume - Bytes Defined
eBay data warehouse (2010) = 10 PB
eBay will increase this 2.5 times by 2011
Teradata > 10 PB
Megabyte: 2^20 bytes or, loosely, one million bytes
Gigabyte: 2^30 bytes or, loosely, one billion bytes
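The byte definitions above can be checked with a short calculation. The sketch below computes the binary (power-of-two) and loose decimal size of each unit and applies the slide's eBay figures; the helper names are chosen for illustration.

```python
# Unit prefixes in increasing order; each step multiplies by 2**10.
UNITS = ["KB", "MB", "GB", "TB", "PB", "EB"]


def binary_bytes(unit):
    """Exact size of a unit using powers of two (a megabyte is 2**20 bytes)."""
    return 2 ** (10 * (UNITS.index(unit) + 1))


def decimal_bytes(unit):
    """Loose size of a unit using powers of ten (a megabyte is one million bytes)."""
    return 10 ** (3 * (UNITS.index(unit) + 1))


megabyte = binary_bytes("MB")
gigabyte = binary_bytes("GB")

# Figures from the slide: 10 PB in 2010, growing 2.5 times by 2011.
ebay_2010_pb = 10
ebay_2011_pb = ebay_2010_pb * 2.5
ebay_2010_bytes = ebay_2010_pb * binary_bytes("PB")

for unit in UNITS:
    print(unit, binary_bytes(unit), "bytes; loosely", decimal_bytes(unit))
print("eBay 2011 projection in PB:", ebay_2011_pb)
```

The gap between the binary and decimal definitions grows with each unit, which is why "loosely" matters more for petabytes than for megabytes.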
Velocity
• Massive amount of streaming data
Variety
• Massive sets of unstructured/semi-structured
data from Web traffic, social media, sensors,
and so on
Big Data Opportunities
Discovering hidden insights
e.g. anomalies, forensics, patterns,
trends
Making better informed decisions
e.g. strategies, recommendations
Automating business processes
e.g. complex events, translation
Which is the
biggest
opportunity for
Big Data in your
organization?
Quality
Improvement
• Opportunity
• Move from manual to automated inspection of burger
bun production to ensure and improve quality
• Data & Analytics
• Photo-analyze over 1000 buns-per-minute for color,
shape and seed distribution
• Continually adjust ovens and process automatically
• Result
• Eliminate 1000s of pounds of wasted product per year;
speed production; save energy; Reduce manual labor
costs
• Is the company using all of its “senses” to observe, measure and optimize business processes?
Identifying Fraud in Insurance
Opportunity
•Save and make money by reducing fraudulent
auto insurance claims
Data & Analytics
•Predictive analytics against years of historical claims
and coverage data
•Text mining adjuster reports for hidden clues, e.g.
missing facts, inconsistencies, changed stories
Results
•Improved success rate in pursuing fraudulent claims
from 50% to 88%; reduced fraudulent claim
investigation time by 95%
•Marketing to individuals with low propensity for fraud
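The quoted results can be read in two ways, as a gain in percentage points or as a relative improvement. The sketch below works through both using the slide's figures; the 40-hour baseline investigation time is a hypothetical value chosen only to illustrate the 95% reduction.

```python
# Figures quoted on the slide, in percent.
success_before_pct = 50
success_after_pct = 88
time_reduction_pct = 95

# Absolute gain in percentage points.
point_gain = success_after_pct - success_before_pct

# Relative improvement over the original success rate, in percent.
relative_gain_pct = 100 * point_gain / success_before_pct

# Hypothetical baseline: an investigation that used to take 40 hours.
baseline_hours = 40.0
hours_after = baseline_hours * (100 - time_reduction_pct) / 100

print("Gain in percentage points:", point_gain)
print("Relative improvement in percent:", relative_gain_pct)
print("Hours per investigation after the change:", hours_after)
```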
What **“dark data” is just lying around that could transform
business processes?
**Operational data that is not being used. Consulting and market research company Gartner Inc. describes dark data as "information assets
that organizations collect, process and store in the course of their regular business activity, but generally fail to use for other purposes."
Learning
and Giving
for Better
Indonesia
www.humanikaconsulting.com
