SlideShare a Scribd company logo
Academic Skills in English, Oct. 12.2015
Ins. Colette Gattoni (M.Ed.)
Visualizing Big Data
Dawit Nida
Abstract
Ever since the creation of computers, data have been generated on a continues manner, in several
forms and structures. Many organizations are utilizing Big Data, a huge amount of data, that have
not been handled by the traditional data management systems, to make their business decisions.
Big Data can be referred to unstructured, semi-structured or structured data generated by users.
Unstructured data, such as emails, texts, data from sensors, etc. structured data includes transactions.
Big Data can be characterized by the four Vs, Volume, Velocity, Variety and Veracity. Handling Big
Data requires expertise in the field to minimize the risks in security, data management and analytics.
Data scientists who make different kinds of analytics for organizations, use a set of techniques for
processing big amount data to create business decisions using business intelligence tools and methods.
Keywords
Big Data, Analytic, Business Intelligence
Contents
Introduction 1
1 Why Big Data is ‘BIG’? 1
1.1 The Four Vs of Big Data . . . . . . . . . . . . . 2
1.2 The Risks of Big Data . . . . . . . . . . . . . . . . 2
2 Big Data Analytics 3
2.1 Types of Analytics . . . . . . . . . . . . . . . . . . . 3
3 BI and Big Data 3
4 Conclusion 4
References 4
Introduction
Big Data is a huge amount of data, that have not
been handled by the traditional data manage-
ment systems. To handle these data, analytics
is required. Analytics is a knowledge discovery
and extracting valuable trends in data that can
be visualized for insights and patterns using a
set of techniques and tool required to collect,
store, visualize data into valuable information
and benefit from analysing and making efficient
business decisions.
1. Why Big Data is ‘BIG’?
Big data is a form of data, but huge amount of
data, from several sources in varies forms that
have not been handled by the traditional data
management systems. Big data can be referred
to unstructured, semi-structured or structured
data. Unstructured data from social media and
sensors includes emails, texts, data from various
sensors, video and audio data, and etc. struc-
tured data includes transaction data from cus-
tomers, etc.
For instance, based on the infographic post on
iDigitalTimes (July 2013) [1][2], in every 60 sec-
onds on the internet the following happens.
• 204 mil. emails sent out
• 1.8 mil. Facebook likes are generated by users
• 278,000 Tweets are twitted
• 200,000 photos are uploaded to Facebook
• Around 100 hours of video are uploaded to
YouTube
• 20 mil. photos are viewed on Flickr
• 120 new users register to LinkedIn
• 88,000 calls made on Skype
• €73,000 is spent on Amazon
Visualizing Big Data — 2/4
Figure 1. Two selected Big Data articles analysed using R-programming language
1.1 The Four Vs of Big Data
To define the features of Big Data, the four Vs
are described below[3].
Volume: refers to the amount of data generated
every millisecond. Huge number of transactional
data stored every second, machine and sensor
data, social media, and enterprise data is being
collected and stored. Even though, storage cost
is not a big issue, sorting out the data with
respect to their relevancy is today’s problem.
Velocity: refers to the speed of streaming data
that is required to be executed in real time. It is
not only how fast data is produced or changed
but also the speed it has to be received, under-
stood and processed.
Variety: in addition to the size and the speed,
data also varies with respect to structure and
types. For instance email, photos, video or audio
data are few examples of unstructured data that
can be used with traditional structured data that
fits into tables and relational databases.
Veracity: refers to the trustworthiness (uncer-
tainty of data) of the data answering where the
data came from and how accurate it is. The
amount of data from varies sources and the
amount of data directly affects the value and
quality of the data.
1.2 The Risks of Big Data
Due to the nature of Big Data and technology
advancement, less expertise of data scientists
that handle data and do visualization, Big Data
brings the following risks when considering using
it in organizations [4].
Visualizing Big Data — 3/4
Bad data and bad data analysis: collecting data
cannot support companies business to grow and
compete with similar businesses. Irrelevant and
outdated data can turn the company business
decision to undesired outcomes. While analysing,
misinterpreting data patters and trends in the
data is also risky.
Security: since Big Data contains data that can
be sensitive considering the logistics of data col-
lection and analysis can be insecure and may
expose data. Besides the bigger the data, the
bigger the risk will become to companies. Data
privacy can also be mentioned closely in related
to the issue of security.
Costs: collecting data, aggregating and process-
ing it, storing and then analysing it to generate
reports and visual graphs it requires budget plan-
ning and money, thus Big Data might be costly
compare to the efficiency of the data later.
2. Big Data Analytics
Today, several organizations are collecting and
storing petabytes and exabytes of data from cus-
tomers, sensors, transactions and various data
sources that requests high-performance analytics
and with new advances in computing technol-
ogy to process the data in order to figure out
what’s important and what isn’t. Analytics is
simply knowledge discovery and extracting valu-
able trends in data that can be visualized for
insights, unknown correlations and hidden pat-
terns. Using high-performance data and text
mining, diagnostic and predictive analytics, pre-
scriptive analytics for forecasting and optimiza-
tion to make the best possible business decisions
for an organization [5].
2.1 Types of Analytics
Data visualization refers to the approaches and
tools used to visually understand the insights
from data to prove or disprove a hypothesis.
There are four types of analytics organizations
use to make high-performance analytics to in-
crease competence in their domains.
1. Descriptive analytics is a set of techniques
for reviewing and examining the data set to
understand the data and analyse business per-
formance to provide insight about the past.
2. Diagnostic analytics is a set of techniques for
determine what has happened,why services
demand is low/high, customer segmentation,
any trend from the data.
3. Predictive a set of techniques that analyse
current and historical data to determine what
is most likely to (not) happen, who need more,
helps model and forecast.
4. Prescriptive analytics is a set of techniques for
computationally developing and analysing al-
ternatives in tactical or strategical model and
discover the unexpected, seeks to determine
the best solution or outcome among various
choices, given the known parameters.
3. BI and Big Data
Business Intelligence is a technique or process,
technology and tools required to collect, store,
analyse data into valuable information and ben-
efit from analysing and taking efficient business
advantages. BI provides users to easily consoli-
date, search, and visually analyse data and gain
unexpected business insights by understanding
how data is associated.
Usefulness of BI
• Optimize business processes
• Better decision making on every level in
the organization based on fact
• Create better customer experience
• Improve competitiveness and
• Increase revenues, etc.
Data discovery and visualization in BI can be
• Reports
• Scorecards
• Dashboards
• Ad hoc analysis
• Visualization
Visualizing Big Data — 4/4
4. Conclusion
To summarize, finding ways to use Big Data
and analytics has become a bigger concern from
small and mid-sized businesses to big corporates
that are looking for ways keep up with larger
competitors. Big Data enhance organizations in
their product quality, improve marketing opera-
tions and further customer relationships, better
business decisions, etc. BI and data visualiza-
tion are very essential for utilizing Big Data and
implementing using descriptive, diagnostic, pre-
dictive, prescriptive or combination of the four
types of analytics. Although Big Data has risks
and weakness, it is called renewable oil that can
be applicable in many organizations and fields.
References
[1] Big data, what it is and why it matters.
[2] Tomas Eklund. Dw lecture 4: Big data, ˚Aau.
4.5.2015.
[3] Big data analytics, advanced analytics in
oracle database. 2013.
[4] Bernard Marr. The 5 biggest risks of big
data.
[5] Marko Grobelnik. Big data tutorial.

More Related Content

PPTX
Big Data Forum - Phoenix
PDF
The Comparison of Big Data Strategies in Corporate Environment
PPTX
Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...
PDF
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
PPTX
Big Data Analytics Proposal #1
PPT
Big data
PDF
Business Intelligence & its Best Practices
PPTX
Big Data Impact on Purchasing and SCM - PASIA World Conference Discussion
Big Data Forum - Phoenix
The Comparison of Big Data Strategies in Corporate Environment
Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
Big Data Analytics Proposal #1
Big data
Business Intelligence & its Best Practices
Big Data Impact on Purchasing and SCM - PASIA World Conference Discussion

What's hot (20)

PDF
Data Management
PDF
Using Big Data Smarter Decision Making
PDF
Big data assignment
PPTX
Unlocking value in your (big) data
PPTX
Importance of Big data for your Business
PDF
Big Data Analytics Architecture PowerPoint Presentation Slides
PDF
Understanding big data and data analytics big data
PPTX
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
PDF
The Data Science Institute-Cognitive Solutions
PPTX
Analytics for actuaries cia
PDF
Big data – A Review
PDF
The Economic Value of Data: A New Revenue Stream for Global Custodians
PDF
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
PPT
Data mining Introduction
PDF
Big Data Analytics
PPTX
Big data
PDF
Managing Data as a Strategic Resource – Foundation of the Digital and Data-Dr...
PDF
Applying Data Quality Best Practices at Big Data Scale
PDF
How to Create and Manage a Successful Analytics Organization
Data Management
Using Big Data Smarter Decision Making
Big data assignment
Unlocking value in your (big) data
Importance of Big data for your Business
Big Data Analytics Architecture PowerPoint Presentation Slides
Understanding big data and data analytics big data
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
The Data Science Institute-Cognitive Solutions
Analytics for actuaries cia
Big data – A Review
The Economic Value of Data: A New Revenue Stream for Global Custodians
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Data mining Introduction
Big Data Analytics
Big data
Managing Data as a Strategic Resource – Foundation of the Digital and Data-Dr...
Applying Data Quality Best Practices at Big Data Scale
How to Create and Manage a Successful Analytics Organization
Ad

Viewers also liked (15)

PDF
Common Browser Hijacking Methods
PPT
Presentación Zombies
PPT
Investment Banking Interview Guide
PPTX
Otara Community
PPSX
PPT
iPhone + Botnets = Fun
PDF
第一回朝食会(趣旨説明資料)20,111,027
PDF
ビジネス構造化経営研究会第一回朝食会 20111027
PPTX
Extorsiones mediante DDoS
PPTX
Ataques dirigidos contra activistas
PDF
Big Data Introduction
PPT
Yersinia - Demostraciones prácticas de nuevos ataques de nivel dos
PPT
7 Pengembangan Materi Pembelajaran
PPTX
Ayat - Ayat Komunikasi
PPSX
Humas Dalam Organisasi
Common Browser Hijacking Methods
Presentación Zombies
Investment Banking Interview Guide
Otara Community
iPhone + Botnets = Fun
第一回朝食会(趣旨説明資料)20,111,027
ビジネス構造化経営研究会第一回朝食会 20111027
Extorsiones mediante DDoS
Ataques dirigidos contra activistas
Big Data Introduction
Yersinia - Demostraciones prácticas de nuevos ataques de nivel dos
7 Pengembangan Materi Pembelajaran
Ayat - Ayat Komunikasi
Humas Dalam Organisasi
Ad

Similar to Introduction to visualizing Big Data (20)

PPTX
Evolution & Introduction to Big data-2.pptx
PDF
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
PDF
IRJET - Big Data Analysis its Challenges
PDF
Analysis of Big Data
PPTX
Foundations of Big Data: Concepts, Techniques, and Applications
PPTX
Bigdata Hadoop introduction
PDF
IRJET- Big Data Management and Growth Enhancement
PDF
All About Big Data
PPTX
data analytics lecture2.pptx
PDF
What's the Big Deal About Big Data?
PPTX
Guide to big data analytics
DOCX
PDF
Big Data Analysis
PDF
bda-unit-bda-unit-materail big data1.pdf
PPTX
Big data Analytics Unit - CCS334 Syllabus
DOCX
Introduction to big data – convergences.
PDF
BigData Analytics_1.7
PPTX
Introduction to Big Data
PPTX
Big data
PDF
Bda assignment can also be used for BDA notes and concept understanding.
Evolution & Introduction to Big data-2.pptx
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
IRJET - Big Data Analysis its Challenges
Analysis of Big Data
Foundations of Big Data: Concepts, Techniques, and Applications
Bigdata Hadoop introduction
IRJET- Big Data Management and Growth Enhancement
All About Big Data
data analytics lecture2.pptx
What's the Big Deal About Big Data?
Guide to big data analytics
Big Data Analysis
bda-unit-bda-unit-materail big data1.pdf
Big data Analytics Unit - CCS334 Syllabus
Introduction to big data – convergences.
BigData Analytics_1.7
Introduction to Big Data
Big data
Bda assignment can also be used for BDA notes and concept understanding.

Recently uploaded (20)

PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
1_Introduction to advance data techniques.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Mega Projects Data Mega Projects Data
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPTX
Global journeys: estimating international migration
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
1_Introduction to advance data techniques.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
climate analysis of Dhaka ,Banglades.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
Mega Projects Data Mega Projects Data
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Supervised vs unsupervised machine learning algorithms
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Clinical guidelines as a resource for EBP(1).pdf
Data_Analytics_and_PowerBI_Presentation.pptx
Introduction to Knowledge Engineering Part 1
Moving the Public Sector (Government) to a Digital Adoption
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Miokarditis (Inflamasi pada Otot Jantung)
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Global journeys: estimating international migration
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb

Introduction to visualizing Big Data

  • 1. Academic Skills in English, Oct. 12.2015 Ins. Colette Gattoni (M.Ed.) Visualizing Big Data Dawit Nida Abstract Ever since the creation of computers, data have been generated on a continues manner, in several forms and structures. Many organizations are utilizing Big Data, a huge amount of data, that have not been handled by the traditional data management systems, to make their business decisions. Big Data can be referred to unstructured, semi-structured or structured data generated by users. Unstructured data, such as emails, texts, data from sensors, etc. structured data includes transactions. Big Data can be characterized by the four Vs, Volume, Velocity, Variety and Veracity. Handling Big Data requires expertise in the field to minimize the risks in security, data management and analytics. Data scientists who make different kinds of analytics for organizations, use a set of techniques for processing big amount data to create business decisions using business intelligence tools and methods. Keywords Big Data, Analytic, Business Intelligence Contents Introduction 1 1 Why Big Data is ‘BIG’? 1 1.1 The Four Vs of Big Data . . . . . . . . . . . . . 2 1.2 The Risks of Big Data . . . . . . . . . . . . . . . . 2 2 Big Data Analytics 3 2.1 Types of Analytics . . . . . . . . . . . . . . . . . . . 3 3 BI and Big Data 3 4 Conclusion 4 References 4 Introduction Big Data is a huge amount of data, that have not been handled by the traditional data manage- ment systems. To handle these data, analytics is required. Analytics is a knowledge discovery and extracting valuable trends in data that can be visualized for insights and patterns using a set of techniques and tool required to collect, store, visualize data into valuable information and benefit from analysing and making efficient business decisions. 1. Why Big Data is ‘BIG’? Big data is a form of data, but huge amount of data, from several sources in varies forms that have not been handled by the traditional data management systems. Big data can be referred to unstructured, semi-structured or structured data. Unstructured data from social media and sensors includes emails, texts, data from various sensors, video and audio data, and etc. struc- tured data includes transaction data from cus- tomers, etc. For instance, based on the infographic post on iDigitalTimes (July 2013) [1][2], in every 60 sec- onds on the internet the following happens. • 204 mil. emails sent out • 1.8 mil. Facebook likes are generated by users • 278,000 Tweets are twitted • 200,000 photos are uploaded to Facebook • Around 100 hours of video are uploaded to YouTube • 20 mil. photos are viewed on Flickr • 120 new users register to LinkedIn • 88,000 calls made on Skype • €73,000 is spent on Amazon
  • 2. Visualizing Big Data — 2/4 Figure 1. Two selected Big Data articles analysed using R-programming language 1.1 The Four Vs of Big Data To define the features of Big Data, the four Vs are described below[3]. Volume: refers to the amount of data generated every millisecond. Huge number of transactional data stored every second, machine and sensor data, social media, and enterprise data is being collected and stored. Even though, storage cost is not a big issue, sorting out the data with respect to their relevancy is today’s problem. Velocity: refers to the speed of streaming data that is required to be executed in real time. It is not only how fast data is produced or changed but also the speed it has to be received, under- stood and processed. Variety: in addition to the size and the speed, data also varies with respect to structure and types. For instance email, photos, video or audio data are few examples of unstructured data that can be used with traditional structured data that fits into tables and relational databases. Veracity: refers to the trustworthiness (uncer- tainty of data) of the data answering where the data came from and how accurate it is. The amount of data from varies sources and the amount of data directly affects the value and quality of the data. 1.2 The Risks of Big Data Due to the nature of Big Data and technology advancement, less expertise of data scientists that handle data and do visualization, Big Data brings the following risks when considering using it in organizations [4].
  • 3. Visualizing Big Data — 3/4 Bad data and bad data analysis: collecting data cannot support companies business to grow and compete with similar businesses. Irrelevant and outdated data can turn the company business decision to undesired outcomes. While analysing, misinterpreting data patters and trends in the data is also risky. Security: since Big Data contains data that can be sensitive considering the logistics of data col- lection and analysis can be insecure and may expose data. Besides the bigger the data, the bigger the risk will become to companies. Data privacy can also be mentioned closely in related to the issue of security. Costs: collecting data, aggregating and process- ing it, storing and then analysing it to generate reports and visual graphs it requires budget plan- ning and money, thus Big Data might be costly compare to the efficiency of the data later. 2. Big Data Analytics Today, several organizations are collecting and storing petabytes and exabytes of data from cus- tomers, sensors, transactions and various data sources that requests high-performance analytics and with new advances in computing technol- ogy to process the data in order to figure out what’s important and what isn’t. Analytics is simply knowledge discovery and extracting valu- able trends in data that can be visualized for insights, unknown correlations and hidden pat- terns. Using high-performance data and text mining, diagnostic and predictive analytics, pre- scriptive analytics for forecasting and optimiza- tion to make the best possible business decisions for an organization [5]. 2.1 Types of Analytics Data visualization refers to the approaches and tools used to visually understand the insights from data to prove or disprove a hypothesis. There are four types of analytics organizations use to make high-performance analytics to in- crease competence in their domains. 1. Descriptive analytics is a set of techniques for reviewing and examining the data set to understand the data and analyse business per- formance to provide insight about the past. 2. Diagnostic analytics is a set of techniques for determine what has happened,why services demand is low/high, customer segmentation, any trend from the data. 3. Predictive a set of techniques that analyse current and historical data to determine what is most likely to (not) happen, who need more, helps model and forecast. 4. Prescriptive analytics is a set of techniques for computationally developing and analysing al- ternatives in tactical or strategical model and discover the unexpected, seeks to determine the best solution or outcome among various choices, given the known parameters. 3. BI and Big Data Business Intelligence is a technique or process, technology and tools required to collect, store, analyse data into valuable information and ben- efit from analysing and taking efficient business advantages. BI provides users to easily consoli- date, search, and visually analyse data and gain unexpected business insights by understanding how data is associated. Usefulness of BI • Optimize business processes • Better decision making on every level in the organization based on fact • Create better customer experience • Improve competitiveness and • Increase revenues, etc. Data discovery and visualization in BI can be • Reports • Scorecards • Dashboards • Ad hoc analysis • Visualization
  • 4. Visualizing Big Data — 4/4 4. Conclusion To summarize, finding ways to use Big Data and analytics has become a bigger concern from small and mid-sized businesses to big corporates that are looking for ways keep up with larger competitors. Big Data enhance organizations in their product quality, improve marketing opera- tions and further customer relationships, better business decisions, etc. BI and data visualiza- tion are very essential for utilizing Big Data and implementing using descriptive, diagnostic, pre- dictive, prescriptive or combination of the four types of analytics. Although Big Data has risks and weakness, it is called renewable oil that can be applicable in many organizations and fields. References [1] Big data, what it is and why it matters. [2] Tomas Eklund. Dw lecture 4: Big data, ˚Aau. 4.5.2015. [3] Big data analytics, advanced analytics in oracle database. 2013. [4] Bernard Marr. The 5 biggest risks of big data. [5] Marko Grobelnik. Big data tutorial.