SlideShare a Scribd company logo
What Is Big Data ?
By Ashwin Pednekar
Email : ashwinpednekar@gmail.com
Agenda
• Introduction to Big Data ( What’s so Big about Big Data ? )
• Understanding Big Data
• Use and Benefits of Big Data
• Technologies used in Big Data
• Famous Quotes about Big Data
What is so Big about Big Data ?
• We have heard big data defined in many, many
different ways, and so, I’m not surprised there’s
so much confusion surrounding the term.
Because of all the misunderstanding and
misperceptions
• Big data is a collection of data from traditional
and digital sources inside and outside your
company that represents a source for ongoing
discovery and analysis
Understanding Big Data
Enterprise need to Fully understand Big Data
• what it is to them,
• what is does for them
• what it means to them
Understand Data Itself :
• Structured Data
• Unstructured Data
Structured Data :
Structured data refers to information with a high degree of
organization, such that inclusion in a relational database is
seamless and readily searchable by simple, straightforward search
engine algorithms or other search operations
Unstructured Data :
Unstructured data usually refers to information that doesn't
reside in a traditional row-column database and not organized or
Structured Logically
Examples include e-mail messages, word processing documents,
videos
The management of unstructured data is recognized as one of the
major unsolved problems in the information technology (IT)
industry, the main reason being that the tools and techniques that
have proved so successful transforming structured data into
business intelligence and actionable information simply don't
work when it comes to unstructured data. New approaches are
necessary.
 Many organizations are missing out on what data experts agree is an opportunity to derive significant business
value from properly harnessing unstructured data. IDC, estimates that unstructured content already accounts
for a staggering 90 percent of all digital data, much of which is locked away across a variety of different data
stores, in different locations and in varying formats.
 Unstructured data can help companies gain a better understanding of their customers, products, services and
business in general. For example, data from Twitter streams, social media networks and web logs can help a
company gauge customer sentiment toward a product or service, or help identify and address a potential service
or quality issue before it becomes a full-fledged problem. Combining existing data about customers from
transactional systems with data gathered about them from other sources can help an organization get closer to a
360-degree view of its customers.
And an Answer to achieve this is “Big Data” Technologies and Methods
Use and Benefits of Big Data
Today’s consumers are a tough nut to crack. They look around a lot before they buy, talk to their entire social
network about their purchases, demand to be treated as unique and want to be sincerely thanked for buying
your products. Big Data allows you to profile these increasingly vocal and fickle little ‘tyrants’ in a far-reaching
manner so that you can engage in an almost one-on-one, real-time conversation with them. This is not
actually a luxury. If you don’t treat them like they want to, they will leave you in the blink of an eye.
Just a small example: when any customer enters a bank, Big Data tools allow the clerk to check his/her profile
in real-time and learn which relevant products or services (s)he might advise. Big Data will also have a key role
to play in uniting the digital and physical shopping spheres: a retailer could suggest an offer on a mobile
carrier, on the basis of a consumer indicating a certain need in the social media
 Big Data can also help you understand how others perceive
your products so that you can adapt them, or your marketing,
if need be. Analysis of unstructured social media text allows
you to uncover the sentiments of your customers and even
segment those in different geographical locations or among
different demographic groups.
 Success not only depends on how you run your company.
Social and economic factors are crucial for your
accomplishments as well. Predictive analytics, fueled by Big
Data allows you to scan and analyze newspaper reports or
social media feeds so that you permanently keep up to speed
on the latest developments in your industry and its
environment. Detailed health-tests on your suppliers and
customers are another goodie that comes with Big Data. This
will allow you to take action when one of them is in risk of
defaulting.
 The insights that you gain from analyzing your market and its consumers with Big Data are not just valuable to
you. You could sell them as non-personalized trend data to large industry players operating in the same segment
as you and create a whole new revenue stream.
One of the more impressive examples comes from Shazam, the song identification application. It helps
record labels find out where music sub-cultures are arising by monitoring the use of its service, including
the location data that mobile devices so conveniently provide. The record labels can then find and sign
up promising new artists or remarket their existing ones accordingly.
 Previously, if business users needed to analyze large amounts of varied data, they had to ask their IT colleagues
for help as they themselves lacked the technical skills for doing so. Often, by the time they received the
requested information, it was no longer useful or even correct. With Big Data tools, the technical teams can do
the groundwork and then build repeatability into algorithms for faster searches. In other words, they can
develop systems and install interactive and dynamic visualization tools that allow business users to analyze, view
and benefit from the data
Tools and Technologies used in “ Big Data”
Hadoop :
An open source (free) software framework for processing huge datasets on
certain kinds of problems on a distributed system. Its development was
inspired by Google’s MapReduce and Google File System. It was originally
developed at Yahoo! and is now managed as a project of the Apache
Software Foundation
R Programming:
An open source (free) programming language and software
environment for statistical computing and graphics. The R
language has become a de facto standard among statisticians for
developing statistical software and is widely used for statistical
software development and data analysis. R is part of the GNU
Project, a collaboration that supports open source projects.
Spark :
Apache Spark is a fast and general-purpose cluster computing
system designed for processing data in parallel at a large scale
Python NLTK : is a leading platform for building Python
programs to work with human language data. It provides easy-to-
use interfaces to over 50 corpora and lexical resources, along with
a suite of text processing libraries for classification, tokenization,
stemming, tagging, parsing, and semantic reasoning.
MongoDB : is a cross-platform document-oriented database that
stores data into JSON-like documents.
There are many such tools used for Data Analytics , Data mining , Visual and Statistical Analysis .
Big Data is huge ecosystem of such tools and technologies
What is big data
What is big data
References :
http://www.information-management.com/issues/20030201/6287-1.html
http://www.webopedia.com/TERM/U/unstructured_data.html
http://www.cio.com/article/2941015/big-data/solving-the-unstructured-data-challenge.html
https://www.axian.com/tag/big-data/
https://www.google.co.in/imgres?imgurl=http://www.centrodeinnovacionbbva.com/sites/default/files/bigdata_ejemplos_cibbva.jpg&imgrefurl=h
ttp://www.centrodeinnovacionbbva.com/en/news/practical-examples-big-data-
use&h=2832&w=4256&tbnid=15rmFVCEu2XbbM:&docid=SGbEeODTAENtjM&ei=a26kVomoKoK7uATRsZTgAw&tbm=isch&ved=0ahUK
EwjJiYuv6MHKAhWCHY4KHdEYBTwQMwiJAShjMGM
http://datascienceseries.com/stories/ten-practical-big-data-benefits
https://www.google.co.in/imgres?imgurl=http://qubole2.wpengine.com/wp-content/uploads/2014/02/Social-Media-Marketing-Best-Practices-
with-Big-Data_big.png&imgrefurl=https://www.qubole.com/blog/big-data/social-media-marketing-best-practices-big-
data/&h=430&w=810&tbnid=1SY6oDkBee2uaM:&docid=ReLGWMlJ1N6fLM&ei=rnGkVqXJEMeiugTOhZ54&tbm=isch&ved=0ahUKEwil2I
W968HKAhVHkY4KHc6CBw8QMwgbKAAwAA
https://www.google.co.in/search?q=big+data+for+consumer+goods&biw=1600&bih=789&source=lnms&tbm=isch&sa=X&ved=0ahUKEwj01
bPs6sHKAhUScI4KHTKrDssQ_AUIBygC#tbm=isch&q=hadoop+spark+MongoDB&imgrc=9RmNCk-nVWpj3M%3A
https://www.google.co.in/search?q=tools+and+techniques+used+in+Big+data&biw=1600&bih=789&source=lnms&tbm=isch&sa=X&ved=0ah
UKEwi2-IzF78HKAhWKPo4KHQFgAkAQ_AUIBygC&dpr=1#tbm=isch&q=Big+Data&imgrc=HcXyuA5oc6GTtM%3A
https://www.google.co.in/search?q=tools+and+techniques+used+in+Big+data&biw=1600&bih=789&source=lnms&tbm=isch&sa=X&ved=0ah
UKEwi2-IzF78HKAhWKPo4KHQFgAkAQ_AUIBygC&dpr=1#tbm=isch&q=quotes+on+Big+Data&imgrc=uaQato2b9XDa-M%3A
http://bigdata-madesimple.com/26-popular-techniques-for-analysing-big-data/
What is big data

More Related Content

PDF
Big agendas for big data analytics projects
PDF
Getting down to business on Big Data analytics
PDF
The dawn of Big Data
PDF
Snowball Group Whitepaper - Spotlight on Big Data
PPTX
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
PDF
Accelerate Data Discovery
PDF
2. Smart Data Discovery
Big agendas for big data analytics projects
Getting down to business on Big Data analytics
The dawn of Big Data
Snowball Group Whitepaper - Spotlight on Big Data
Making ‘Big Data’ Your Ally – Using data analytics to improve compliance, due...
Accelerate Data Discovery
2. Smart Data Discovery

What's hot (20)

DOCX
Understanding Dark Data
PDF
Suburbia Sales Booklet (2019)
PDF
Welcome to Data Science
PPTX
Big data for sales and marketing people
PDF
What's the Big Deal About Big Data?
PDF
Want a Data-Driven Culture? Start Sorting Out the BI and Big Data Myths Now
PDF
Big Data Fundamentals
PDF
bigdatabusinessguide-arzubarske-ver4
PDF
Dark data
PDF
Big data why big data is huge for CPG manufacturers
PDF
Big Data Maturity Model and Governance
PDF
Big Data at a Glance
PDF
Big data Whitepaper
PDF
The Second Big Bang
PPT
Big data
PPTX
What’s The Difference Between Structured, Semi-Structured And Unstructured Data?
DOCX
Policy paper need for focussed big data & analytics skillset building throu...
PDF
Big Data Analytics
PPTX
PPTX
Unit i big data introduction
Understanding Dark Data
Suburbia Sales Booklet (2019)
Welcome to Data Science
Big data for sales and marketing people
What's the Big Deal About Big Data?
Want a Data-Driven Culture? Start Sorting Out the BI and Big Data Myths Now
Big Data Fundamentals
bigdatabusinessguide-arzubarske-ver4
Dark data
Big data why big data is huge for CPG manufacturers
Big Data Maturity Model and Governance
Big Data at a Glance
Big data Whitepaper
The Second Big Bang
Big data
What’s The Difference Between Structured, Semi-Structured And Unstructured Data?
Policy paper need for focussed big data & analytics skillset building throu...
Big Data Analytics
Unit i big data introduction
Ad

Similar to What is big data (20)

PPTX
Big Data in Business Application use case and benefits
DOCX
Introduction to big data – convergences.
PPTX
Big Data Analytics_Unit1.pptx
PPTX
Structure data and Unstructured data,Web anlytics.pptx
PDF
Mastering Big Data: Tools, Techniques, and Applications
PPTX
PPTX
big data.pptx
PPTX
Introduction to Big Data
PPTX
Introduction of information technology with the emerging technology
PPTX
This is abouts are you doing the same time who is the best person to be safe and
PDF
Bda assignment can also be used for BDA notes and concept understanding.
PDF
UNIT 1 -BIG DATA ANALYTICS Full.pdf
PPTX
BigDataFinal.pptx
PPTX
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
PPTX
Presentation on Big Data
PPTX
Big Data Analytics
PPTX
Big data
PPTX
Big data analytics
PPTX
Introduction to Big Data
PDF
Lesson_1_definitions_BIG DATA INROSUCTIONUE.pdf
Big Data in Business Application use case and benefits
Introduction to big data – convergences.
Big Data Analytics_Unit1.pptx
Structure data and Unstructured data,Web anlytics.pptx
Mastering Big Data: Tools, Techniques, and Applications
big data.pptx
Introduction to Big Data
Introduction of information technology with the emerging technology
This is abouts are you doing the same time who is the best person to be safe and
Bda assignment can also be used for BDA notes and concept understanding.
UNIT 1 -BIG DATA ANALYTICS Full.pdf
BigDataFinal.pptx
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Presentation on Big Data
Big Data Analytics
Big data
Big data analytics
Introduction to Big Data
Lesson_1_definitions_BIG DATA INROSUCTIONUE.pdf
Ad

Recently uploaded (20)

PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Big Data Technologies - Introduction.pptx
PDF
Electronic commerce courselecture one. Pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Encapsulation_ Review paper, used for researhc scholars
PPT
Teaching material agriculture food technology
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Cloud computing and distributed systems.
PPTX
A Presentation on Artificial Intelligence
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
Per capita expenditure prediction using model stacking based on satellite ima...
Chapter 3 Spatial Domain Image Processing.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Big Data Technologies - Introduction.pptx
Electronic commerce courselecture one. Pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Network Security Unit 5.pdf for BCA BBA.
Reach Out and Touch Someone: Haptics and Empathic Computing
Encapsulation_ Review paper, used for researhc scholars
Teaching material agriculture food technology
MYSQL Presentation for SQL database connectivity
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Modernizing your data center with Dell and AMD
NewMind AI Weekly Chronicles - August'25 Week I
Cloud computing and distributed systems.
A Presentation on Artificial Intelligence
Diabetes mellitus diagnosis method based random forest with bat algorithm

What is big data