SlideShare a Scribd company logo
Big Data
• 'Big Data' is also a data but with a huge size.
• 'Big Data' is a term used to describe collection of data that is huge in
size and yet growing exponentially with time.
• In short, such a data is so large and complex that none of the
traditional data management tools are able to store it or process it
efficiently.
Examples Of 'Big Data'
The New York Stock Exchange generates about one terabyte of new
trade data per day.
Statistic shows that 500+terabytes of new data gets ingested into the
databases of social media site Facebook, every day. This data is mainly
generated in terms of photo and video uploads, message exchanges,
putting comments etc.
Single Jet engine can generate 10+terabytes of data in 30 minutes of
a flight time. With many thousand flights per day, generation of data
reaches up to many Petabytes.
Categories Of Big Data
Structured -- data that can be stored, accessed and
processed in the form of fixed format
Unstructured -- Any data with unknown form or the
structure
Semi-structured -- Semi-structured data can contain
both the forms of data.
Characteristics Of Big Data
Big Data Analytics
• Process of collecting, organizing and analyzing large sets of data
(called Big Data) to discover patterns and other useful information.
• Help organizations to better understand the information contained
within the data
• Analysts working with Big Data typically want the knowledge that comes
from analyzing the data.
• Big Data analytics is typically performed using specialized software tools
and applications for predictive analytics, data mining, text mining,
forecasting and data optimization
Today's advances in analyzing big data allow researchers to
• Decode human DNA in minutes
• Predict where terrorists plan to attack
• Determine which gene is mostly likely to be responsible for certain
diseases
• Which ads you are most likely to respond to on Facebook.
How Big Data Analytics is Used Today
The Challenges
• The first challenge is in breaking down data to access all data an
organization stores in different places and often in different systems.
• Second challenge is in creating platforms that can pull in unstructured
data as easily as structured data.
Big Data Analytics Tools
Hadoop:
it is an open source, Java-based programming framework that supports
the processing and storage of extremely large data sets in a distributed
computing environment
Lumify:
Lumify is a relatively new open source project to create a Big Data fusion
and is a great alternative to Hadoop.
ElasticSearch:
A reliable and secure open source platform that allows users to take any
data from any source, in any format and search, analyze it and visualize
it real time.
MongoDb:
MongoDB is also a great tool to help store and analyze big data, as well
as help make applications.
Big data
Applications of Big Data:
1. Banking and securities
2. Communications, Media and Entertainment
3. Healthcare providers:
4.Education:
5. Manufacturing and Natural Resources
6. Government
7.Insurance
8.Retail and Wholesale trade
Big data
Benefits and Risks of Big
data
1.Benefits:
• Decision making
• Efficiency and productivity
• Research, development and innovation
• Personalization
• Transparency
2. Risks:
• Re-identification
• Privacy framework Obselote?
• Chilling effects
• Anti-competitive practices
• Data redundancy and Dispersion
How Companies make use of
Big Data
• Amazon uses big data to develop personalized recommendation
system.
• Amazon recently obtained a patent for the concept of predictive
dispatch.
• Google uses big data analytics to provide predictive search results
• Netflix relies on the data it collects from its customers to determine
which genre of programs are likely to be viewed more than other.
Future Of Big Data
• Machine Learning will be the Next Big thing in Big Data.
• Privacy will be the Biggest Challenge.
• Data Scientists Will Be In High Demand –(The Hindu predicts that by
end of 2018, India alone will face a shortage of close to two lakh
Data Scientists)
• Big Data Will Be Replaced By Fast and Actionable Data
Big data
Thank you

More Related Content

PPT
Big Data
DOCX
JPJ1417 Data Mining With Big Data
PPT
Data mining with big data
PPTX
PPTX
Data mining on big data
PPTX
Data Mining With Big Data
PPTX
Overview of Big data(ppt)
PPTX
Big data
Big Data
JPJ1417 Data Mining With Big Data
Data mining with big data
Data mining on big data
Data Mining With Big Data
Overview of Big data(ppt)
Big data

What's hot (20)

PPTX
Understanding big data
PDF
Big Data, Big Deal: For Future Big Data Scientists
PPTX
Mining Big Data in Real Time
PPTX
Introduction to Big Data & Big Data 1.0 System
PPTX
Big Stream Processing Systems, Big Graphs
PPTX
Data mining with big data
PPTX
Data mining with big data implementation
PPTX
Big data
PDF
Data minig with Big data analysis
PPTX
Big data seminor
PPT
Big data
PPTX
big data Presentation
PPTX
Big data and data mining
PPT
Introduction to Big Data & Hadoop
PPTX
PPTX
Big data
PPTX
Data mining with big data
PPTX
ODP
Big Data Presentation
PPTX
Understanding big data
Big Data, Big Deal: For Future Big Data Scientists
Mining Big Data in Real Time
Introduction to Big Data & Big Data 1.0 System
Big Stream Processing Systems, Big Graphs
Data mining with big data
Data mining with big data implementation
Big data
Data minig with Big data analysis
Big data seminor
Big data
big data Presentation
Big data and data mining
Introduction to Big Data & Hadoop
Big data
Data mining with big data
Big Data Presentation
Ad

Similar to Big data (20)

PPTX
Foundations of Big Data: Concepts, Techniques, and Applications
PPTX
Unit-I- Introduction- Traits of Big Data-Final.pptx
PPTX
Evolution & Introduction to Big data-2.pptx
PPTX
Bigdata and Hadoop with applications
PPTX
Big data
PPTX
Bigdata " new level"
PPTX
Kartikey tripathi
PPTX
Unit 1 (DSBDA) PD.pptx
PDF
UNIT 1 -BIG DATA ANALYTICS Full.pdf
PPTX
PresentationBig Data111111111111111.pptx
PPTX
Data mining with big data
PPTX
ppt final.pptx
PDF
Bigdatappt 140225061440-phpapp01
PPTX
Presentation on Big Data
PPTX
BigData.pptx
PPTX
Big data
PPTX
Special issues on big data
PPTX
In memory big data management and processing
DOCX
Content1. Introduction2. What is Big Data3. Characte.docx
PPTX
Big data ppt
Foundations of Big Data: Concepts, Techniques, and Applications
Unit-I- Introduction- Traits of Big Data-Final.pptx
Evolution & Introduction to Big data-2.pptx
Bigdata and Hadoop with applications
Big data
Bigdata " new level"
Kartikey tripathi
Unit 1 (DSBDA) PD.pptx
UNIT 1 -BIG DATA ANALYTICS Full.pdf
PresentationBig Data111111111111111.pptx
Data mining with big data
ppt final.pptx
Bigdatappt 140225061440-phpapp01
Presentation on Big Data
BigData.pptx
Big data
Special issues on big data
In memory big data management and processing
Content1. Introduction2. What is Big Data3. Characte.docx
Big data ppt
Ad

More from Joseph Sebastian (6)

PPTX
directives and instructions
PPTX
reinventing IT at BP case analysis
PPTX
Ritz carlton
PPTX
15 largest mn cs
PPTX
Firewall and vpn
PPTX
Service experience ppt
directives and instructions
reinventing IT at BP case analysis
Ritz carlton
15 largest mn cs
Firewall and vpn
Service experience ppt

Recently uploaded (20)

PPTX
Cell Types and Its function , kingdom of life
PDF
Complications of Minimal Access Surgery at WLH
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
01-Introduction-to-Information-Management.pdf
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PDF
Insiders guide to clinical Medicine.pdf
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Business Ethics Teaching Materials for college
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Cell Types and Its function , kingdom of life
Complications of Minimal Access Surgery at WLH
Microbial diseases, their pathogenesis and prophylaxis
Abdominal Access Techniques with Prof. Dr. R K Mishra
STATICS OF THE RIGID BODIES Hibbelers.pdf
Supply Chain Operations Speaking Notes -ICLT Program
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
01-Introduction-to-Information-Management.pdf
human mycosis Human fungal infections are called human mycosis..pptx
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
TR - Agricultural Crops Production NC III.pdf
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
2.FourierTransform-ShortQuestionswithAnswers.pdf
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
Insiders guide to clinical Medicine.pdf
PPH.pptx obstetrics and gynecology in nursing
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Business Ethics Teaching Materials for college
Final Presentation General Medicine 03-08-2024.pptx
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...

Big data

  • 2. • 'Big Data' is also a data but with a huge size. • 'Big Data' is a term used to describe collection of data that is huge in size and yet growing exponentially with time. • In short, such a data is so large and complex that none of the traditional data management tools are able to store it or process it efficiently.
  • 3. Examples Of 'Big Data' The New York Stock Exchange generates about one terabyte of new trade data per day. Statistic shows that 500+terabytes of new data gets ingested into the databases of social media site Facebook, every day. This data is mainly generated in terms of photo and video uploads, message exchanges, putting comments etc. Single Jet engine can generate 10+terabytes of data in 30 minutes of a flight time. With many thousand flights per day, generation of data reaches up to many Petabytes.
  • 5. Structured -- data that can be stored, accessed and processed in the form of fixed format Unstructured -- Any data with unknown form or the structure Semi-structured -- Semi-structured data can contain both the forms of data.
  • 8. • Process of collecting, organizing and analyzing large sets of data (called Big Data) to discover patterns and other useful information. • Help organizations to better understand the information contained within the data • Analysts working with Big Data typically want the knowledge that comes from analyzing the data. • Big Data analytics is typically performed using specialized software tools and applications for predictive analytics, data mining, text mining, forecasting and data optimization
  • 9. Today's advances in analyzing big data allow researchers to • Decode human DNA in minutes • Predict where terrorists plan to attack • Determine which gene is mostly likely to be responsible for certain diseases • Which ads you are most likely to respond to on Facebook. How Big Data Analytics is Used Today
  • 10. The Challenges • The first challenge is in breaking down data to access all data an organization stores in different places and often in different systems. • Second challenge is in creating platforms that can pull in unstructured data as easily as structured data.
  • 12. Hadoop: it is an open source, Java-based programming framework that supports the processing and storage of extremely large data sets in a distributed computing environment Lumify: Lumify is a relatively new open source project to create a Big Data fusion and is a great alternative to Hadoop. ElasticSearch: A reliable and secure open source platform that allows users to take any data from any source, in any format and search, analyze it and visualize it real time. MongoDb: MongoDB is also a great tool to help store and analyze big data, as well as help make applications.
  • 14. Applications of Big Data: 1. Banking and securities 2. Communications, Media and Entertainment 3. Healthcare providers: 4.Education: 5. Manufacturing and Natural Resources 6. Government 7.Insurance 8.Retail and Wholesale trade
  • 16. Benefits and Risks of Big data
  • 17. 1.Benefits: • Decision making • Efficiency and productivity • Research, development and innovation • Personalization • Transparency 2. Risks: • Re-identification • Privacy framework Obselote? • Chilling effects • Anti-competitive practices • Data redundancy and Dispersion
  • 18. How Companies make use of Big Data
  • 19. • Amazon uses big data to develop personalized recommendation system. • Amazon recently obtained a patent for the concept of predictive dispatch. • Google uses big data analytics to provide predictive search results • Netflix relies on the data it collects from its customers to determine which genre of programs are likely to be viewed more than other.
  • 20. Future Of Big Data • Machine Learning will be the Next Big thing in Big Data. • Privacy will be the Biggest Challenge. • Data Scientists Will Be In High Demand –(The Hindu predicts that by end of 2018, India alone will face a shortage of close to two lakh Data Scientists) • Big Data Will Be Replaced By Fast and Actionable Data