Welcome to the World of Big Data & Hadoop 
www.easylearning.guru
Agenda 
What is Big Data?
Different Kinds of Big Data
Big Data Global Market
Hadoop Global Job Trends
What is Hadoop?
What is Big Data? 
Big data is the term for a collection of data 
sets so large and complex that it becomes 
difficult to process using on-hand database 
management tools or traditional data 
processing applications. 
Types of Big Data
A traditional RDBMS deals only with structured data.
Semi-structured data
A technology is needed that handles semi-structured and
unstructured data as well as structured data.
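The distinction between the three kinds of data can be illustrated with a small sketch. The records below are hypothetical and not taken from the slides; the point is only that structured rows share one fixed set of fields, semi-structured records describe themselves but may differ from one another, and unstructured text has no fields at all.

```python
import csv
import io
import json

# Structured: every row has the same fixed columns, as in an RDBMS table.
# (Hypothetical sample rows.)
structured = list(csv.DictReader(io.StringIO("id,name,age\n1,Asha,31\n2,Ravi,27\n")))

# Semi-structured: self-describing records whose fields may vary.
semi_structured = [
    json.loads(text)
    for text in (
        '{"id": 1, "name": "Asha", "tags": ["admin"]}',
        '{"id": 2, "name": "Ravi", "address": {"city": "Gurgaon"}}',
    )
]

# Unstructured: free text with no predefined fields.
unstructured = "Asha posted a photo and wrote that the new phone works great."

# Collect the distinct field sets seen in each collection.
structured_fields = {frozenset(row) for row in structured}
semi_fields = {frozenset(record) for record in semi_structured}
```

Here `structured_fields` contains a single field set, while `semi_fields` contains one per record shape, which is why a fixed relational schema fits the first collection but not the second.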
The 3V’s of Big Data 
Sources of Data 
Social Media & Networks 
(All of us are generating data) 
Mobile Devices 
(Tracking all the objects all the time) 
Sensor Technology & Networks 
(Measuring all kinds of data) 
Scientific Instruments 
(Collecting all sorts of data) 
Where is Big Data used?
Facebook Scenario
Facebook generates on average 70 thousand MB in 1 minute.
1 hour = 70,000 MB * 60 = 4.2 million MB
1 day = 4.2 million MB * 24 = 100.8 million MB = 98,438 GB (at 1 GB = 1,024 MB)
1 week = 98,438 GB * 7 = 689 thousand GB = 673 TB
4 weeks = 673 TB * 4 = 2,692 TB = 2.6 PB
52 weeks = 2.63 PB * 13 = 34.2 PB
And that's a lot of data!
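The projection can be recomputed directly from the 70,000 MB per minute figure. The following is a minimal sketch, assuming binary units (1 GB = 1,024 MB, and so on) and a constant rate over the whole period; the function and constant names are illustrative only.

```python
MB_PER_MINUTE = 70_000  # rate quoted on the slide

# Binary unit factors, expressed in MB (an assumption; decimal units
# would give slightly larger numbers).
UNIT_IN_MB = {"MB": 1, "GB": 1024, "TB": 1024**2, "PB": 1024**3}

MINUTES_PER_DAY = 60 * 24


def volume(days: float, unit: str = "MB") -> float:
    """Data generated over `days` days at the constant per-minute rate."""
    total_mb = MB_PER_MINUTE * MINUTES_PER_DAY * days
    return total_mb / UNIT_IN_MB[unit]


per_day_gb = volume(1, "GB")
per_week_tb = volume(7, "TB")
per_4_weeks_pb = volume(28, "PB")
per_52_weeks_pb = volume(7 * 52, "PB")

if __name__ == "__main__":
    print(f"1 day    : {per_day_gb:,.0f} GB")
    print(f"1 week   : {per_week_tb:,.0f} TB")
    print(f"4 weeks  : {per_4_weeks_pb:.2f} PB")
    print(f"52 weeks : {per_52_weeks_pb:.1f} PB")
```

Note that 52 weeks is 13 four-week periods, so the yearly figure is 13 times the four-week figure, not 52 times.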
Various Big Data Technologies
Big Data Global Market
[Chart: Big Data Implementation (Implemented Big Data vs. Yet to Implement Big Data)]
[Chart: Big Data Growth (in USD Billions), 2012 to 2017, axis 0 to 60]
[Chart: Filled/Vacancy (%) for Data Scientist, Big Data Visualizer, Big Data Research Analyst, Big Data Engineer, Big Data Architect, Big Data Analyst. Values as extracted: 50, 44, 43, 31, 23, 18 and 50, 56, 57, 69, 77, 82 (legend: Filled, Unfilled)]
Sources: Dice, LinkedIn.
Hadoop Global Job Trends 
Top Hadoop Technology Companies 
Sources : Dice, LinkedIn. 
More than 17,000 employees with Hadoop skills
across these companies
Hadoop Global Job Trends
[Chart: Demand for Big Data in Cities. Shares as extracted: 2%, 2%, 3%, 4%, 8%, 8%, 10%, 11%, 14%, 38%; city labels not recoverable]
[Chart: Salary (USD p.a., in thousands), axis 0 to 120]
As of February 2014
Sources: Dice, LinkedIn.
What is Hadoop?
Hadoop was created by Doug Cutting and Mike Cafarella.
Hadoop provides a reliable shared storage and analysis
system.
It is designed to scale from a single server to thousands of
machines, with a high degree of fault tolerance.
Hadoop History 
Hadoop Core Components
Core Hadoop has two main systems:
• Hadoop Distributed File System (HDFS): a distributed file system
that stores large amounts of data across multiple nodes in a cluster.
• MapReduce: a distributed programming paradigm used to analyze
the data stored in HDFS.
Hadoop Distributed File System (HDFS)
A given file is broken down into blocks (default = 64 MB in Hadoop 1.x; 128 MB
from Hadoop 2.x onward), and each block is replicated across the cluster
(default replication factor = 3).
Optimized for throughput.
HDFS allows you to put/get/delete files.
Follows the philosophy
"Write Once and Read Multiple Times"
Block replication provides:
- Durability, high availability, and throughput.
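The effect of block size and replication on storage can be worked through with a small calculation. This is a sketch only, using the 64 MB block size and replication factor of 3 quoted on the slide; the function name and the returned keys are illustrative and are not part of any Hadoop API.

```python
import math

BLOCK_SIZE_MB = 64   # default block size quoted on the slide
REPLICATION = 3      # default replication factor quoted on the slide


def block_layout(file_size_mb: int,
                 block_size_mb: int = BLOCK_SIZE_MB,
                 replication: int = REPLICATION) -> dict:
    """Estimate how a file of the given size is split and replicated."""
    if file_size_mb <= 0:
        raise ValueError("file_size_mb must be positive")
    num_blocks = math.ceil(file_size_mb / block_size_mb)
    # The final block holds only the remaining data, not a full block.
    last_block_mb = file_size_mb - (num_blocks - 1) * block_size_mb
    return {
        "blocks": num_blocks,
        "last_block_mb": last_block_mb,
        "block_replicas": num_blocks * replication,
        "raw_storage_mb": file_size_mb * replication,
    }


layout = block_layout(200)
```

For a 200 MB file this gives four blocks (three full blocks and one 8 MB block), twelve block replicas in total, and 600 MB of raw storage across the cluster.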
MapReduce Flow 
MapReduce Framework
MapReduce works by breaking the processing into two phases:
the Map phase and the Reduce phase.
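The two phases can be sketched with the classic word-count example. This is an in-memory illustration in plain Python, not the Hadoop API: the function names are hypothetical, and the grouping step between the phases (done by the framework in a real cluster) is written out explicitly as `shuffle`.

```python
from collections import defaultdict


def map_phase(line: str):
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group all values by key, as the framework does between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: list):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    mapped = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())


counts = word_count(["big data", "Big Hadoop"])
```

Because each line is mapped independently and each key is reduced independently, both phases can in principle be spread across many machines, which is what the framework exploits.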
What we offer… 
Syllabus 
Introduction
a) Big Data
b) Hadoop
Hadoop
a) HDFS
b) MapReduce
Pig
a) Pig 1
b) Pig 2
Hive
a) Hive 1
b) Hive 2
HBase
ZooKeeper
Sqoop
YARN
Project Class
Thank you for watching the Live Demo for Hadoop. 
You can always contact us on: 
Phone : +91 124 4763660 (India) 
Email : contact@easylearning.guru 
Skype Id : easylearning.guru 
Website : www.easylearning.guru 
Your queries are always welcome. 


Big Data Hadoop Training by Easylearning Guru
