SlideShare a Scribd company logo
Lecture 2 part 2
 What is Hadoop?, What Hadoop is not?, and Hadoop Assumptions.
 What is Rack, Cluster, Nodes and Commodity Hardware?
 HDFS - Hadoop Distributed File System
 Using HDFS commands
 MapReduce
 Higher-level languages over Hadoop: Pig and Hive
 HBase – Overview
 HCatalog
 What is Hadoop and its components?
 What is the commodity server/Hardware?
 Why HDFS ?
 What is the responsibility of NameNode in HDFS?
 What is Fault Tolerance?
 What is the default replication factor in HDFS?
 What is the heartbeat in HDFS?
 What are JobTracker and TaskTracker?
 Why MapReduce programming model?
 Where do we have Data Locality in MapReduce?
 Why we need to use Pig and Hive?
 What is the difference between Hbase and HCatalog
Lecture 2 part 2
Lecture 2 part 2
Lecture 2 part 2
Lecture 2 part 2
 Download Oracle VM VirtualBox
 Install Oracle VM VirtualBox
 Run Oracle VM VirtualBox
1
23
• Download Hortonworks
• Install Hortonworks
• Import Hortonworks inside
Oracle VM VirtualBox
• Run Hortonworks
1 2
3
4
5
6
• Download Hortonworks
• Install Hortonworks
• Import Hortonworks inside Oracle VM VirtualBox
• Run Hortonworks
1
2 3
1 2
3
4
5
6
7
Lecture 2 part 2

More Related Content

PPT
PPTX
Hadoop
ODP
HDFS presented by VIJAY
PDF
Maintainable cloud architecture_of_hadoop
PPTX
How Hadoop Exploits Data Locality
PPTX
Asbury Hadoop Overview
PDF
Hadoop for sys_admin
PDF
Foss evolution cos-boudnik
Hadoop
HDFS presented by VIJAY
Maintainable cloud architecture_of_hadoop
How Hadoop Exploits Data Locality
Asbury Hadoop Overview
Hadoop for sys_admin
Foss evolution cos-boudnik

What's hot (20)

PDF
Hadoop for System Administrators
PDF
Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...
PDF
Hadoop description
PPT
Hadoop technology
PPTX
HBaseCon 2015: HBase and Spark
PPTX
Building Big Data Applications using Spark, Hive, HBase and Kafka
PPTX
Backup and Disaster Recovery in Hadoop
PPTX
La big datacamp2014_vikram_dixit
PDF
Tales from the Cloudera Field
PDF
Hadoop ecosystem; J.Ayeesha parveen 2 nd M.sc., computer science Bon Secours...
PPTX
Pptx present
PDF
Hadoop at ayasdi
PDF
Hw09 Clouderas Distribution For Hadoop
PPTX
Cloudera
PPTX
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
PPTX
HDInsight for Architects
PPTX
Hadoop Technology
PPT
Hadoop
PDF
Introduction to Hadoop Ecosystem
PPTX
Hadoop: The elephant in the room
Hadoop for System Administrators
Apache Kudu Fast Analytics on Fast Data (Hadoop / Spark Conference Japan 2016...
Hadoop description
Hadoop technology
HBaseCon 2015: HBase and Spark
Building Big Data Applications using Spark, Hive, HBase and Kafka
Backup and Disaster Recovery in Hadoop
La big datacamp2014_vikram_dixit
Tales from the Cloudera Field
Hadoop ecosystem; J.Ayeesha parveen 2 nd M.sc., computer science Bon Secours...
Pptx present
Hadoop at ayasdi
Hw09 Clouderas Distribution For Hadoop
Cloudera
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
HDInsight for Architects
Hadoop Technology
Hadoop
Introduction to Hadoop Ecosystem
Hadoop: The elephant in the room
Ad

Viewers also liked (18)

PDF
Lecture 2 part 3
PPTX
Anas bahkali 2
PPTX
Cyber-infrastructure Presentation 2015
PPTX
DATA BLENDING
PPTX
Relationship between cloud computing and big data
PDF
Jan 2012 HUG: HCatalog
PDF
Hadoop / Spark Conference Japan 2016 ご挨拶・Hadoopを取り巻く環境
PPTX
Presentation of Kent Park
PDF
The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...
PDF
2013 feb 20_thug_h_catalog
PPTX
Hadoop and rdbms with sqoop
PPTX
Future of HCatalog - Hadoop Summit 2012
PDF
IYAD KIWAN CV (1)
PPTX
The King: Jesus Ministry
PDF
Plazas de tipo 2
PDF
tecnicas de muestreo
PDF
8 принципов создания лендинга, которые обеспечат супер конверсию
PDF
What's On September to December 2015
Lecture 2 part 3
Anas bahkali 2
Cyber-infrastructure Presentation 2015
DATA BLENDING
Relationship between cloud computing and big data
Jan 2012 HUG: HCatalog
Hadoop / Spark Conference Japan 2016 ご挨拶・Hadoopを取り巻く環境
Presentation of Kent Park
The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...
2013 feb 20_thug_h_catalog
Hadoop and rdbms with sqoop
Future of HCatalog - Hadoop Summit 2012
IYAD KIWAN CV (1)
The King: Jesus Ministry
Plazas de tipo 2
tecnicas de muestreo
8 принципов создания лендинга, которые обеспечат супер конверсию
What's On September to December 2015
Ad

Similar to Lecture 2 part 2 (20)

PPTX
Lecture 2 Hadoop.pptx
PDF
Big Data Hoopla Simplified - TDWI Memphis 2014
PPTX
Big Data Training in Amritsar
PPTX
Big Data Training in Mohali
PPTX
Big Data Training in Ludhiana
PPTX
Hadoop online training
PPTX
Getting started big data
PPTX
SQL Server 2012 and Big Data
PDF
Introduction To Hadoop Administration - SpringPeople
PPT
Presentation
PPTX
Big Data and Hadoop Components
PPTX
Introduction to Apache Hadoop Ecosystem
DOCX
Hadoop Tutorial for Beginners
PPTX
Overview of Big data, Hadoop and Microsoft BI - version1
PPTX
Overview of big data & hadoop version 1 - Tony Nguyen
PPT
Hadoop presentation
PDF
What is hadoop
PPTX
Hadoop_arunam_ppt
PPT
Hadoop training by keylabs
DOCX
Hadoop online training by certified trainer
Lecture 2 Hadoop.pptx
Big Data Hoopla Simplified - TDWI Memphis 2014
Big Data Training in Amritsar
Big Data Training in Mohali
Big Data Training in Ludhiana
Hadoop online training
Getting started big data
SQL Server 2012 and Big Data
Introduction To Hadoop Administration - SpringPeople
Presentation
Big Data and Hadoop Components
Introduction to Apache Hadoop Ecosystem
Hadoop Tutorial for Beginners
Overview of Big data, Hadoop and Microsoft BI - version1
Overview of big data & hadoop version 1 - Tony Nguyen
Hadoop presentation
What is hadoop
Hadoop_arunam_ppt
Hadoop training by keylabs
Hadoop online training by certified trainer

Recently uploaded (20)

PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
Computing-Curriculum for Schools in Ghana
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Complications of Minimal Access Surgery at WLH
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
RMMM.pdf make it easy to upload and study
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
Lesson notes of climatology university.
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
A systematic review of self-coping strategies used by university students to ...
PPTX
master seminar digital applications in india
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
human mycosis Human fungal infections are called human mycosis..pptx
FourierSeries-QuestionsWithAnswers(Part-A).pdf
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Computing-Curriculum for Schools in Ghana
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Microbial disease of the cardiovascular and lymphatic systems
Complications of Minimal Access Surgery at WLH
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
RMMM.pdf make it easy to upload and study
Pharmacology of Heart Failure /Pharmacotherapy of CHF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Lesson notes of climatology university.
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Supply Chain Operations Speaking Notes -ICLT Program
A systematic review of self-coping strategies used by university students to ...
master seminar digital applications in india
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
102 student loan defaulters named and shamed – Is someone you know on the list?

Lecture 2 part 2

  • 2.  What is Hadoop?, What Hadoop is not?, and Hadoop Assumptions.  What is Rack, Cluster, Nodes and Commodity Hardware?  HDFS - Hadoop Distributed File System  Using HDFS commands  MapReduce  Higher-level languages over Hadoop: Pig and Hive  HBase – Overview  HCatalog
  • 3.  What is Hadoop and its components?  What is the commodity server/Hardware?  Why HDFS ?  What is the responsibility of NameNode in HDFS?  What is Fault Tolerance?  What is the default replication factor in HDFS?  What is the heartbeat in HDFS?  What are JobTracker and TaskTracker?  Why MapReduce programming model?  Where do we have Data Locality in MapReduce?  Why we need to use Pig and Hive?  What is the difference between Hbase and HCatalog
  • 8.  Download Oracle VM VirtualBox  Install Oracle VM VirtualBox  Run Oracle VM VirtualBox 1 23
  • 9. • Download Hortonworks • Install Hortonworks • Import Hortonworks inside Oracle VM VirtualBox • Run Hortonworks 1 2 3 4 5 6 • Download Hortonworks • Install Hortonworks • Import Hortonworks inside Oracle VM VirtualBox • Run Hortonworks
  • 10. 1 2 3