SlideShare a Scribd company logo
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Introduction :
Big Data and Hadoop training course is designed to provide knowledge and skills to
become a successful Hadoop Developer. In-depth knowledge of concepts such as
Hadoop Distributed File System, Hadoop Cluster, Map-Reduce, Hbase Zookeeper etc.
will be covered in the course.
Reason To Attend :
After the completion of the Big Data and Hadoop Course at KPM, you
should be able to:
• Master the concepts of Hadoop Distributed File System and
MapReduce framework
• Setup a Hadoop Cluster
• Understand Data Loading Techniques using Sqoop and Flume
• Program in MapReduce (Both MRv1 and MRv2)
• Learn to write Complex MapReduce programs
• Program in YARN (MRv2)
• Perform Data Analytics using Pig and Hive
• Implement HBase, MapReduce Integration, Advanced Usage
and Advanced Indexing
• Have a good understanding of ZooKeeper service
• New features in Hadoop 2.0 -- YARN, HDFS Federation,
NameNode High Availability
• Implement best Practices for Hadoop Development and
Debugging
• Implement a Hadoop Project
• Work on a Real Life Project on Big Data Analytics and gain
Hands on Project Experience
Who should attend :
This course is designed for
professionals aspiring to make a
career in Big Data Analytics
using Hadoop Framework.
Software Professionals,
Analytics Professionals, ETL
developers, Project Managers,
Testing Professionals are the
key beneficiaries of this course.
Other professionals who are
looking forward to acquire a
solid foundation of Hadoop
Architecture can also opt for this
course.
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Course Content :
Big Data Economy …………………………………………………………… 1.5 Hrs.
• What is Big Data
• Characteristics of Big Data
• How did data become so Big
• Why should you care about Big Data
• Uses Cases of Big Data Analysis
• What are possible options for analyzing big data
• Traditional Distributed Systems
• Problem with traditional Distributed systems
Hadoop Introduction………………………………………………………… 1.5 Hrs.
• What is Hadoop
• History of Hadoop
• How does Hadoop solve Big Data Problem
• Components of Hadoop
• Hadoop Flavours
Hadoop Distributed File System Part 1…...……………………………… 2 Hrs
• HDFS Architecture
• HDFS Internals
• HDFS Use Cases
• HDFS Daemons
• Files and Blocks
• Namenode Memory Concerns
• Secondary Namenode
• HDFS Access Options
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Installing Hadoop (Single Node)…......……..……….…………………… 1 Hrs
• Installation Overview
• Hadoop Installation
• Hadoop Daemons Stuff
Advanced Hadoop Distributed File System Concepts………….…… 2 Hrs.
• HDFS Workshops
• HDFS API
• How to use Configuration class
• Using HDFS in MapReduce
• Using HDFS Programmatically
• HDFS Permission and Security
• Additional HDFS Tasks
• Rebalancing Blocks
• Copying Large Sets of Files
• Decommissioning Nodes
• Verifying File System Health
• Rack Awareness
• HDFS Web Interface
Map-Reduce Workshops………...…..……………………………………....… 5 Hrs
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Introduction to MapReduce ……….…………………………………..…… 3 Hrs
• MapReduce Basics
• Functional Programming Concepts
• List Processing
• Mapping Lists
• Reducing Lists
• Putting them Together in MapReduce
• An Example Application: Word Count
• Understanding the Driver
• Understanding the Mapper
• Understanding the Reducer
• MapReduce Data Flow
• A Closer look
• Additional MapReduce Functionality
• Fault Tolerance
Advanced MapReduce Concepts…..……………………………………..…. 2 Hrs
• Understanding Combiners
• Understanding Partitioners
• Understanding input formats
• Understanding output formats
• Distributed Cache
• Understanding Counters
• More Tips
• Chaining Jobs
• Listing and Killing Jobs
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Cloud Computing Overview………..…………………………...…….....…… 1 Hrs
• Cloud Computing Introduction
• SaaS/PaaS/IaaS
• Characteristics
Installing Hadoop (Multi Node)………..………………………..............…… 1 Hrs
• Cluster Configurations
• Configuring Masters
• Configuring Slaves
• Cluster Stuff
Hadoop Ecosystem Pig ….………………………………………………………. 1 Hrs
• Pig Programs structure and Execution Process
• Joins
• Filtering
• Group and Co-Group
• Schema merging and redefining schema
• Pig functions
Hadoop Ecosystem Hive…………………………………………………………. 2 Hrs
• Motivation and Understanding Hive
• Using Hive Command line interface
• Data types and File Formats
• Basic DDL operations
• Schema Design
• An Example of Pig and Hive
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Hadoop Ecosystem HBase and Zookeeper………….………………………. 1 Hrs
• HBase Overview
• HBase Architecture
• HBase Installation
• HBase Admin : Test
• HBase Client: Client Loading Overview
• Fully Distributed HBase Configuration
• Loading HBase
• HBase Data Access
Hadoop Ecosystem Sqoop …………………………………………………. 1 Hrs
• Sqoop Overview
• Sqoop Installation
• Importing Data
• Exporting Data
Hadoop Ecosystem Oozie………………………………………………..…. 1 Hrs
• Oozie overview
• Oozie Features
• Bundle
• Scalability
• Usability
• Oozie challenges
Hadoop Ecosystem Apache Flume……………….…………………..……. 1 Hrs
• Apache Flume Overview
• How it Works
• Flume Connection with HDFS
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Hadoop Version 2 Concepts …………………….………………………….. 2 Hrs
• Yarn
• Hadoop Federation
• Authentication in Hadoop
• High Availability
Administration Refresher……………………………………………………… 1 Hrs
• Setting up Hadoop Cluster – Considerations
• Most Important Configurations
• Installation Options
• Scheduling in Hadoop
• FIFO Scheduler
• FAIR Scheduler
Building a Web Log Analysis POC using MapReduce..…….……….…... 2 Hrs
• Designing Structures for POC
• With MapReduce develop code
• Push data using Flume into HDFS
• Run MapReduce Code
• Analyse the Output
Real Life Project and POC…………………………………….……….....……….... 6 Hrs
 
For	
  More	
  Details	
  :	
  info@kpmlearnings.com	
  	
  /	
  +91	
  8041705679	
  /	
  	
  	
   Website:	
  kpmlearnings.com	
  	
  	
  
Training Methodlogy :
- 80% training is practical
- The duration of course is 36 - 40 Hrs
- Individual attention is provided to all candidates
- Training involves multiple workshops to explain the practical concepts
- Regular assignments will be given to the candidates
- Study material, PPTs, Project and POC codes, etc. will be given to the candidates
- Course involves 3 Proof Of Concepts
- Course involves a Real Life Project
- Trainer will assist you for interview preparation
About The Organizer :
KPM Learning Solutions – Shaping your Future
KPI is one-stop learning solutions that offer a wide portfolio of learning and consulting services. We
provide tailored, practical, in-house and open house learning solutions in sync with the recent industrial
and technological trends.
We design, develop and deliver world-class academic and highly innovative learning programs in IT
and Mobility, Leadership & Management and other related areas world across.
“KPM” denotes the success factors and performance measurement which is directed towards the
strategic goals of any organization and few sets of key skills.
Our aim is to upgrade and set those key skills that are result oriented and bring organizational
excellence by all means.
You can log on to – www.kpmlearnings.com

More Related Content

PDF
Hadoop online training
PPTX
Dawn of YARN @ Rocket Fuel
PPTX
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
PDF
Hadoop Summit Amsterdam 2014: Capacity Planning In Multi-tenant Hadoop Deploy...
PDF
Hadoop 2 - More than MapReduce
PDF
Troubleshooting Hadoop: Distributed Debugging
PDF
Hadoop 2 - Beyond MapReduce
PPTX
Big Data Performance and Capacity Management
Hadoop online training
Dawn of YARN @ Rocket Fuel
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Hadoop Summit Amsterdam 2014: Capacity Planning In Multi-tenant Hadoop Deploy...
Hadoop 2 - More than MapReduce
Troubleshooting Hadoop: Distributed Debugging
Hadoop 2 - Beyond MapReduce
Big Data Performance and Capacity Management

What's hot (20)

PDF
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
PPTX
Back to School - St. Louis Hadoop Meetup September 2016
PDF
hadoop_module6
PPT
Hadoop_Its_Not_Just_Internal_Storage_V14
PDF
Best hadoop-online-training
PPTX
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
PPTX
Big Data and Hadoop in Cloud - Leveraging Amazon EMR
PDF
Introduction to Hadoop
PDF
Philly DB MapR Overview
PDF
Hadoop 31-frequently-asked-interview-questions
PPTX
Drill dchug-29 nov2012
PPTX
2015 GHC Presentation - High Availability and High Frequency Big Data Analytics
PDF
Apache Spark & Hadoop
PPT
Hadoop applicationarchitectures
PPT
Advanced Hadoop Tuning and Optimization - Hadoop Consulting
PDF
HUG slides on NFS and ODBC
PPTX
Challenges & Capabilites in Managing a MapR Cluster by David Tucker
ODP
Training
PPTX
Hadoop Interview Questions and Answers
PDF
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
Bikas saha:the next generation of hadoop– hadoop 2 and yarn
Back to School - St. Louis Hadoop Meetup September 2016
hadoop_module6
Hadoop_Its_Not_Just_Internal_Storage_V14
Best hadoop-online-training
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
Big Data and Hadoop in Cloud - Leveraging Amazon EMR
Introduction to Hadoop
Philly DB MapR Overview
Hadoop 31-frequently-asked-interview-questions
Drill dchug-29 nov2012
2015 GHC Presentation - High Availability and High Frequency Big Data Analytics
Apache Spark & Hadoop
Hadoop applicationarchitectures
Advanced Hadoop Tuning and Optimization - Hadoop Consulting
HUG slides on NFS and ODBC
Challenges & Capabilites in Managing a MapR Cluster by David Tucker
Training
Hadoop Interview Questions and Answers
The Search Is Over: Integrating Solr and Hadoop in the Same Cluster to Simpli...
Ad

Similar to Learn Hadoop at your Leisure time (20)

PPTX
Big Data and Hadoop Training in Bangalore by myTectra
PDF
Hadoop course content
PPTX
Best Hadoop Training in Bangalore - TIB Academy
PDF
Datascience Training with Hadoop, Python Machine Learning & Scala, Spark
DOCX
Hadoop admin online training
PPTX
Hadoop Online Training | Online Hadoop Training certification in India
PDF
Big data analytics_using_hadoop
PDF
Hadoop_Architect__eVenkat
DOCX
PDF
Practical Hadoop Big Data Training Course by Certified Architect
DOCX
Hadoop online training in india
PDF
Hadoop content
PDF
Big Data Hadoop Training Course
PDF
Open-BDA - Big Data Hadoop Developer Training 10th & 11th June
PDF
Hadoop course content Syed Academy
PDF
Hadoop 2.0-development
PDF
Hadoop 80hr v1.0
PPT
Hadoop course content @ a1 trainingss
PPTX
Big data and hadoop product page
PPTX
Big data hadoop
Big Data and Hadoop Training in Bangalore by myTectra
Hadoop course content
Best Hadoop Training in Bangalore - TIB Academy
Datascience Training with Hadoop, Python Machine Learning & Scala, Spark
Hadoop admin online training
Hadoop Online Training | Online Hadoop Training certification in India
Big data analytics_using_hadoop
Hadoop_Architect__eVenkat
Practical Hadoop Big Data Training Course by Certified Architect
Hadoop online training in india
Hadoop content
Big Data Hadoop Training Course
Open-BDA - Big Data Hadoop Developer Training 10th & 11th June
Hadoop course content Syed Academy
Hadoop 2.0-development
Hadoop 80hr v1.0
Hadoop course content @ a1 trainingss
Big data and hadoop product page
Big data hadoop
Ad

Recently uploaded (20)

PDF
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PDF
HVAC Specification 2024 according to central public works department
PPTX
Introduction to Building Materials
PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
Empowerment Technology for Senior High School Guide
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PDF
Hazard Identification & Risk Assessment .pdf
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PDF
IGGE1 Understanding the Self1234567891011
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PPTX
Computer Architecture Input Output Memory.pptx
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
HVAC Specification 2024 according to central public works department
Introduction to Building Materials
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
What if we spent less time fighting change, and more time building what’s rig...
Empowerment Technology for Senior High School Guide
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
Paper A Mock Exam 9_ Attempt review.pdf.
AI-driven educational solutions for real-life interventions in the Philippine...
Hazard Identification & Risk Assessment .pdf
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
IGGE1 Understanding the Self1234567891011
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
Computer Architecture Input Output Memory.pptx
Unit 4 Computer Architecture Multicore Processor.pptx
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf

Learn Hadoop at your Leisure time

  • 1.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Introduction : Big Data and Hadoop training course is designed to provide knowledge and skills to become a successful Hadoop Developer. In-depth knowledge of concepts such as Hadoop Distributed File System, Hadoop Cluster, Map-Reduce, Hbase Zookeeper etc. will be covered in the course. Reason To Attend : After the completion of the Big Data and Hadoop Course at KPM, you should be able to: • Master the concepts of Hadoop Distributed File System and MapReduce framework • Setup a Hadoop Cluster • Understand Data Loading Techniques using Sqoop and Flume • Program in MapReduce (Both MRv1 and MRv2) • Learn to write Complex MapReduce programs • Program in YARN (MRv2) • Perform Data Analytics using Pig and Hive • Implement HBase, MapReduce Integration, Advanced Usage and Advanced Indexing • Have a good understanding of ZooKeeper service • New features in Hadoop 2.0 -- YARN, HDFS Federation, NameNode High Availability • Implement best Practices for Hadoop Development and Debugging • Implement a Hadoop Project • Work on a Real Life Project on Big Data Analytics and gain Hands on Project Experience Who should attend : This course is designed for professionals aspiring to make a career in Big Data Analytics using Hadoop Framework. Software Professionals, Analytics Professionals, ETL developers, Project Managers, Testing Professionals are the key beneficiaries of this course. Other professionals who are looking forward to acquire a solid foundation of Hadoop Architecture can also opt for this course.
  • 2.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Course Content : Big Data Economy …………………………………………………………… 1.5 Hrs. • What is Big Data • Characteristics of Big Data • How did data become so Big • Why should you care about Big Data • Uses Cases of Big Data Analysis • What are possible options for analyzing big data • Traditional Distributed Systems • Problem with traditional Distributed systems Hadoop Introduction………………………………………………………… 1.5 Hrs. • What is Hadoop • History of Hadoop • How does Hadoop solve Big Data Problem • Components of Hadoop • Hadoop Flavours Hadoop Distributed File System Part 1…...……………………………… 2 Hrs • HDFS Architecture • HDFS Internals • HDFS Use Cases • HDFS Daemons • Files and Blocks • Namenode Memory Concerns • Secondary Namenode • HDFS Access Options
  • 3.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Installing Hadoop (Single Node)…......……..……….…………………… 1 Hrs • Installation Overview • Hadoop Installation • Hadoop Daemons Stuff Advanced Hadoop Distributed File System Concepts………….…… 2 Hrs. • HDFS Workshops • HDFS API • How to use Configuration class • Using HDFS in MapReduce • Using HDFS Programmatically • HDFS Permission and Security • Additional HDFS Tasks • Rebalancing Blocks • Copying Large Sets of Files • Decommissioning Nodes • Verifying File System Health • Rack Awareness • HDFS Web Interface Map-Reduce Workshops………...…..……………………………………....… 5 Hrs
  • 4.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Introduction to MapReduce ……….…………………………………..…… 3 Hrs • MapReduce Basics • Functional Programming Concepts • List Processing • Mapping Lists • Reducing Lists • Putting them Together in MapReduce • An Example Application: Word Count • Understanding the Driver • Understanding the Mapper • Understanding the Reducer • MapReduce Data Flow • A Closer look • Additional MapReduce Functionality • Fault Tolerance Advanced MapReduce Concepts…..……………………………………..…. 2 Hrs • Understanding Combiners • Understanding Partitioners • Understanding input formats • Understanding output formats • Distributed Cache • Understanding Counters • More Tips • Chaining Jobs • Listing and Killing Jobs
  • 5.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Cloud Computing Overview………..…………………………...…….....…… 1 Hrs • Cloud Computing Introduction • SaaS/PaaS/IaaS • Characteristics Installing Hadoop (Multi Node)………..………………………..............…… 1 Hrs • Cluster Configurations • Configuring Masters • Configuring Slaves • Cluster Stuff Hadoop Ecosystem Pig ….………………………………………………………. 1 Hrs • Pig Programs structure and Execution Process • Joins • Filtering • Group and Co-Group • Schema merging and redefining schema • Pig functions Hadoop Ecosystem Hive…………………………………………………………. 2 Hrs • Motivation and Understanding Hive • Using Hive Command line interface • Data types and File Formats • Basic DDL operations • Schema Design • An Example of Pig and Hive
  • 6.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Hadoop Ecosystem HBase and Zookeeper………….………………………. 1 Hrs • HBase Overview • HBase Architecture • HBase Installation • HBase Admin : Test • HBase Client: Client Loading Overview • Fully Distributed HBase Configuration • Loading HBase • HBase Data Access Hadoop Ecosystem Sqoop …………………………………………………. 1 Hrs • Sqoop Overview • Sqoop Installation • Importing Data • Exporting Data Hadoop Ecosystem Oozie………………………………………………..…. 1 Hrs • Oozie overview • Oozie Features • Bundle • Scalability • Usability • Oozie challenges Hadoop Ecosystem Apache Flume……………….…………………..……. 1 Hrs • Apache Flume Overview • How it Works • Flume Connection with HDFS
  • 7.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Hadoop Version 2 Concepts …………………….………………………….. 2 Hrs • Yarn • Hadoop Federation • Authentication in Hadoop • High Availability Administration Refresher……………………………………………………… 1 Hrs • Setting up Hadoop Cluster – Considerations • Most Important Configurations • Installation Options • Scheduling in Hadoop • FIFO Scheduler • FAIR Scheduler Building a Web Log Analysis POC using MapReduce..…….……….…... 2 Hrs • Designing Structures for POC • With MapReduce develop code • Push data using Flume into HDFS • Run MapReduce Code • Analyse the Output Real Life Project and POC…………………………………….……….....……….... 6 Hrs
  • 8.   For  More  Details  :  info@kpmlearnings.com    /  +91  8041705679  /       Website:  kpmlearnings.com       Training Methodlogy : - 80% training is practical - The duration of course is 36 - 40 Hrs - Individual attention is provided to all candidates - Training involves multiple workshops to explain the practical concepts - Regular assignments will be given to the candidates - Study material, PPTs, Project and POC codes, etc. will be given to the candidates - Course involves 3 Proof Of Concepts - Course involves a Real Life Project - Trainer will assist you for interview preparation About The Organizer : KPM Learning Solutions – Shaping your Future KPI is one-stop learning solutions that offer a wide portfolio of learning and consulting services. We provide tailored, practical, in-house and open house learning solutions in sync with the recent industrial and technological trends. We design, develop and deliver world-class academic and highly innovative learning programs in IT and Mobility, Leadership & Management and other related areas world across. “KPM” denotes the success factors and performance measurement which is directed towards the strategic goals of any organization and few sets of key skills. Our aim is to upgrade and set those key skills that are result oriented and bring organizational excellence by all means. You can log on to – www.kpmlearnings.com