SlideShare a Scribd company logo
APACHE HADOOP
W W W . S C H O O L O F C O R E A I . C O M
SCAI
Hadoop is an open source software programming
framework for storing a large amount of data and
performing the computation. Its framework is based on
Java programming with some native code in C and
shell scripts.
Hadoop is an open-source software framework that is
used for storing and processing large amounts of data
in a distributed computing environment. It is designed
to handle big data and is based on the MapReduce
programming model, which allows for the parallel
processing of large datasets.
SWEEP -->
APACHE HADOOP
SCAI
MAIN COMPONENTS
There are Four types of Main Components:-
• Hadoop Distributed File System (HDFS)
• Mad-Reduce
• Yet Another Resource Negotiator (YARN)
• Hadoop Common
SWIPE-->
SCAI
HDFS
• Hadoop Distributed File System is the
storage component of Hadoop, which
allows for the storage of large amounts of
data across multiple machines. It is
designed to work with commodity
hardware, which makes it cost-effective.
SWIPE-->
SCAI
This is the resource management
component of Hadoop, which
manages the allocation of
resources (such as CPU and
memory) for processing the data
stored in HDFS.
YARN
SWIPE-->
SCAI
MapReduce is a programming model
and processing engine for distributed
computing on large datasets. It consists
of two main tasks: the Map task, which
processes and filters data, and the
Reduce task, which aggregates and
summarizes the results.
MAPREDUCE
SWIPE-->
SCAI
HADOOP COMMON
• This includes the libraries, utilities,
and APIs that support the Hadoop
modules. It provides a foundation
for other Hadoop-related projects.
SWIPE-->
+91-9893978249
www.schoolofcoreai.com
H No 145 Green wood City Sec 40 Gurgaon,
122003
info@schoolofcoreai.com
CONTACT US
THANK YOU
S C H O O L O F C O R E A I . C O M

More Related Content

PPTX
Apache hadoop introduction and architecture
PPTX
Brief Introduction about Hadoop and Core Services.
PDF
What is Apache Hadoop and its ecosystem?
PPT
Introduction to Apache hadoop
PDF
Introduction To Hadoop Administration - SpringPeople
PPTX
PPTX
Getting started big data
PDF
BIGDATA ppts
Apache hadoop introduction and architecture
Brief Introduction about Hadoop and Core Services.
What is Apache Hadoop and its ecosystem?
Introduction to Apache hadoop
Introduction To Hadoop Administration - SpringPeople
Getting started big data
BIGDATA ppts

Similar to Apache hadoop (20)

PPTX
Big Data and Hadoop Guide
PPT
unit-3bda-230421082621-d2b7d921.ppthjghh
PPTX
Big Data Technology Stack : Nutshell
PPTX
Hadoop and their in big data analysis EcoSystem.pptx
PPT
Hadoop distributed file system (HDFS), HDFS concept
PPT
Unit-3_BDA.ppt
PPTX
hadoop-ecosystem-ppt.pptx
PPTX
Get started with hadoop hive hive ql languages
PPT
Hadoop in action
PPT
Hadoop
PPTX
Hadoop And Their Ecosystem ppt
PPTX
Hadoop And Their Ecosystem
DOCX
Hadoop Tutorial for Beginners
PPTX
Big Data and Hadoop Components
PDF
2.1-HADOOP.pdf
PPTX
An Introduction-to-Hive and its Applications and Implementations.pptx
PPTX
Hadoop vs Apache Spark
PDF
Big Data Tools & Libraries
DOCX
Hadoop map reduce
Big Data and Hadoop Guide
unit-3bda-230421082621-d2b7d921.ppthjghh
Big Data Technology Stack : Nutshell
Hadoop and their in big data analysis EcoSystem.pptx
Hadoop distributed file system (HDFS), HDFS concept
Unit-3_BDA.ppt
hadoop-ecosystem-ppt.pptx
Get started with hadoop hive hive ql languages
Hadoop in action
Hadoop
Hadoop And Their Ecosystem ppt
Hadoop And Their Ecosystem
Hadoop Tutorial for Beginners
Big Data and Hadoop Components
2.1-HADOOP.pdf
An Introduction-to-Hive and its Applications and Implementations.pptx
Hadoop vs Apache Spark
Big Data Tools & Libraries
Hadoop map reduce

Recently uploaded (20)

PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
AI in Product Development-omnex systems
PPTX
Essential Infomation Tech presentation.pptx
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PPTX
L1 - Introduction to python Backend.pptx
PDF
medical staffing services at VALiNTRY
PDF
Nekopoi APK 2025 free lastest update
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
System and Network Administration Chapter 2
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PPTX
history of c programming in notes for students .pptx
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
AI in Product Development-omnex systems
Essential Infomation Tech presentation.pptx
VVF-Customer-Presentation2025-Ver1.9.pptx
L1 - Introduction to python Backend.pptx
medical staffing services at VALiNTRY
Nekopoi APK 2025 free lastest update
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Odoo Companies in India – Driving Business Transformation.pdf
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
System and Network Administration Chapter 2
Wondershare Filmora 15 Crack With Activation Key [2025
2025 Textile ERP Trends: SAP, Odoo & Oracle
Upgrade and Innovation Strategies for SAP ERP Customers
How to Choose the Right IT Partner for Your Business in Malaysia
history of c programming in notes for students .pptx
Adobe Illustrator 28.6 Crack My Vision of Vector Design
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
CHAPTER 2 - PM Management and IT Context
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf

Apache hadoop

  • 1. APACHE HADOOP W W W . S C H O O L O F C O R E A I . C O M
  • 2. SCAI Hadoop is an open source software programming framework for storing a large amount of data and performing the computation. Its framework is based on Java programming with some native code in C and shell scripts. Hadoop is an open-source software framework that is used for storing and processing large amounts of data in a distributed computing environment. It is designed to handle big data and is based on the MapReduce programming model, which allows for the parallel processing of large datasets. SWEEP --> APACHE HADOOP
  • 3. SCAI MAIN COMPONENTS There are Four types of Main Components:- • Hadoop Distributed File System (HDFS) • Mad-Reduce • Yet Another Resource Negotiator (YARN) • Hadoop Common SWIPE-->
  • 4. SCAI HDFS • Hadoop Distributed File System is the storage component of Hadoop, which allows for the storage of large amounts of data across multiple machines. It is designed to work with commodity hardware, which makes it cost-effective. SWIPE-->
  • 5. SCAI This is the resource management component of Hadoop, which manages the allocation of resources (such as CPU and memory) for processing the data stored in HDFS. YARN SWIPE-->
  • 6. SCAI MapReduce is a programming model and processing engine for distributed computing on large datasets. It consists of two main tasks: the Map task, which processes and filters data, and the Reduce task, which aggregates and summarizes the results. MAPREDUCE SWIPE-->
  • 7. SCAI HADOOP COMMON • This includes the libraries, utilities, and APIs that support the Hadoop modules. It provides a foundation for other Hadoop-related projects. SWIPE-->
  • 8. +91-9893978249 www.schoolofcoreai.com H No 145 Green wood City Sec 40 Gurgaon, 122003 info@schoolofcoreai.com CONTACT US
  • 9. THANK YOU S C H O O L O F C O R E A I . C O M