Technical Seminar
on
HADOOP TECHNOLOGY
Under the Guidance of
P.V.R.K.MURTHY, M.Tech
Assistant Professor
CONTENTS
What is Hadoop Technology?
Why Hadoop?
Developers of Hadoop Technology
Famous Hadoop users
Hadoop Features
Hadoop Architecture
Core Components of Hadoop
Hadoop High Level Architecture
Hadoop cluster
CONTENTS…
What is HDFS
HDFS – Name Node features
HDFS – Name Node architecture
HDFS – Data Node
Hadoop MapReduce
Benefits of Hadoop
Conclusion
References
HADOOP TECHNOLOGY
What is Hadoop Technology?
•Hadoop is the best-known technology for processing Big Data.
•It is a large-scale batch data processing system.
Why Hadoop?
•Distributed cluster system
•Platform for massively scalable applications
•Enables parallel data processing
Developers of Hadoop Technology:
Michael J. Cafarella
Doug Cutting
Famous Hadoop users
Hadoop Features
•Hadoop Common provides access to the file systems supported by Hadoop
•The Hadoop Common package contains the JAR files and scripts needed to start Hadoop
•The package also provides source code, documentation and a contribution section that includes projects from the Hadoop Community.
HADOOP ARCHITECTURE
Core Components of Hadoop:
Hadoop Distributed File System (HDFS)
MapReduce
What is HDFS?
•Distributed file system
•Traditional hierarchical file organization
•Single namespace for the entire cluster
•Write-once-read-many access model
•Aware of the network topology
Hadoop High Level Architecture
Hadoop cluster
•A small Hadoop cluster includes a single master and multiple worker nodes
Master node:
Job Tracker
Task Tracker
Name Node
Data Node
Slave node:
Data Node
Task Tracker
HDFS – Name Node Features
Metadata in main memory:
•List of files
•List of blocks for each file
•List of Data Nodes for each block
•File attributes, e.g. creation time
•A transaction log records every change in the metadata
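The metadata listed above can be pictured as a few in-memory maps plus an append-only log. The following is a minimal sketch only; the class `ToyNameNode`, its method names and the block IDs are hypothetical and do not correspond to Hadoop's real classes.

```python
import time
from dataclasses import dataclass, field


@dataclass
class FileEntry:
    """Per-file metadata held in memory (illustrative fields only)."""
    blocks: list = field(default_factory=list)             # ordered block IDs
    creation_time: float = field(default_factory=time.time)


class ToyNameNode:
    """Hypothetical stand-in for a Name Node; not Hadoop's API."""

    def __init__(self):
        self.files = {}            # path -> FileEntry (list of files)
        self.block_locations = {}  # block ID -> set of Data Node names
        self.edit_log = []         # every metadata change, in order

    def create_file(self, path):
        self.files[path] = FileEntry()
        self.edit_log.append(("create", path))

    def add_block(self, path, block_id, data_nodes):
        self.files[path].blocks.append(block_id)
        self.block_locations[block_id] = set(data_nodes)
        self.edit_log.append(("add_block", path, block_id))

    def locate(self, path):
        """Return (block ID, sorted Data Node names) for each block of a file."""
        return [(b, sorted(self.block_locations[b]))
                for b in self.files[path].blocks]


nn = ToyNameNode()
nn.create_file("/logs/day1.txt")
nn.add_block("/logs/day1.txt", "blk_1", ["dn1", "dn2", "dn3"])
nn.add_block("/logs/day1.txt", "blk_2", ["dn2", "dn3", "dn4"])
```

A client asking where `/logs/day1.txt` lives would call `nn.locate(...)` and then read each block from one of the returned Data Nodes.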
HDFS – Name Node architecture
[Diagram: primary Name Node (RAM, HDD) and secondary Name Node (RAM, HDD)]
Checkpoint steps performed by the secondary Name Node:
1. Pull transaction log
2. Merge changes
3. Store to HDD
4. Push
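The four checkpoint steps can be sketched roughly as follows. This is an illustration under simplifying assumptions: the namespace image is a plain dict, the log is a list of tuples, and the names `apply_edit` and `checkpoint` are hypothetical rather than Hadoop functions.

```python
def apply_edit(namespace, edit):
    """Apply one logged change to a namespace dict (path -> list of block IDs)."""
    op, path, *args = edit
    if op == "create":
        namespace[path] = []
    elif op == "add_block":
        namespace[path].append(args[0])
    elif op == "delete":
        namespace.pop(path, None)
    return namespace


def checkpoint(fsimage, edit_log):
    """Pull the log, merge it into a copy of the image, and return the new image."""
    pulled = list(edit_log)                            # 1. pull transaction log
    merged = {p: list(b) for p, b in fsimage.items()}  # work on a copy
    for edit in pulled:                                # 2. merge changes
        apply_edit(merged, edit)
    # 3. a real secondary would store `merged` on its own disk here
    return merged                                      # 4. push to the primary


primary_image = {"/a.txt": ["blk_1"]}
primary_log = [
    ("create", "/b.txt"),
    ("add_block", "/b.txt", "blk_2"),
    ("delete", "/a.txt"),
]

primary_image = checkpoint(primary_image, primary_log)
primary_log.clear()  # the primary can start a fresh log once the new image is in place
```

The point of the design is that the primary never has to replay a long log at restart; it loads the most recent merged image instead.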
HDFS – Data Node
•Block server: stores data in the local file system
•Periodic validation of checksums
•Periodically sends a report of all existing blocks to the Name Node
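The Data Node duties above can be illustrated with a small sketch. It assumes blocks are kept in a dict instead of on disk and uses `zlib.crc32` for convenience; the class `ToyDataNode` and its methods are hypothetical, not Hadoop's implementation.

```python
import zlib


class ToyDataNode:
    """Hypothetical stand-in for a Data Node; not Hadoop's API."""

    def __init__(self, name):
        self.name = name
        self.blocks = {}     # block ID -> bytes (stands in for local files)
        self.checksums = {}  # block ID -> CRC32 recorded at write time

    def store(self, block_id, data):
        self.blocks[block_id] = data
        self.checksums[block_id] = zlib.crc32(data)

    def validate(self):
        """Return IDs of blocks whose current checksum no longer matches."""
        return [b for b, data in self.blocks.items()
                if zlib.crc32(data) != self.checksums[b]]

    def block_report(self):
        """List every block held, as would be sent to the Name Node."""
        return {"node": self.name, "blocks": sorted(self.blocks)}


dn = ToyDataNode("dn1")
dn.store("blk_1", b"hello")
dn.store("blk_2", b"world")
dn.blocks["blk_2"] = b"w0rld"  # simulate on-disk corruption
```

Calling `dn.validate()` after the simulated corruption flags `blk_2`, while `dn.block_report()` still lists both blocks, which is why the two checks are separate duties.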
Hadoop MapReduce
Job Tracker:
Splits a job into map and reduce tasks
Schedules tasks on cluster nodes
Task Tracker:
Runs map and reduce tasks and periodically reports to the Job Tracker
MapReduce implementation:
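As a rough illustration of the programming model, the classic word-count job can be simulated in a single process. This sketch does not use Hadoop's API; the function names `map_phase`, `shuffle` and `reduce_phase` are chosen for illustration, and on a real cluster the map and reduce calls would run as separate tasks on different nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit (word, 1) for each word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all emitted values by key, as happens between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: sum the counts for one word."""
    return key, sum(values)


def word_count(lines):
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = word_count(["Hadoop stores data", "Hadoop processes data"])
```

Because each `map_phase` call depends only on its own line and each `reduce_phase` call only on its own key, both phases can be spread across many machines.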
Benefits of Hadoop
•Cost-saving, efficient and reliable data processing
•Provides an economically scalable solution
•Stores and processes large amounts of data
•Acts as a data grid operating system
•It is deployed on industry-standard servers rather than expensive specialized data storage systems.
•Parallel processing of huge amounts of data across inexpensive, industry-standard servers.
CONCLUSION
Why commodity hardware?
•It is cheaper
•Hadoop is designed to tolerate faults
Why HDFS?
•Network bandwidth vs. seek latency
Why the MapReduce programming model?
•Parallel programming
•Large data sets
•Moving computation to data
•Single compute + data cluster
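The idea of moving computation to data can be sketched as a scheduling preference: run a task on a node that already holds the block it needs, and fall back to any free node otherwise. The function `pick_node` and the node names below are hypothetical; a real scheduler also weighs rack locality and load.

```python
def pick_node(block_id, block_locations, free_nodes):
    """Prefer a free node that already stores the block; otherwise take any free node.

    Returns (node name, True if the choice is data-local).
    """
    holders = block_locations.get(block_id, ())
    local = [n for n in free_nodes if n in holders]
    if local:
        return local[0], True
    return free_nodes[0], False


block_locations = {"blk_1": {"dn1", "dn2"}, "blk_2": {"dn3"}}
free_nodes = ["dn2", "dn4"]

choice_1 = pick_node("blk_1", block_locations, free_nodes)  # dn2 holds blk_1
choice_2 = pick_node("blk_2", block_locations, free_nodes)  # no free holder
```

In the second case the block has to travel over the network, which is the cost the locality preference tries to avoid.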
REFERENCES
•Apache Hadoop (http://hadoop.apache.org)
•Hadoop on Wikipedia (http://en.wikipedia.org/wiki/Hadoop)
•Cloudera - Apache Hadoop for the Enterprise (http://www.cloudera.com)