SlideShare a Scribd company logo
A K H I L G O Y A L
1 5 C S U 0 1 6
Summer Intenship
Report
About Company
 LinuxWorld Informatics Pvt Ltd is an ISO Certified
company which is dedicated to offering a
comprehensive set of most useful open source and
Commercial training programmes today’s Industry
demands! . It’s main office is in Bangalore.
The Structure of the organization is very simple. It is
fully governed by young and Energetic Technocrats,
dedicated to Open Source technologies and Linux
promotion. The team of faculty is guided by Mr.
Vimal Daga –the chief technicalofficer of Linux
World Informatics Pvt Ltd.
About Project
 In our project we have used Hadoop as a framework which is Apache open
source
framework.
My product provides two way to setup hadoop cluster-
Manual Configuration – Provided the node statistics like RAM, CPU and
Hard disk to the client. Cluster will be established as per on click selection
of Name Node, Data Node, Task Tracker and Job Tracker etc.
On demand Configuration – Establishment of cluster using Docker
(Container) and Ansible (DevOps) depending upon the number of
Data Nodes, Task Tracker and Node Manager
Features
 In Docker , We can setup Docker Images of type Centos:
latest, Fedora , Ubuntu and we can also manage our own
created docker images (container) by starting, stopping and
Removing our containers.
 We have provided our live shell on which we can run the
Linux Commands.
 We can setup Httpd (Hypertext transfer protocol Daemon
that is web server) server Configuration and NFS Server
Configuration
 In our Project we have used 60% of the Ansible and 40% of
python cgi.
NEED
Windows 10
Virtual Box 5.2
Redhat Linux 7.3
NFS Docker HDFS HTTPD
HDFS Architecture
Project report on hadoop and docker
Complete setup
Project report on hadoop and docker
Docker
Project report on hadoop and docker
Project report on hadoop and docker
Project report on hadoop and docker

More Related Content

PDF
Lab2ppt
PDF
Embedded system design psoc lab report
PPTX
Arduino RFID Module (RC522) & Buzzer Access System
PPT
Handling tree structures — recursive SPs, nested sets, recursive CTEs
PDF
Introduction to 8086 Assembly language & arrays
PPTX
What isn’t told about timers in stm32 application
PPTX
Extensible and Dynamic Topic Types for DDS
Lab2ppt
Embedded system design psoc lab report
Arduino RFID Module (RC522) & Buzzer Access System
Handling tree structures — recursive SPs, nested sets, recursive CTEs
Introduction to 8086 Assembly language & arrays
What isn’t told about timers in stm32 application
Extensible and Dynamic Topic Types for DDS

Similar to Project report on hadoop and docker (20)

PPTX
BigDataTech 2015 Is Hadoop Enterprise ready?
PPT
Hw09 Production Deep Dive With High Availability
PPTX
Hadoop-Automation-Tool_RamkishorTak
PPTX
How to Upgrade Your Hadoop Stack in 1 Step -- with Zero Downtime
PPTX
Implementing Hadoop on a single cluster
PPTX
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
PDF
Scale 12 x Efficient Multi-tenant Hadoop 2 Workloads with Yarn
PDF
Infrastructure Around Hadoop
PDF
Scaling Hadoop at LinkedIn
PPTX
Hadoop project design and a usecase
PDF
Hortonworks HDP, Is it goog enough ?
PPTX
Atlanta hadoop users group july 2013
PPTX
Top 10 lessons learned from deploying hadoop in a private cloud
ODP
The influence of "Distributed platforms" on #devops
PPT
Hadoop applicationarchitectures
PDF
Run stuff, Deploy Stuff, Jax London 2017 Edition
PDF
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
PPTX
Introduction to hadoop V2
PDF
BDM37: Hadoop in production – the war stories by Nikolaï Grigoriev, Principal...
DOCX
BigDataTech 2015 Is Hadoop Enterprise ready?
Hw09 Production Deep Dive With High Availability
Hadoop-Automation-Tool_RamkishorTak
How to Upgrade Your Hadoop Stack in 1 Step -- with Zero Downtime
Implementing Hadoop on a single cluster
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Scale 12 x Efficient Multi-tenant Hadoop 2 Workloads with Yarn
Infrastructure Around Hadoop
Scaling Hadoop at LinkedIn
Hadoop project design and a usecase
Hortonworks HDP, Is it goog enough ?
Atlanta hadoop users group july 2013
Top 10 lessons learned from deploying hadoop in a private cloud
The influence of "Distributed platforms" on #devops
Hadoop applicationarchitectures
Run stuff, Deploy Stuff, Jax London 2017 Edition
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
Introduction to hadoop V2
BDM37: Hadoop in production – the war stories by Nikolaï Grigoriev, Principal...
Ad

Recently uploaded (20)

PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
A Presentation on Artificial Intelligence
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Cloud computing and distributed systems.
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
KodekX | Application Modernization Development
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Big Data Technologies - Introduction.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
A Presentation on Artificial Intelligence
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Cloud computing and distributed systems.
Unlocking AI with Model Context Protocol (MCP)
Spectral efficient network and resource selection model in 5G networks
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Understanding_Digital_Forensics_Presentation.pptx
KodekX | Application Modernization Development
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
MYSQL Presentation for SQL database connectivity
NewMind AI Monthly Chronicles - July 2025
Digital-Transformation-Roadmap-for-Companies.pptx
cuic standard and advanced reporting.pdf
Encapsulation_ Review paper, used for researhc scholars
Big Data Technologies - Introduction.pptx
Ad

Project report on hadoop and docker

  • 1. A K H I L G O Y A L 1 5 C S U 0 1 6 Summer Intenship Report
  • 2. About Company  LinuxWorld Informatics Pvt Ltd is an ISO Certified company which is dedicated to offering a comprehensive set of most useful open source and Commercial training programmes today’s Industry demands! . It’s main office is in Bangalore. The Structure of the organization is very simple. It is fully governed by young and Energetic Technocrats, dedicated to Open Source technologies and Linux promotion. The team of faculty is guided by Mr. Vimal Daga –the chief technicalofficer of Linux World Informatics Pvt Ltd.
  • 3. About Project  In our project we have used Hadoop as a framework which is Apache open source framework. My product provides two way to setup hadoop cluster- Manual Configuration – Provided the node statistics like RAM, CPU and Hard disk to the client. Cluster will be established as per on click selection of Name Node, Data Node, Task Tracker and Job Tracker etc. On demand Configuration – Establishment of cluster using Docker (Container) and Ansible (DevOps) depending upon the number of Data Nodes, Task Tracker and Node Manager
  • 4. Features  In Docker , We can setup Docker Images of type Centos: latest, Fedora , Ubuntu and we can also manage our own created docker images (container) by starting, stopping and Removing our containers.  We have provided our live shell on which we can run the Linux Commands.  We can setup Httpd (Hypertext transfer protocol Daemon that is web server) server Configuration and NFS Server Configuration  In our Project we have used 60% of the Ansible and 40% of python cgi.
  • 6. Windows 10 Virtual Box 5.2 Redhat Linux 7.3 NFS Docker HDFS HTTPD