Coartha Technosolutions
Presented by
Coartha Team
Rebot Project
Contents:-
1. Bigdata (Horton Works HDP-V2.0)
2. Machine Learning (Python V2.7.6)
3. Cloud ( Amazon Free tier)
4. Dev,staging and Production.
5. Database (MongoDB)
6. Linux Systems(Centos V6.4)
Brief Explanation
 Bigdata (Horton Works HDP-2.0)
Big Data is nothing but collection of data, data sets which
is of unstructured data .
Big Data is useful under the large growing data where it is
unable to manage.
Hadoop V2.0 is for running mapreduce job of the particular
task.
Hadoop V2.0 consists of Mapreduce ,Yarn and HDFS .
Above mentioned three components are important in
Hadoop.
Continue……
 HDFS:
 Hadoop Distributed file system.(HDFS).
 Handles large data with streaming data Access.
 Runs on top of all file system.
 Uses Blocks to store files.
 Mapreduce:
 Frame work for performing calculations on data in HDFS.
 Map&Reduce Function.
 YARN:
 Distributed Data Processing.
 Resource and Scheduler Manager.
Machine Learning Language
 Python V2.7.6 Modules on Dev,staging and
Production.
PIP-1.5.4
NLTK-2.0.4
Setup tools-3.3
Easy install-2.7
Numpy-1.8.1
pyYaml-3.11
Mrjob-0.4.2
Cloud
 Cloud Components for Staging & Production.
Centos V6.4 Instance .
Bucket for Storage of files.
WordPress Blog with Version 3.8.1.
JQuery on WordPress.
Visualization on instance.
.pem file for connecting Cloud from Local machine.
.ppk file for moving data from Local system to cloud
through FileZilla.
Public Ip (Elastic Ip for the Instance).
Dev,Staging and Production.
Maintain same version on all the three stages of
the Project.
MongoDB(Database)
 Description of Database:
Handles structured, unstructured and polymorphic.
NOSQL.....
Scale up with Bigdata.
MongoHQ for MongoDB server.
Backup & Restore Data from DB.
!!!!!!Thank You !!!!!!

More Related Content

PDF
Hadoop: Distributed data processing
PPTX
PDF
Hadoop and its role in Facebook: An Overview
PPTX
HADOOP TECHNOLOGY ppt
PPTX
Big data | Hadoop | components of hadoop |Rahul Gulab Sing
PPTX
Hadoop: Distributed Data Processing
PPTX
Big data
PPTX
Hadoop distributed file system
Hadoop: Distributed data processing
Hadoop and its role in Facebook: An Overview
HADOOP TECHNOLOGY ppt
Big data | Hadoop | components of hadoop |Rahul Gulab Sing
Hadoop: Distributed Data Processing
Big data
Hadoop distributed file system

What's hot (20)

PPTX
Introduction to Big Data & Hadoop Architecture - Module 1
PPT
Hadoop technology
PPTX
Hadoop Distributed File System
PPTX
Big data PPT
PPTX
Hadoop Presentation - PPT
PPTX
Data Analytics using MATLAB and HDF5
PDF
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
PDF
Hadoop
PPTX
Hadoop introduction
PDF
Cred_hadoop_presenatation
PDF
Introduction to Big Data and Hadoop using Local Standalone Mode
PPTX
Utilizing HDF4 File Content Maps for the Cloud Computing
PDF
Introduction to Hadoop part1
PPTX
PPTX
Matlab, Big Data, and HDF Server
PPTX
Hadoop Technology
PPTX
Introduction to Hadoop
PPTX
Big data and hadoop
PPTX
Hadoop introduction
PPT
Hadoop Technologies
Introduction to Big Data & Hadoop Architecture - Module 1
Hadoop technology
Hadoop Distributed File System
Big data PPT
Hadoop Presentation - PPT
Data Analytics using MATLAB and HDF5
Harnessing Hadoop: Understanding the Big Data Processing Options for Optimizi...
Hadoop
Hadoop introduction
Cred_hadoop_presenatation
Introduction to Big Data and Hadoop using Local Standalone Mode
Utilizing HDF4 File Content Maps for the Cloud Computing
Introduction to Hadoop part1
Matlab, Big Data, and HDF Server
Hadoop Technology
Introduction to Hadoop
Big data and hadoop
Hadoop introduction
Hadoop Technologies
Ad

Similar to Rebot Project Contents and Description (20)

PDF
What is hadoop
PPTX
2.introduction to hdfs
PPTX
Lecture 2 Hadoop.pptx
PDF
Hadoop J.G.Rohini 2nd M.sc., computer science bon secours college for women
PPTX
Bigdata and hadoop
PPTX
62_Tazeen_Sayed_Hadoop_Ecosystem.pptx
PDF
Hadoop J.G.Rohini II M.Sc.,computer science Bon secours college for women
DOCX
project report on hadoop
PPTX
Hadoop basics
PPTX
Hadoop and BigData - July 2016
PDF
G017143640
PDF
Big Data Analysis and Its Scheduling Policy – Hadoop
PPTX
Overview of big data & hadoop version 1 - Tony Nguyen
PPTX
Overview of Big data, Hadoop and Microsoft BI - version1
DOCX
Hadoop technology doc
PPT
Hadoop training by keylabs
PPTX
Hadoop
PPTX
THE SOLUTION FOR BIG DATA
PPTX
THE SOLUTION FOR BIG DATA
PPTX
Big data
What is hadoop
2.introduction to hdfs
Lecture 2 Hadoop.pptx
Hadoop J.G.Rohini 2nd M.sc., computer science bon secours college for women
Bigdata and hadoop
62_Tazeen_Sayed_Hadoop_Ecosystem.pptx
Hadoop J.G.Rohini II M.Sc.,computer science Bon secours college for women
project report on hadoop
Hadoop basics
Hadoop and BigData - July 2016
G017143640
Big Data Analysis and Its Scheduling Policy – Hadoop
Overview of big data & hadoop version 1 - Tony Nguyen
Overview of Big data, Hadoop and Microsoft BI - version1
Hadoop technology doc
Hadoop training by keylabs
Hadoop
THE SOLUTION FOR BIG DATA
THE SOLUTION FOR BIG DATA
Big data
Ad

Recently uploaded (20)

PPTX
The various Industrial Revolutions .pptx
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Tartificialntelligence_presentation.pptx
PPT
Geologic Time for studying geology for geologist
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
Hybrid model detection and classification of lung cancer
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
DOCX
search engine optimization ppt fir known well about this
PDF
August Patch Tuesday
PPTX
Web Crawler for Trend Tracking Gen Z Insights.pptx
PDF
Architecture types and enterprise applications.pdf
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPT
Module 1.ppt Iot fundamentals and Architecture
The various Industrial Revolutions .pptx
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
Assigned Numbers - 2025 - Bluetooth® Document
Tartificialntelligence_presentation.pptx
Geologic Time for studying geology for geologist
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Chapter 5: Probability Theory and Statistics
Developing a website for English-speaking practice to English as a foreign la...
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Zenith AI: Advanced Artificial Intelligence
Hybrid model detection and classification of lung cancer
A review of recent deep learning applications in wood surface defect identifi...
NewMind AI Weekly Chronicles – August ’25 Week III
search engine optimization ppt fir known well about this
August Patch Tuesday
Web Crawler for Trend Tracking Gen Z Insights.pptx
Architecture types and enterprise applications.pdf
O2C Customer Invoices to Receipt V15A.pptx
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
Module 1.ppt Iot fundamentals and Architecture

Rebot Project Contents and Description

  • 2. Rebot Project Contents:- 1. Bigdata (Horton Works HDP-V2.0) 2. Machine Learning (Python V2.7.6) 3. Cloud ( Amazon Free tier) 4. Dev,staging and Production. 5. Database (MongoDB) 6. Linux Systems(Centos V6.4)
  • 3. Brief Explanation  Bigdata (Horton Works HDP-2.0) Big Data is nothing but collection of data, data sets which is of unstructured data . Big Data is useful under the large growing data where it is unable to manage. Hadoop V2.0 is for running mapreduce job of the particular task. Hadoop V2.0 consists of Mapreduce ,Yarn and HDFS . Above mentioned three components are important in Hadoop.
  • 4. Continue……  HDFS:  Hadoop Distributed file system.(HDFS).  Handles large data with streaming data Access.  Runs on top of all file system.  Uses Blocks to store files.  Mapreduce:  Frame work for performing calculations on data in HDFS.  Map&Reduce Function.  YARN:  Distributed Data Processing.  Resource and Scheduler Manager.
  • 5. Machine Learning Language  Python V2.7.6 Modules on Dev,staging and Production. PIP-1.5.4 NLTK-2.0.4 Setup tools-3.3 Easy install-2.7 Numpy-1.8.1 pyYaml-3.11 Mrjob-0.4.2
  • 6. Cloud  Cloud Components for Staging & Production. Centos V6.4 Instance . Bucket for Storage of files. WordPress Blog with Version 3.8.1. JQuery on WordPress. Visualization on instance. .pem file for connecting Cloud from Local machine. .ppk file for moving data from Local system to cloud through FileZilla. Public Ip (Elastic Ip for the Instance).
  • 7. Dev,Staging and Production. Maintain same version on all the three stages of the Project.
  • 8. MongoDB(Database)  Description of Database: Handles structured, unstructured and polymorphic. NOSQL..... Scale up with Bigdata. MongoHQ for MongoDB server. Backup & Restore Data from DB.