SlideShare a Scribd company logo
HDFS Salient Features
Application market experts have started to use
the term BigData to relate to information places
that are generally many magnitudes greater than
conventional data source. The biggest Oracle
data source or the biggest NetApp client could be
many number of terabytes at most, but BigData
represents storage space places that can range to
many number of petabytes.
●
Most standards concentrate on latency and
throughput of concerns, and appropriately so.
However, in my view, the key to developing a
BigData standard depends on must further
parallels of methods. A BigData standard should
evaluate latencies and throughput, but with a
good deal of modifications in the amount of
work, skews in the information set and in the
existence of mistakes.
Elasticity of resources
●
A main function of a BigData Product is that it
should be flexible in general. One should be able to
add software and components sources when
needed. Most BigData set ups do not want to pre-
provision for all the information that they might
gather in the long run, and the secret to success to
be cost-efficient is to be able to add sources to a
manufacturing shop without operating into recovery
time
Fault Tolerance
●
The Flexibility function described above
ultimately means that the program has to be
fault-tolerant. If a amount of work is operating on
your body and some areas of the program is not
able
Fault Tolerance
●
The Flexibility function described above
ultimately means that the program has to be
fault-tolerant. If a amount of work is operating on
your body and some areas of the program is not
able
Skew in the information set
●
Many big information techniques take in un-
curated information. Which indicates there are
always information factors that are excessive
outliers and presents locations in the program.
The amount of work on a BigData program is
not uniform; some small areas of it is are
significant locations and have extremely higher
fill than the rest of the program. Our standards
should be developed to operated with datasets
that have large alter and present amount of work
locations.
●
THANK YOU

More Related Content

PPTX
Chapter 6 - HR DEMAND FORECASTING – QUALITATIVE TECHNIQUES.pptx
PDF
Mt slides
PPTX
Advanced Analytics in Tableau: Use the Force!, Built-In Functions and 3rd Par...
PDF
Introdution to Dataops and AIOps (or MLOps)
PDF
Achieving Agility and Scale for Your Data Lake - Talend
PDF
5 Simple Steps to Unleash Big Data Talend Connect
PDF
RS Components reduce downtime and enable aggressive SAP rollout strategy to s...
DOCX
Word optimisa doc for linked in insights promotion
Chapter 6 - HR DEMAND FORECASTING – QUALITATIVE TECHNIQUES.pptx
Mt slides
Advanced Analytics in Tableau: Use the Force!, Built-In Functions and 3rd Par...
Introdution to Dataops and AIOps (or MLOps)
Achieving Agility and Scale for Your Data Lake - Talend
5 Simple Steps to Unleash Big Data Talend Connect
RS Components reduce downtime and enable aggressive SAP rollout strategy to s...
Word optimisa doc for linked in insights promotion

What's hot (19)

PDF
ODSC data science to DataOps
PDF
Unleash the Power of Big Data and Machine Learning
PPTX
Big Data Adoption Status
PPTX
Talend 6.1 - What's New in Talend?
PPTX
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
PDF
Embracing Cloud Agility to Maximize Flexibility & Performance
PDF
Eligotech presents @ Data Donderdag on 24 April 2014
PPTX
Sage presenation
PDF
seven steps to dataops @ dataops.rocks conference Oct 2019
PPTX
Low-tech, Low-cost data management: Six insights from national reporting on f...
PDF
PPTX
Who is Talend?
PPTX
Postgres survey podcast
PDF
DOG Meetup 18 November 2021 - Intro and Azumuta
PDF
Absoft Oil & Gas SAP Specialists Infographic
PDF
Operationalizing Data Analytics
PDF
Do Agile Data in Just 5 Shocking Steps!
PDF
IBM Aspera in Chemical & Petroleum Infographic
PPTX
sitNL 2015 algorithm programming made easy (Marcel de Bruin, Sander de Wildt)
ODSC data science to DataOps
Unleash the Power of Big Data and Machine Learning
Big Data Adoption Status
Talend 6.1 - What's New in Talend?
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Embracing Cloud Agility to Maximize Flexibility & Performance
Eligotech presents @ Data Donderdag on 24 April 2014
Sage presenation
seven steps to dataops @ dataops.rocks conference Oct 2019
Low-tech, Low-cost data management: Six insights from national reporting on f...
Who is Talend?
Postgres survey podcast
DOG Meetup 18 November 2021 - Intro and Azumuta
Absoft Oil & Gas SAP Specialists Infographic
Operationalizing Data Analytics
Do Agile Data in Just 5 Shocking Steps!
IBM Aspera in Chemical & Petroleum Infographic
sitNL 2015 algorithm programming made easy (Marcel de Bruin, Sander de Wildt)
Ad

Similar to Hdfs salient features (20)

PDF
Cloud-Native Data: What data questions to ask when building cloud-native apps
PDF
Map r whitepaper_zeta_architecture
PDF
Cloud Native Batch Processing: Beyond the What and How
PDF
TechoERP.pdf
PDF
Big data rmoug
PDF
GEP-Supply-Chain-Planning-Guide-Fnl_0.pdf
PDF
Best Practices & Lessons Learned from Deployment of PostgreSQL
 
PPT
PDF
PDF
Ataas2016 - Big data hadoop and map reduce - new age tools for aid to test...
PDF
Big Data Analytics Unit I CCS334 Syllabus
PDF
Big data Question bank.pdf
PPTX
Introduction of big data and analytics
PPTX
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
PDF
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
PDF
Understanding big data testing
PDF
Manipulating data with Talend. Learn how?
PDF
Manipulating Data with Talend.
PDF
Big Data with Hadoop – For Data Management, Processing and Storing
PDF
InfoSphere BigInsights
Cloud-Native Data: What data questions to ask when building cloud-native apps
Map r whitepaper_zeta_architecture
Cloud Native Batch Processing: Beyond the What and How
TechoERP.pdf
Big data rmoug
GEP-Supply-Chain-Planning-Guide-Fnl_0.pdf
Best Practices & Lessons Learned from Deployment of PostgreSQL
 
Ataas2016 - Big data hadoop and map reduce - new age tools for aid to test...
Big Data Analytics Unit I CCS334 Syllabus
Big data Question bank.pdf
Introduction of big data and analytics
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Understanding big data testing
Manipulating data with Talend. Learn how?
Manipulating Data with Talend.
Big Data with Hadoop – For Data Management, Processing and Storing
InfoSphere BigInsights
Ad

Recently uploaded (20)

PDF
Introduction to the R Programming Language
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
annual-report-2024-2025 original latest.
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Introduction to Data Science and Data Analysis
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
1_Introduction to advance data techniques.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
Introduction to the R Programming Language
Acceptance and paychological effects of mandatory extra coach I classes.pptx
annual-report-2024-2025 original latest.
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Reliability_Chapter_ presentation 1221.5784
Introduction to Data Science and Data Analysis
Galatica Smart Energy Infrastructure Startup Pitch Deck
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
Clinical guidelines as a resource for EBP(1).pdf
1_Introduction to advance data techniques.pptx
.pdf is not working space design for the following data for the following dat...
ISS -ESG Data flows What is ESG and HowHow
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
IBA_Chapter_11_Slides_Final_Accessible.pptx
Introduction to Knowledge Engineering Part 1
Introduction-to-Cloud-ComputingFinal.pptx
Database Infoormation System (DBIS).pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx

Hdfs salient features

  • 1. HDFS Salient Features Application market experts have started to use the term BigData to relate to information places that are generally many magnitudes greater than conventional data source. The biggest Oracle data source or the biggest NetApp client could be many number of terabytes at most, but BigData represents storage space places that can range to many number of petabytes.
  • 2. ● Most standards concentrate on latency and throughput of concerns, and appropriately so. However, in my view, the key to developing a BigData standard depends on must further parallels of methods. A BigData standard should evaluate latencies and throughput, but with a good deal of modifications in the amount of work, skews in the information set and in the existence of mistakes.
  • 3. Elasticity of resources ● A main function of a BigData Product is that it should be flexible in general. One should be able to add software and components sources when needed. Most BigData set ups do not want to pre- provision for all the information that they might gather in the long run, and the secret to success to be cost-efficient is to be able to add sources to a manufacturing shop without operating into recovery time
  • 4. Fault Tolerance ● The Flexibility function described above ultimately means that the program has to be fault-tolerant. If a amount of work is operating on your body and some areas of the program is not able
  • 5. Fault Tolerance ● The Flexibility function described above ultimately means that the program has to be fault-tolerant. If a amount of work is operating on your body and some areas of the program is not able
  • 6. Skew in the information set ● Many big information techniques take in un- curated information. Which indicates there are always information factors that are excessive outliers and presents locations in the program. The amount of work on a BigData program is not uniform; some small areas of it is are significant locations and have extremely higher fill than the rest of the program. Our standards should be developed to operated with datasets that have large alter and present amount of work locations.