SlideShare a Scribd company logo
BBigig DData Analysis for Pageata Analysis for Page
Ranking using Map/ReduceRanking using Map/Reduce
R.Renuka,
R.Vidhya Priya,
IIIB.Sc., IT,
The S.F.R.College forWomen,
Sivakasi.
Overview
Introduction
What isBig Data!
Why Big Data?
4 V’sOf Big Data
Big DataAnalyticsTechnologies
Map/Reduce
Applications
CaseStudy
Conclusion
Introduction
Datahaveoutgrown thestorageand processing capabilitiesof
asinglehost.
Two fundamental challenges:
– how to storeand
– how to work with voluminousdatasizes, and,
– how to understand dataand turn it into acompetitive
advantage.
What isBig Data!
‘Big-data’ issimilar to ‘Small-data’, but bigger
But having databigger requiresdifferent approaches:
techniques, tools& architectures
To solve:
New problemsand old problemsin abetter way.
TheBlind men and theElephant
Why Big Data?
Key enablersfor thegrowth of “Big Data” are:
Increaseof Processing Power
Increaseof StorageCapacities
Availability of Data
4 V’sof Big Data
Big DataAnalyticsTechnologies
Hadoop
PLATFORA
WibiData
PIG
Hive
MapReduce
NoSQL databases
Column-oriented databases
Hadoop
Hadoop isadistributed filesystem and data
processing engine
Hadoop hastwo components:
– TheHadoop distributed filesystem (HDFS)
– TheMapReduceprograming.
Map / Reduce
A High level abstracted framework for distributed processing of large
datasets
Fault Tolerant , Parallelization
Computation consistsof two phases
Map
Reduce
A Master-Slavearchitecture
Computationsoccursin multipleslavenodes
And it triesto providedatalocality asmuch aspossible.
MR model
Map
– Processakey/valuepair to generateintermediatekey/value
pairs
Reduce
– Mergeall intermediatevaluesassociated with thesamekey
Usersimplement interfaceof two primary methods:
1. Map: (key1, val1) → (key2, val2)
2. Reduce: (key2, [val2]) → [val3]
Applications
Homeland Security
FinanceSmarter Healthcare
Multi-channel
sales
Telecom
Manufacturing
Traffic Control
Trading Analytics Fraud and Risk
Log Analysis
Search Quality
Retails
CaseStudy
Big data analysis using map/reduce
Conclusion
Real-time big data isn’t just a process for storing
petabytesor exabytesof datain adatawarehouse, It’s
about the ability to make better decisions and take
meaningful actionsat theright time.
Queries ??
Big data analysis using map/reduce

More Related Content

PPTX
Big data analytics
PPTX
Big_data_ppt
PDF
Big Data
PPTX
PPTX
Big Data & Hadoop Introduction
PPTX
Data Lake Overview
PPTX
Our big data
Big data analytics
Big_data_ppt
Big Data
Big Data & Hadoop Introduction
Data Lake Overview
Our big data

What's hot (20)

PPTX
Chapter 1 big data
PPTX
Big Data PPT by Rohit Dubey
PDF
Data Architecture Strategies: Data Architecture for Digital Transformation
PPTX
Presentation About Big Data (DBMS)
PPTX
Big Data
PPTX
Big data by Mithlesh sadh
PDF
Data Strategy
PPTX
Introduction to Big Data
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PPTX
Big Data - Applications and Technologies Overview
PDF
Data Governance Best Practices and Lessons Learned
PDF
DAS Slides: Data Governance - Combining Data Management with Organizational ...
PPTX
Is the traditional data warehouse dead?
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PPTX
Data science Big Data
PDF
Data Product Architectures
PPT
Big Data
PDF
Introduction to Data Science
PDF
Data Lake Architecture
Chapter 1 big data
Big Data PPT by Rohit Dubey
Data Architecture Strategies: Data Architecture for Digital Transformation
Presentation About Big Data (DBMS)
Big Data
Big data by Mithlesh sadh
Data Strategy
Introduction to Big Data
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Big Data - Applications and Technologies Overview
Data Governance Best Practices and Lessons Learned
DAS Slides: Data Governance - Combining Data Management with Organizational ...
Is the traditional data warehouse dead?
Enabling a Data Mesh Architecture with Data Virtualization
Data science Big Data
Data Product Architectures
Big Data
Introduction to Data Science
Data Lake Architecture
Ad

Similar to Big data analysis using map/reduce (20)

PPT
Big Data Analysis for page ranking using map reduce concept
PPT
BIG DATA Analysis for page ranking using Map Reduce
PDF
Big Data Analytics Lecture notes pdf notes
PPTX
Big data Intro - Presentation to OCHackerz Meetup Group
PDF
(R17A0528) BIG DATA ANALYTICS.pdf
PDF
(R17A0528) BIG DATA ANALYTICS.pdf
PPTX
Big data analytics - Introduction to Big Data and Hadoop
PPTX
Presentation on BigData by Swapnaja
PPTX
Big Data
PPTX
Big Data
PDF
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
PDF
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
PDF
PPT
Big data introduction, Hadoop in details
PPTX
Big Data
PDF
Hadoop Master Class : A concise overview
PDF
DOCX
Content1. Introduction2. What is Big Data3. Characte.docx
PDF
Introduction to Big Data
PPT
Big data
Big Data Analysis for page ranking using map reduce concept
BIG DATA Analysis for page ranking using Map Reduce
Big Data Analytics Lecture notes pdf notes
Big data Intro - Presentation to OCHackerz Meetup Group
(R17A0528) BIG DATA ANALYTICS.pdf
(R17A0528) BIG DATA ANALYTICS.pdf
Big data analytics - Introduction to Big Data and Hadoop
Presentation on BigData by Swapnaja
Big Data
Big Data
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
Big data introduction, Hadoop in details
Big Data
Hadoop Master Class : A concise overview
Content1. Introduction2. What is Big Data3. Characte.docx
Introduction to Big Data
Big data
Ad

Recently uploaded (20)

PPTX
Self management and self evaluation presentation
PDF
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
PPTX
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
PPTX
2025-08-10 Joseph 02 (shared slides).pptx
PPTX
Tablets And Capsule Preformulation Of Paracetamol
PDF
Swiggy’s Playbook: UX, Logistics & Monetization
PDF
oil_refinery_presentation_v1 sllfmfls.pdf
PDF
Instagram's Product Secrets Unveiled with this PPT
PPTX
Impressionism_PostImpressionism_Presentation.pptx
PPTX
Intro to ISO 9001 2015.pptx wareness raising
PPTX
Learning-Plan-5-Policies-and-Practices.pptx
PPTX
Tour Presentation Educational Activity.pptx
PPTX
Role and Responsibilities of Bangladesh Coast Guard Base, Mongla Challenges
PPTX
Non-Verbal-Communication .mh.pdf_110245_compressed.pptx
PPTX
Introduction to Effective Communication.pptx
PPTX
_ISO_Presentation_ISO 9001 and 45001.pptx
PPTX
INTERNATIONAL LABOUR ORAGNISATION PPT ON SOCIAL SCIENCE
PPTX
Effective_Handling_Information_Presentation.pptx
PPTX
AcademyNaturalLanguageProcessing-EN-ILT-M02-Introduction.pptx
PPTX
Understanding-Communication-Berlos-S-M-C-R-Model.pptx
Self management and self evaluation presentation
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
2025-08-10 Joseph 02 (shared slides).pptx
Tablets And Capsule Preformulation Of Paracetamol
Swiggy’s Playbook: UX, Logistics & Monetization
oil_refinery_presentation_v1 sllfmfls.pdf
Instagram's Product Secrets Unveiled with this PPT
Impressionism_PostImpressionism_Presentation.pptx
Intro to ISO 9001 2015.pptx wareness raising
Learning-Plan-5-Policies-and-Practices.pptx
Tour Presentation Educational Activity.pptx
Role and Responsibilities of Bangladesh Coast Guard Base, Mongla Challenges
Non-Verbal-Communication .mh.pdf_110245_compressed.pptx
Introduction to Effective Communication.pptx
_ISO_Presentation_ISO 9001 and 45001.pptx
INTERNATIONAL LABOUR ORAGNISATION PPT ON SOCIAL SCIENCE
Effective_Handling_Information_Presentation.pptx
AcademyNaturalLanguageProcessing-EN-ILT-M02-Introduction.pptx
Understanding-Communication-Berlos-S-M-C-R-Model.pptx

Big data analysis using map/reduce