SlideShare a Scribd company logo
BIG DATA - FAST DATA
USING MAPREDUCE IN HAZELCAST
Source:
www.hazelcast.com
Christoph Engelbert(@noctarius2k)
8+ years of JavaWeirdoness
Performance, GC, traffic topics
Apache Committer
Gaming, TravelManagement, ...
CastMapRMapReduce for Hazelcast3
www.hazelcast.com
TOPICS
Hazelcast
Distributed Computing
Map &Reduce
Demonstration
Questions
www.hazelcast.com
HAZELCAST
PICKIN' DIAMONDS
www.hazelcast.com
WHAT IS HAZELCAST?
In-MemoryData-Grid
DataPartioning(Sharding)
JavaCollections Implementation
Distributed ComputingPlatform
www.hazelcast.com
WHY HAZELCAST?
Automatic Partitioning
FaultTolerance
Sync /Async Backups
FullyDistributed
In-Memoryfor HighestSpeed
www.hazelcast.com
WHY HAZELCAST?
www.hazelcast.com
WHY DISTRIBUTED COMPUTING?
www.hazelcast.com
WHY IN-MEMORY
COMPUTING?
www.hazelcast.com
TREND OF PRICES
DataSource:
www.hazelcast.com
SPEED DIFFERENCE
DataSource:
www.hazelcast.com
DISTRIBUTED
COMPUTING
OR
MULTICORE CPU ON STEROIDS
www.hazelcast.com
THE IDEA OF DISTRIBUTED COMPUTING
Source:
www.hazelcast.com
THE BEGINNING
Source:
www.hazelcast.com
MULTICORE IS NOT NEW
Source:
www.hazelcast.com
CLUSTER IT
Source:
www.hazelcast.com
SUPER COMPUTER
Source:
www.hazelcast.com
CLOUD COMPUTING
Source:
www.hazelcast.com
MAP & REDUCE
THE BLACK MAGIC FROM PLANET GOOGLE
www.hazelcast.com
USE CASES
LogAnalysis
DataQuerying
Aggregation
Distributed Sort
ETL (ExtractTransform Load)
and more...
www.hazelcast.com
BASIC STEPS
Read
Map /Transform
Reduce
www.hazelcast.com
FULL STEPS
Read
Map /Transform
Combine
Group /Shuffle
Reduce
Collate
www.hazelcast.com
MAPREDUCE WORKFLOW
www.hazelcast.com
Dataare mapped /transformed in asetof key-value pairs
SOME PSEUDO CODE (1/3)
MAPPING
map(key:String,document:String):Void->
foreachw:Wordindocument:
emit(w,1)
www.hazelcast.com
Multiple values are combined to an
intermediate resultto preserve traffic
SOME PSEUDO CODE (2/3)
COMBINING
combine(word:Word,counts:List[Int]):Void->
emit(word,sum(counts))
www.hazelcast.com
Values are reduced /aggregated to the requested result
SOME PSEUDO CODE (3/3)
REDUCING
reduce(word:String,counts:List[Int]):Int->
returnsum(counts)
www.hazelcast.com
FOR MATHEMATICIANS
Process: (KxV)*→(LxW)* ⇒ [(l1,w1),…,(lm,wm)]
Mapping: (KxV) →(LxW)* ⇒ (k,v) →[(l1,w1),…,(ln,wn)]
Reducing: LxW*→X* ⇒ (l,[w1,…,wn]) →[x1,…,xn]
www.hazelcast.com
MAPREDUCE PROGRAMS IN
GOOGLE SOURCE TREE
Source:
www.hazelcast.com
DEMONSTRATION
www.hazelcast.com
@noctarius2k
@hazelcast
http://www.sourceprojects.com
http://github.com/noctarius
THANK YOU!
ANY QUESTIONS?
Images:AllimagesarelicensedunderCreativeCommons
www.hazelcast.com

More Related Content

PDF
How to Use HazelcastMQ for Flexible Messaging and More
PPTX
Reactive Systems And Vertx
PDF
Vert.x - 2014 JDay Lviv (English)
PDF
Lean reactive services with vertx
PPTX
Vertx in production
PPTX
Real World Enterprise Reactive Programming using Vert.x
PPTX
Event driven microservices with vertx and kubernetes
PDF
Reactive Software Systems
How to Use HazelcastMQ for Flexible Messaging and More
Reactive Systems And Vertx
Vert.x - 2014 JDay Lviv (English)
Lean reactive services with vertx
Vertx in production
Real World Enterprise Reactive Programming using Vert.x
Event driven microservices with vertx and kubernetes
Reactive Software Systems

Viewers also liked (14)

PDF
The Reactive Manifesto: Message-driven, Resilient, Elastic, Responsive - Stef...
PDF
Vert.X like Node.js but polyglot and reactive on JVM
PDF
Vert.x introduction
PPTX
Vert.x for Microservices Architecture
KEY
Functional Reactive Programming in Javascript
PDF
Reactor, Reactive streams and MicroServices
PDF
Reactive Web Applications
PPTX
PPTX
Vert.x vs akka
PDF
Reactor 3.0, a reactive foundation for java 8 and Spring
PPTX
Micro services, reactive manifesto and 12-factors
PDF
Introduction to Nodejs
PDF
Intro to Reactive Programming
PPTX
Benefits of developing a Single Page Web Applications using AngularJS
The Reactive Manifesto: Message-driven, Resilient, Elastic, Responsive - Stef...
Vert.X like Node.js but polyglot and reactive on JVM
Vert.x introduction
Vert.x for Microservices Architecture
Functional Reactive Programming in Javascript
Reactor, Reactive streams and MicroServices
Reactive Web Applications
Vert.x vs akka
Reactor 3.0, a reactive foundation for java 8 and Spring
Micro services, reactive manifesto and 12-factors
Introduction to Nodejs
Intro to Reactive Programming
Benefits of developing a Single Page Web Applications using AngularJS
Ad

Similar to Map Reduce in Hazelcast - Hazelcast User Group London Version (20)

PDF
Big Data, Fast Data - MapReduce in Hazelcast
PDF
Distributed computing with Hazelcast - JavaOne 2014
PDF
Distributed Computing in Hazelcast - Geekout 2014 Edition
PDF
How to Speed up your Database
PDF
Distributed Computing - An Interactive Introduction
PPTX
Distributed caching-computing v3.8
PDF
Hazelcast - In-Memory DataGrid
PDF
In-Memory Computing - Distributed Systems - Devoxx UK 2015
PDF
Low-latency real-time data processing at giga-scale with Kafka | John DesJard...
PPTX
Geek Nights Hong Kong
PPTX
Hazelcast Essentials
PDF
Easy Scaling with Open Source Data Structures, by Talip Ozturk
PDF
Hazelcast Introduction
PPTX
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
PDF
Hazelcast 101
PDF
Clustering your Application with Hazelcast
PDF
Map reduce and hadoop at mylife
PDF
Devoxx uk 2014 High performance in-memory Java with open source
PPT
Hazelcast
PPTX
Hadoop MapReduce Paradigm
Big Data, Fast Data - MapReduce in Hazelcast
Distributed computing with Hazelcast - JavaOne 2014
Distributed Computing in Hazelcast - Geekout 2014 Edition
How to Speed up your Database
Distributed Computing - An Interactive Introduction
Distributed caching-computing v3.8
Hazelcast - In-Memory DataGrid
In-Memory Computing - Distributed Systems - Devoxx UK 2015
Low-latency real-time data processing at giga-scale with Kafka | John DesJard...
Geek Nights Hong Kong
Hazelcast Essentials
Easy Scaling with Open Source Data Structures, by Talip Ozturk
Hazelcast Introduction
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
Hazelcast 101
Clustering your Application with Hazelcast
Map reduce and hadoop at mylife
Devoxx uk 2014 High performance in-memory Java with open source
Hazelcast
Hadoop MapReduce Paradigm
Ad

More from Christoph Engelbert (20)

PDF
Postgres on Kubernetes - Dos and Donts.pdf
PDF
Data Pipeline Plumbing
PDF
Gute Nachrichten, Schlechte Nachrichten
PDF
Of Farm Topologies and Time-Series Data
PDF
What I learned about IoT Security ... and why it's so hard!
PDF
PostgreSQL: The Time-Series Database You (Actually) Want
PDF
Road to (Enterprise) Observability
PDF
Oops-Less Operation
PDF
Instan(t)a-neous Monitoring
PDF
Don't Go, Java!
PDF
TypeScript Go(es) Embedded
PDF
Hazelcast Jet - Riding the Jet Streams
PDF
CBOR - The Better JSON
PDF
Project Panama - Beyond the (JVM) Wall
PDF
The Delivery Hero - A Simpsons As A Service Storyboard
PDF
A Post-Apocalyptic sun.misc.Unsafe World
PDF
Gimme Caching - The JCache Way
PDF
Distributed Computing with Hazelcast - Brazil Tour
PDF
In-Memory Distributed Computing - Porto Tech Hub
PDF
JCache - Gimme Caching - JavaLand
Postgres on Kubernetes - Dos and Donts.pdf
Data Pipeline Plumbing
Gute Nachrichten, Schlechte Nachrichten
Of Farm Topologies and Time-Series Data
What I learned about IoT Security ... and why it's so hard!
PostgreSQL: The Time-Series Database You (Actually) Want
Road to (Enterprise) Observability
Oops-Less Operation
Instan(t)a-neous Monitoring
Don't Go, Java!
TypeScript Go(es) Embedded
Hazelcast Jet - Riding the Jet Streams
CBOR - The Better JSON
Project Panama - Beyond the (JVM) Wall
The Delivery Hero - A Simpsons As A Service Storyboard
A Post-Apocalyptic sun.misc.Unsafe World
Gimme Caching - The JCache Way
Distributed Computing with Hazelcast - Brazil Tour
In-Memory Distributed Computing - Porto Tech Hub
JCache - Gimme Caching - JavaLand

Recently uploaded (20)

PPTX
web development for engineering and engineering
PPT
Project quality management in manufacturing
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPT
Mechanical Engineering MATERIALS Selection
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PDF
PPT on Performance Review to get promotions
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
composite construction of structures.pdf
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PDF
Well-logging-methods_new................
PPTX
Welding lecture in detail for understanding
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PDF
Digital Logic Computer Design lecture notes
PPTX
Construction Project Organization Group 2.pptx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
web development for engineering and engineering
Project quality management in manufacturing
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Mechanical Engineering MATERIALS Selection
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPT on Performance Review to get promotions
R24 SURVEYING LAB MANUAL for civil enggi
composite construction of structures.pdf
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Well-logging-methods_new................
Welding lecture in detail for understanding
Foundation to blockchain - A guide to Blockchain Tech
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
Digital Logic Computer Design lecture notes
Construction Project Organization Group 2.pptx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx

Map Reduce in Hazelcast - Hazelcast User Group London Version