SlideShare a Scribd company logo
BIG DATA - FAST DATA
USING MAPREDUCE IN HAZELCAST
Source:http://www.newscientist.com/gallery/dn17805-computer-museums-of-the-world/11
www.hazelcast.com
WHO AM I
Christoph Engelbert(@noctarius2k)
8+ years of JavaWeirdoness
Performance, GC, traffic topics
Apache DirectMemoryPMC
Previous companies incl. Ubisoftand HRS
CastMapRMapReduce for Hazelcast3
www.hazelcast.com
TOPICS
Hazelcast
Distributed Computing
Map &Reduce
Demonstration
Questions
www.hazelcast.com
HAZELCAST
A SHORT SPACE TRIP
www.hazelcast.com
WHAT IS HAZELCAST?
In-MemoryData-Grid
DataPartioning(Sharding)
JavaCollections Implementation
Distributed ComputingPlatform
www.hazelcast.com
WHY HAZELCAST?
www.hazelcast.com
WHY IN-MEMORY
COMPUTING?
www.hazelcast.com
TREND OF PRICES
DataSource:http://www.jcmit.com/memoryprice.htm
www.hazelcast.com
SPEED DIFFERENCE
DataSource:http://i.imgur.com/ykOjTVw.png
www.hazelcast.com
DISTRIBUTED
COMPUTING
OR
MULTICORE CPU ON STEROIDS
www.hazelcast.com
THE IDEA OF DISTRIBUTED COMPUTING
Source:https://www.flickr.com/photos/stefan_ledwina/1853508040
www.hazelcast.com
THE BEGINNING
Source:http://en.wikipedia.org/wiki/File:KL_Advanced_Micro_Devices_AM9080.jpg
www.hazelcast.com
MULTICORE IS NOT NEW
Source:http://en.wikipedia.org/wiki/File:80386with387.JPG
www.hazelcast.com
CLUSTER IT
Source:http://rarecpus.com/images2/cpu_cluster.jpg
www.hazelcast.com
SUPER COMPUTER
Source:http://www.dkrz.de/about/aufgaben/dkrz-geschichte/rechnerhistorie-1
www.hazelcast.com
CLOUD COMPUTING
Source:https://farm6.staticflickr.com/5523/11407118963_e0e0870846_b_d.jpg
www.hazelcast.com
MAP & REDUCE
THE BLACK MAGIC FROM PLANET GOOGLE
www.hazelcast.com
USE CASES
LogAnalysis
DataQuerying
Aggregation and summing
Distributed Sort
ETL (ExtractTransform Load)
and more...
www.hazelcast.com
SIMPLE STEPS
Read
Map /Transform
Reduce
www.hazelcast.com
FULL STEPS
Read
Map /Transform
Combining
Grouping/Shuffling
Reduce
Collating
www.hazelcast.com
MAPREDUCE WORKFLOW
www.hazelcast.com
Dataare mapped /transformed in asetof key-value pairs
SOME PSEUDO CODE (1/3)
MAPPING
map( key:String, document:String ):Void ->
for each w:word in document:
emit( w, 1 )
www.hazelcast.com
Multiple values are combined to an
intermediate resultto preserve traffic
SOME PSEUDO CODE (2/3)
COMBINING
combine( word:String, counts:List[Int] ):Void ->
emit( word, sum( counts ) )
www.hazelcast.com
Values are reduced /aggregated to the requested result
SOME PSEUDO CODE (3/3)
REDUCING
reduce( word:String, counts:List[Int] ):Int ->
return sum( counts )
www.hazelcast.com
FOR MATHEMATICIANS
Process: (K x V)*→ (L x W)* ⇒ [(l1, w1), …, (lm, wm)]
Mapping: (K x V) → (L x W)* ⇒ (k, v) → [(l1, w1), …, (ln, wn)]
Reducing: L x W*→ X* ⇒ (l, [w1, …, wn]) → [x1, …,xn]
www.hazelcast.com
MAPREDUCE PROGRAMS IN
GOOGLE SOURCE TREE
Source:http://research.google.com/archive/mapreduce-osdi04-slides/index-auto-0005.html
www.hazelcast.com
DEMONSTRATION
www.hazelcast.com
@noctarius2k
@hazelcast
http://www.sourceprojects.com
http://github.com/noctarius
THANK YOU!
ANY QUESTIONS?
Images:AllimagesarelicensedunderCreativeCommons
www.hazelcast.com

More Related Content

PPT
Hadoop file systems
PPT
Hadoop for the disillusioned
PDF
Technology Outlook - The new Era of computing
PPTX
The next generation of the Montage image mosaic engine
PDF
Hadoop/Spark Non-Technical Basics
PDF
In-Memory Distributed Computing - Porto Tech Hub
PDF
JVM Internals - NEJUG Nov 2010
KEY
Introduction to Class File Format & Byte Code
Hadoop file systems
Hadoop for the disillusioned
Technology Outlook - The new Era of computing
The next generation of the Montage image mosaic engine
Hadoop/Spark Non-Technical Basics
In-Memory Distributed Computing - Porto Tech Hub
JVM Internals - NEJUG Nov 2010
Introduction to Class File Format & Byte Code

Viewers also liked (12)

PDF
JCache - Gimme Caching - JavaLand
PDF
Gimme Caching - The JCache Way
PDF
Gimme Caching, the Hazelcast JCache Way
PDF
My Old Friend Malloc
PDF
The Delivery Hero - A Simpsons As A Service Storyboard
PDF
Unsafe Java World - Crossing the Borderline - JokerConf 2014 Saint Petersburg
PDF
Distributed computing with Hazelcast - JavaOne 2014
PDF
JVM Mechanics: When Does the JVM JIT & Deoptimize?
PDF
A Post-Apocalyptic sun.misc.Unsafe World
PDF
Understanding Garbage Collection
PDF
In-Memory Computing - Distributed Systems - Devoxx UK 2015
PDF
Hazelcast - In-Memory DataGrid
JCache - Gimme Caching - JavaLand
Gimme Caching - The JCache Way
Gimme Caching, the Hazelcast JCache Way
My Old Friend Malloc
The Delivery Hero - A Simpsons As A Service Storyboard
Unsafe Java World - Crossing the Borderline - JokerConf 2014 Saint Petersburg
Distributed computing with Hazelcast - JavaOne 2014
JVM Mechanics: When Does the JVM JIT & Deoptimize?
A Post-Apocalyptic sun.misc.Unsafe World
Understanding Garbage Collection
In-Memory Computing - Distributed Systems - Devoxx UK 2015
Hazelcast - In-Memory DataGrid
Ad

Similar to Big Data, Fast Data - MapReduce in Hazelcast (20)

PDF
Map Reduce in Hazelcast - Hazelcast User Group London Version
PDF
Distributed Computing in Hazelcast - Geekout 2014 Edition
PDF
How to Speed up your Database
PDF
Distributed Computing - An Interactive Introduction
PPTX
Geek Nights Hong Kong
PPTX
Hazelcast Essentials
PPTX
Distributed caching-computing v3.8
PDF
Low-latency real-time data processing at giga-scale with Kafka | John DesJard...
PPTX
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
PDF
Hazelcast Introduction
PDF
In-memory No SQL- GIDS2014
PPTX
ConFoo - 3 performance improvements
PDF
Caching and JCache with Greg Luck 18.02.16
PDF
Hazelcast 101
PPTX
SouJava- 3 easy performance improvements in your microservices architecture
PDF
Devoxx uk 2014 High performance in-memory Java with open source
PDF
Distributed Computing with Hazelcast - Brazil Tour
PDF
Building scalable applications with hazelcast
PDF
Building scalable applications with hazelcast
PDF
Easy Scaling with Open Source Data Structures, by Talip Ozturk
Map Reduce in Hazelcast - Hazelcast User Group London Version
Distributed Computing in Hazelcast - Geekout 2014 Edition
How to Speed up your Database
Distributed Computing - An Interactive Introduction
Geek Nights Hong Kong
Hazelcast Essentials
Distributed caching-computing v3.8
Low-latency real-time data processing at giga-scale with Kafka | John DesJard...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
Hazelcast Introduction
In-memory No SQL- GIDS2014
ConFoo - 3 performance improvements
Caching and JCache with Greg Luck 18.02.16
Hazelcast 101
SouJava- 3 easy performance improvements in your microservices architecture
Devoxx uk 2014 High performance in-memory Java with open source
Distributed Computing with Hazelcast - Brazil Tour
Building scalable applications with hazelcast
Building scalable applications with hazelcast
Easy Scaling with Open Source Data Structures, by Talip Ozturk
Ad

More from Christoph Engelbert (15)

PDF
Postgres on Kubernetes - Dos and Donts.pdf
PDF
Data Pipeline Plumbing
PDF
Gute Nachrichten, Schlechte Nachrichten
PDF
Of Farm Topologies and Time-Series Data
PDF
What I learned about IoT Security ... and why it's so hard!
PDF
PostgreSQL: The Time-Series Database You (Actually) Want
PDF
Road to (Enterprise) Observability
PDF
Oops-Less Operation
PDF
Instan(t)a-neous Monitoring
PDF
Don't Go, Java!
PDF
TypeScript Go(es) Embedded
PDF
Hazelcast Jet - Riding the Jet Streams
PDF
CBOR - The Better JSON
PDF
Project Panama - Beyond the (JVM) Wall
PDF
Gimme Caching - The JCache Way
Postgres on Kubernetes - Dos and Donts.pdf
Data Pipeline Plumbing
Gute Nachrichten, Schlechte Nachrichten
Of Farm Topologies and Time-Series Data
What I learned about IoT Security ... and why it's so hard!
PostgreSQL: The Time-Series Database You (Actually) Want
Road to (Enterprise) Observability
Oops-Less Operation
Instan(t)a-neous Monitoring
Don't Go, Java!
TypeScript Go(es) Embedded
Hazelcast Jet - Riding the Jet Streams
CBOR - The Better JSON
Project Panama - Beyond the (JVM) Wall
Gimme Caching - The JCache Way

Recently uploaded (20)

PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPTX
Lecture Notes Electrical Wiring System Components
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
composite construction of structures.pdf
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
OOP with Java - Java Introduction (Basics)
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
Sustainable Sites - Green Building Construction
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PPTX
Welding lecture in detail for understanding
PDF
PPT on Performance Review to get promotions
PPT
Mechanical Engineering MATERIALS Selection
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
Lecture Notes Electrical Wiring System Components
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
composite construction of structures.pdf
Model Code of Practice - Construction Work - 21102022 .pdf
R24 SURVEYING LAB MANUAL for civil enggi
OOP with Java - Java Introduction (Basics)
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Sustainable Sites - Green Building Construction
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
UNIT 4 Total Quality Management .pptx
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Welding lecture in detail for understanding
PPT on Performance Review to get promotions
Mechanical Engineering MATERIALS Selection

Big Data, Fast Data - MapReduce in Hazelcast