This slide blank on purpose
[NYJavaSig] Riding the Distributed Streams - Feb 2nd, 2017
> whoami
• Solutions Architect @Hazelcast
• Hang out with awesome people
• @gamussa in internetz
Please follow me on Twitter
I’m very interesting ©
Agenda
• Refreshing knowledge on Java 8 Streams
• Distribute and Conquer
• Distributed Data
• Distributed Streams
• How we did all this
Java 8 Streams
Java 8 Streams…
• An abstraction that represents a sequence of elements
• Is not a data structure
• Conveys elements from a source through a pipeline of operations
• Operations don’t modify the source
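A minimal sketch of the points above, using only the standard java.util.stream API. The class name StreamBasics and the sample words are made up for illustration. It shows a source, a pipeline of lazy intermediate operations, a terminal operation, and the fact that the source list is left untouched.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical example class; not part of the original slides.
class StreamBasics {
    // Returns the upper-cased words that are longer than three characters.
    static List<String> longWordsUpper(List<String> source) {
        return source.stream()                  // the list is the source
                .filter(w -> w.length() > 3)    // intermediate operation (lazy)
                .map(String::toUpperCase)       // intermediate operation (lazy)
                .collect(Collectors.toList());  // terminal operation runs the pipeline
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("war", "and", "peace", "tolstoy");
        System.out.println(longWordsUpper(words)); // prints [PEACE, TOLSTOY]
        System.out.println(words);                 // prints [war, and, peace, tolstoy]
    }
}
```

The result is a new list; the stream itself stores nothing and the original words are unchanged.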
Why should I care about the
Stream API?
• You’re a Java developer
What does a regular Java developer think about Scala?
advanced
Why should I care about the
Stream API?
• You’re a Java developer
• Many Java developers know Java
• It’s all about data processing
java.util.stream operations
• map(), flatMap(), filter()
• reduce(), collect()
• sorted()
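The operations listed above can be combined into a small word-frequency pipeline. This is a sketch on a single JVM with the standard library only; the class name WordCount, its method names and the tokenizing regex are assumptions chosen for the example.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.function.Function;
import java.util.stream.Collectors;

// Hypothetical example class; not part of the original slides.
class WordCount {
    // Counts how often each word occurs across all lines.
    static Map<String, Long> count(List<String> lines) {
        return lines.stream()
                .flatMap(line -> Arrays.stream(line.split("\\W+"))) // one line -> many words
                .map(String::toLowerCase)                            // normalize case
                .filter(w -> !w.isEmpty())                           // drop empty tokens
                .collect(Collectors.groupingBy(Function.identity(), Collectors.counting()));
    }

    // Returns the n most frequent words; ties are broken alphabetically.
    static List<String> top(Map<String, Long> counts, int n) {
        return counts.entrySet().stream()
                .sorted(Map.Entry.<String, Long>comparingByValue().reversed()
                        .thenComparing(Map.Entry.comparingByKey()))
                .limit(n)
                .map(Map.Entry::getKey)
                .collect(Collectors.toList());
    }

    // Total number of counted words, computed with reduce().
    static long total(Map<String, Long> counts) {
        return counts.values().stream().reduce(0L, Long::sum);
    }
}
```

For example, counting the two lines "War and peace" and "war, and more war" gives war=3 and and=2, so top(counts, 2) returns [war, and] and total(counts) returns 7.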
Problem
• One does not simply put all Big Data in one
machine
Problem
• Data doesn’t fit on just one machine
Problem
• One does not simply put all Big Data in one
machine
• Data is too important to keep on only one machine
CACHES
Replication or Sharding?
http://book.mixu.net/distsys/single-page.html
Solution
• Use Distributed Map aka IMap
What’s Hazelcast IMDG?
• In-memory Data Grid
• Apache v2 Licensed
• Distributed
• Caches (IMap, JCache)
• Java Collections (IList, ISet, IQueue)
• Messaging (Topic, RingBuffer)
• Computation (ExecutorService, M-R)
[Diagram labels: Green Primary, Green Backup, Green Shard]
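The idea of shards with a primary and a backup copy can be sketched in plain Java. This is a simplified model and not Hazelcast's actual partitioning code: the class ShardingSketch, the round-robin owner assignment, the member names and the partition count of 12 are all assumptions made for the example.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical, simplified model of sharding with one backup per shard.
class ShardingSketch {
    // Maps a key to a partition (shard) id in the range [0, partitionCount).
    static int partitionId(Object key, int partitionCount) {
        return Math.floorMod(key.hashCode(), partitionCount);
    }

    // Member holding the primary copy of a partition (round-robin assignment).
    static String primaryOwner(int partitionId, List<String> members) {
        return members.get(partitionId % members.size());
    }

    // Member holding the backup copy: the next member in the list,
    // so with two or more members a copy survives the loss of one node.
    static String backupOwner(int partitionId, List<String> members) {
        return members.get((partitionId + 1) % members.size());
    }

    public static void main(String[] args) {
        List<String> members = Arrays.asList("node-A", "node-B", "node-C");
        for (String key : Arrays.asList("andrew", "pierre", "natasha")) {
            int p = partitionId(key, 12);
            System.out.println(key + " -> partition " + p
                    + ", primary " + primaryOwner(p, members)
                    + ", backup " + backupOwner(p, members));
        }
    }
}
```

Sharding spreads the keys over the members, while the backup copy addresses the "too important for one machine" problem.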
Problem
• Lambda serialization
Solution
• Serializable versions of the functional interfaces
• Introducing DistributedStream
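Why a serializable version of an interface helps can be shown with the JDK alone. In the sketch below, SerializableFunction is a hypothetical interface defined for this example (it is not the Hazelcast type). A lambda assigned to a plain Function cannot be written to an ObjectOutputStream, while the same lambda assigned to an interface that also extends Serializable survives a round trip through bytes, which is what sending it to another node would need.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;
import java.util.function.Function;

// Hypothetical example class; not part of the original slides.
class LambdaSerialization {
    // Serializable counterpart of java.util.function.Function (name is an assumption).
    interface SerializableFunction<T, R> extends Function<T, R>, Serializable { }

    // Serializes an object to bytes, as would happen before shipping it over the network.
    static byte[] toBytes(Object o) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(o);
        } catch (IOException e) {
            throw new UncheckedIOException(e); // includes NotSerializableException
        }
        return bytes.toByteArray();
    }

    // Restores an object from bytes, as the receiving side would.
    static Object fromBytes(byte[] data) {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(data))) {
            return in.readObject();
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    // Sends a function through serialization and back.
    @SuppressWarnings("unchecked")
    static <T, R> Function<T, R> roundTrip(SerializableFunction<T, R> f) {
        return (Function<T, R>) fromBytes(toBytes(f));
    }

    // Round-trips a serializable lambda and applies the restored copy.
    static int demo() {
        SerializableFunction<String, Integer> length = s -> s.length();
        return roundTrip(length).apply("Hazelcast");
    }

    public static void main(String[] args) {
        Function<String, Integer> plain = s -> s.length();
        try {
            toBytes(plain); // fails: the target type is not Serializable
        } catch (UncheckedIOException e) {
            System.out.println("plain lambda rejected: " + e.getCause());
        }
        System.out.println(demo()); // prints 9
    }
}
```

This round trip stays inside one JVM. On a real cluster the receiving node also needs the class that defines the lambda on its classpath.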
Jet Streams
What’s Hazelcast Jet?
• General purpose distributed data processing
framework
• Based on a Directed Acyclic Graph (DAG) to model data flow
• Built on top of Hazelcast IMDG
• Comparable to Apache Spark or Apache Flink
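Modeling data flow as a directed acyclic graph can be illustrated without any framework. The sketch below is not the Hazelcast Jet API: DagSketch, its methods and the vertex names are hypothetical. It only records vertices and edges and derives an order in which data can flow from sources to sinks (Kahn's algorithm). It says nothing about how Jet actually schedules or parallelizes a job.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical, framework-free model of a data-flow DAG.
class DagSketch {
    // Adjacency list: vertex name -> names of downstream vertices.
    private final Map<String, List<String>> edges = new LinkedHashMap<>();

    // Registers a processing step.
    DagSketch vertex(String name) {
        edges.putIfAbsent(name, new ArrayList<>());
        return this;
    }

    // Declares that data flows from one step to another.
    DagSketch edge(String from, String to) {
        vertex(from);
        vertex(to);
        edges.get(from).add(to);
        return this;
    }

    // Returns the vertices in topological order; throws if the graph has a cycle.
    List<String> flowOrder() {
        Map<String, Integer> inDegree = new LinkedHashMap<>();
        for (String v : edges.keySet()) {
            inDegree.put(v, 0);
        }
        for (List<String> targets : edges.values()) {
            for (String t : targets) {
                inDegree.merge(t, 1, Integer::sum);
            }
        }
        Deque<String> ready = new ArrayDeque<>();
        for (Map.Entry<String, Integer> e : inDegree.entrySet()) {
            if (e.getValue() == 0) {
                ready.add(e.getKey()); // vertices with no inputs are sources
            }
        }
        List<String> order = new ArrayList<>();
        while (!ready.isEmpty()) {
            String v = ready.remove();
            order.add(v);
            for (String t : edges.get(v)) {
                if (inDegree.merge(t, -1, Integer::sum) == 0) {
                    ready.add(t); // all inputs of t are already placed
                }
            }
        }
        if (order.size() != edges.size()) {
            throw new IllegalStateException("graph has a cycle");
        }
        return order;
    }
}
```

For a word count, new DagSketch().edge("source", "tokenize").edge("tokenize", "count").edge("count", "sink").flowOrder() returns [source, tokenize, count, sink].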
DAG
Job Execution
Future (It’s bright!)
• Memory module for processing big data
• Higher level streaming and batching APIs
• Reactive Streams
• Distributed Classloading
• Integrations (HDFS/Yarn/Mesos)
Your fuel, our Jet Engine
• Public release – Feb 7th.
• Developer Preview today - yay!
• http://hazelcast.org/jet-signup
• Send me a note viktor@hazelcast.com
• Follow @hazelcast and @gamussa (duh!!)
• Your questions #hazelcast #hazelcastjet
Conclusion
• The Java Stream API provides a very wide range of data processing tools
• War and Peace is a big (lots of data) book!
• Now we’re pretty sure that Andrew and Pierre are
the main characters

Editor's Notes

  • #9: Credits to Jeka Borisov
  • #13: http://www.nogarlicnoonions.com/images/article_images/2015-12/five-guys-dubai232015-12-28-11-24-23.jpg
  • #19: Think about the sliced-up sheep idea
  • #27: http://www.velvet.by/files/userfiles/19083/ekrk.jpg
  • #32: http://www.velvet.by/files/userfiles/19083/ekrk.jpg
  • #42: Interesting fact from War and Peace: Andrew and Pierre really are the main characters