SlideShare a Scribd company logo
© 2014 MapR Technologies 1© 2014 MapR Technologies
© 2014 MapR Technologies 2
American express American express
MapR at the Core of Big Data
700+ CustomersCloud Leadership
American
Express
Wells Fargo
UnitedHealth
Group
© 2015 MapR Technologies 3
The Growing Need For Platform Thinking
As it
happens
Near real
time
Micro batch
Batch
Developer Production Aspirational Goal
Batch
Micro
batch
New System
of Record
AND
Operational
Intelligence
Operational
analytics
Scale
Reliability
Trust
Latency
Multi-tenancy
Streams
© 2015 MapR Technologies 4
Their Strategy: Customer Do-It-Yourself
APACHE HADOOP
Security
YARN
Spark
Streaming
Storm
StreamingNoSQL &
Search
Juju
Provisioning
&
coordination
Savanna
h
ML,
Graph
Mahout
MLLib
GraphX
EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS
Workflow
& Data
Governanc
e
Pig
Cascadin
g
Spark
Batch
MapRedu
ce v1 & v2
Tez
HBase
Solr
Hive
Impala
Spark
SQL
Drill
SQL
Sentry Oozie
ZooKeepe
r
Sqoop
Flume
Data
Integration
& Access
HttpFS
Hue
Search
cluster
HA
duplicate
Kafka
cluster
NAS
storage
NoSQL
cluster
Not pre-integrated
More expensive
Latencies everywhere
Unreliability grows
© 2015 MapR Technologies 5
BIG STORAGE
What Makes A Big Data Platform?
Hadoop SQL Search Messaging
BIG PROCESSING
NoSQL
© 2015 MapR Technologies 6
MapR CONVERGED DATA PLATFORM
APACHE HADOOP
Security
YARN
Spark
Streaming
Storm
StreamingNoSQL &
Search
Provisioning
&
coordination
Sahara
ML, Graph
Mahout
MLLib
GraphX
EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS
Workflow
& Data
Governance
Pig
Spark
Batch
MapReduce v1
& v2
HBase
Solr
Hive
Impala
Spark SQL
Drill
SQL
Sentry Oozie ZooKeeperSqoop
Flume
Data Integration
& Access
HttpFS
Hue
HDFS APIs
MapR STORAGE PLATFORM
Unlimited Scale
Multi
Data-center
Global Namespace
Data
Protection
Unified Security
© 2015 MapR Technologies 7
MapR CONVERGED DATA PLATFORM
MapR-
DB
JSON
Search
RealTime
Streams
Apache Drill : ANSI SQL as the Unified Access Layer
File
System
POSIX
APACHE HADOOP
Security
YARN
Spark Streaming
Storm
StreamingNoSQL & Search
Provisioning
&
coordination
Sahara
ML, Graph
Mahout
MLLib
GraphX
EXECUTION ENGINES
DATA GOVERNANCE AND OPERATIONS
Workflow
& Data
Governance
Pig
Spark
Batch
MapReduce v1 &
v2
HBase
Solr
Hive
Impala
Spark SQL
Drill
SQL
Sentry Oozie ZooKeeperSqoop
Flume
Data Integration
& Access
HttpFS
Hue
HDFS APIs
MapR STORAGE PLATFORM
Unlimited Scale
Multi
Data-center
Global Namespace
Data
Protection
Unified Security
© 2015 MapR Technologies 8
MapRCONVERGEDDATAPLATFORM
MapR
DB
Search
MapR
Streams
APACHE HADOOP AND OSS ECOSYSTEM
Security
YARN
Spark
Streaming
Storm
StreamingNoSQL &
Search
Provisioning
&
coordination
Sahara
ML, Graph
Mahout
MLLib
GraphX
EXECUTION ENGINES
DATA GOVERNANCE AND OPERATIONS
Workflow
& Data
Governance
Pig
Spark
Batch
MapReduce
v1 & v2 HBase
Solr
Hive
Impala
Spark SQL
Drill
SQL
Sentry Oozie ZooKeeperSqoop
Flume
Data
Integration
& Access
HttpFS
Hue
File
System
POSIX
Apache Drill: Unified Access Layer
MapR STORAGE PLATFORM
Unlimited Scale
Multi
Data-center
Global Namespace
Data
Protection
Unified Security
DATABASES&APPLICATIONS
EMail
HPC
Document
Mgmt
Custom
Apps
Custom
Apps
© 2015 MapR Technologies 9
MapR Converged Data Platform
Open Source Engines & Tools Commercial Engines & Applications
Utility-Grade Platform Services
DataProcessing
Enterprise Storage
MapR-FS MapR-DB MapR Streams
Database Event Streaming
Global Namespace High Availability Data Protection Self-healing Unified Security Real-time Multi-tenancy
Search &
Others
Cloud &
Managed
Services
Custom
Apps
UnifiedManagementandMonitoring
© 2015 MapR Technologies 10
MapR Streams – Publish/Subscribe
Listeners
Publish billions of messages per sec to a topic in a stream.
Reliable delivery to all consumers. Immediately.
JSON data format, direct data access from analytics frameworks
L
Simple, standard API (Kafka).
Topi
c
Stream
Tie together geo-dispersed clusters. Worldwide.
Topic
Producers
Consumers
© 2014 MapR Technologies
Records
JSON messages allow SQL queries by Apache Drill
Pub1
{
“event” : “CLICK”,
“url” : “/about/”,
“user” : “Will”,
“ip” : “192.168.1.1”,
“loggedin” : true
}
/stream/web:events
[
{“seq001”: { … } },
{“seq002”: { … } },
…
]
SELECT user, ip, url FROM marlin.`/stream/web:events` WHERE event=CLICK
user ip url
--------------------------------------------------------------------------------------------
Will 192.168.1.1 /about/
Neeraja 192.168.1.2 /logout/
Mitesh 192.168.1.3 /prod01/buy
© 2015 MapR Technologies
MapR Streams: Global Messaging Fabric
...Edge
Aggregation
...
Sensors Apps Analytics
● Arbitrary interconnection of thousands of clusters
● Seamless disconnection/reconnection
● Globally synchronized sequence numbers &
cursors: Listener & Producer failover
● Global applications
● Data filtering/aggregation at the edge
● Distributed analytics
NYC LONDON
CHI TOK

More Related Content

PPTX
Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...
PPTX
MapR Streams and MapR Converged Data Platform
PPTX
Spark & Hadoop at Production at Scale
PDF
Meruvian - Introduction to MapR
PPTX
3 Benefits of Multi-Temperature Data Management for Data Analytics
PPTX
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
PPTX
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
PDF
Real World Use Cases: Hadoop and NoSQL in Production
Advanced Visual Analytics and Real-time Analytics at Platform scale by Brian ...
MapR Streams and MapR Converged Data Platform
Spark & Hadoop at Production at Scale
Meruvian - Introduction to MapR
3 Benefits of Multi-Temperature Data Management for Data Analytics
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
Real World Use Cases: Hadoop and NoSQL in Production

What's hot (20)

PDF
Streaming Architecture to Connect Everything (Including Hybrid Cloud) - Strat...
PPTX
Big Data at your Desk with KNIME
PDF
Spark and MapR Streams: A Motivating Example
PPTX
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
PPTX
MapR Product Update - Spring 2017
PPTX
CEP - simplified streaming architecture - Strata Singapore 2016
PDF
Open Source Innovations in the MapR Ecosystem Pack 2.0
PPTX
Keys for Success from Streams to Queries
PDF
Regulatory Reporting of Asset Trading Using Apache Spark-(Sudipto Shankar Das...
PPTX
Evolving Beyond the Data Lake: A Story of Wind and Rain
PDF
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
PDF
Big data today and tomorrow
PPTX
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
PPTX
Managing a Multi-Tenant Data Lake
PPTX
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
PPTX
Revolution Analytics
PDF
IoT Data Platforms: Processing IoT Data with Apache Kafka™
PPTX
Log I am your father
PDF
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
PPTX
NoSQL Application Development with JSON and MapR-DB
Streaming Architecture to Connect Everything (Including Hybrid Cloud) - Strat...
Big Data at your Desk with KNIME
Spark and MapR Streams: A Motivating Example
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
MapR Product Update - Spring 2017
CEP - simplified streaming architecture - Strata Singapore 2016
Open Source Innovations in the MapR Ecosystem Pack 2.0
Keys for Success from Streams to Queries
Regulatory Reporting of Asset Trading Using Apache Spark-(Sudipto Shankar Das...
Evolving Beyond the Data Lake: A Story of Wind and Rain
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Big data today and tomorrow
Xactly: How to Build a Successful Converged Data Platform with Hadoop, Spark,...
Managing a Multi-Tenant Data Lake
State of the Art Robot Predictive Maintenance with Real-time Sensor Data
Revolution Analytics
IoT Data Platforms: Processing IoT Data with Apache Kafka™
Log I am your father
Fast Cars, Big Data - How Streaming Can Help Formula 1 - Tugdual Grall - Code...
NoSQL Application Development with JSON and MapR-DB
Ad

Viewers also liked (20)

PPTX
American Express Slides, MLconf 2013
PDF
[Japanese Content] Lance Riedel_The App Server, The Hive in Tokyo_Aug29
PDF
Notes from the (greasy) field by Ranjit Nair - Co-founder and CTO, Altizon
PDF
Startup Series: Lean Analytics, Innovation, and Tilting at Windmills
PPTX
Bizitzaren historia
PPTX
Tomer Shiran, MapR_Hadoop&SQL
PPTX
Pre production planning
PPT
Chictopia for Mobile & Social Commerce panel discussion
PPS
San martin 2013 2014
PPTX
The Hive "Data Virtualization" Introduction - Jim Green, CEO of Composite Sof...
PDF
Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...
PPT
San martin 2013 2014
PDF
Mumhsocialpdf
PDF
Big Data App servor by Lance Riedel, CTO, The Hive for The Hive India event
PPTX
My magazine edited
PPTX
1.nigam shah stanford_meetup
PDF
[Japanese Content] TM Ravi_ Tokyo Presentation_TheHive_Sept 2013
PPTX
Alan Gates, Hortonworks_Hadoop&SQL
PDF
Expt panel hive_data_rp_20130320_final-1
PPS
Very beautiful
American Express Slides, MLconf 2013
[Japanese Content] Lance Riedel_The App Server, The Hive in Tokyo_Aug29
Notes from the (greasy) field by Ranjit Nair - Co-founder and CTO, Altizon
Startup Series: Lean Analytics, Innovation, and Tilting at Windmills
Bizitzaren historia
Tomer Shiran, MapR_Hadoop&SQL
Pre production planning
Chictopia for Mobile & Social Commerce panel discussion
San martin 2013 2014
The Hive "Data Virtualization" Introduction - Jim Green, CEO of Composite Sof...
Opportunites in Big Data by Sumant Mandal, Founder of The Hive for The Hive I...
San martin 2013 2014
Mumhsocialpdf
Big Data App servor by Lance Riedel, CTO, The Hive for The Hive India event
My magazine edited
1.nigam shah stanford_meetup
[Japanese Content] TM Ravi_ Tokyo Presentation_TheHive_Sept 2013
Alan Gates, Hortonworks_Hadoop&SQL
Expt panel hive_data_rp_20130320_final-1
Very beautiful
Ad

Similar to The Hive Think Tank: "Stream Processing Systems" by M.C. Srivas of MapR (20)

PDF
Hadoop and NoSQL joining forces by Dale Kim of MapR
PDF
An Introduction to the MapR Converged Data Platform
PPTX
Real Time and Big Data – It’s About Time
PPTX
Real Time and Big Data – It’s About Time
PPTX
How Spark is Enabling the New Wave of Converged Cloud Applications
PPTX
Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)
PPTX
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
PPT
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
PDF
Drill into Drill – How Providing Flexibility and Performance is Possible
PPTX
How Spark is Enabling the New Wave of Converged Applications
PPTX
Powering the "As it Happens" Business
PDF
MapR 5.2: Getting More Value from the MapR Converged Data Platform
PPTX
HUG France - Apache Drill
PDF
Key Considerations for Putting Hadoop in Production SlideShare
PPTX
Integrating Hadoop into your enterprise IT environment
PPTX
MapR-DB – The First In-Hadoop Document Database
PDF
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
PPTX
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
PPTX
Real time-hadoop
PPTX
Predictive Analytics San Diego
Hadoop and NoSQL joining forces by Dale Kim of MapR
An Introduction to the MapR Converged Data Platform
Real Time and Big Data – It’s About Time
Real Time and Big Data – It’s About Time
How Spark is Enabling the New Wave of Converged Cloud Applications
Big Data Everywhere Chicago: Getting Real with the MapR Platform (MapR)
Lambda Architecture: The Best Way to Build Scalable and Reliable Applications!
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Drill into Drill – How Providing Flexibility and Performance is Possible
How Spark is Enabling the New Wave of Converged Applications
Powering the "As it Happens" Business
MapR 5.2: Getting More Value from the MapR Converged Data Platform
HUG France - Apache Drill
Key Considerations for Putting Hadoop in Production SlideShare
Integrating Hadoop into your enterprise IT environment
MapR-DB – The First In-Hadoop Document Database
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Self-Service BI for big data applications using Apache Drill (Big Data Amster...
Real time-hadoop
Predictive Analytics San Diego

More from The Hive (20)

PDF
"Responsible AI", by Charlie Muirhead
PPTX
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
PDF
Digital Transformation; Digital Twins for Delivering Business Value in IIoT
PDF
Quantum Computing (IBM Q) - Hive Think Tank Event w/ Dr. Bob Sutor - 02.22.18
PPTX
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
PDF
Data Science in the Enterprise
PDF
AI in Software for Augmenting Intelligence Across the Enterprise
PPTX
“ High Precision Analytics for Healthcare: Promises and Challenges” by Sriram...
PPTX
"The Future of Manufacturing" by Sujeet Chand, SVP&CTO, Rockwell Automation
PPTX
Social Impact & Ethics of AI by Steve Omohundro
PDF
The Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
PDF
The Hive Think Tank: Machine Learning Applications in Genomics by Prof. Jian ...
PDF
The Hive Think Tank: The Future Of Customer Support - AI Driven Automation
PPTX
The Hive Think Tank: Talk by Mohandas Pai - India at 2030, How Tech Entrepren...
PDF
The Hive Think Tank: The Content Trap - Strategist's Guide to Digital Change
PPTX
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
PDF
The Hive Think Tank: Heron at Twitter
PPTX
The Hive Think Tank: Unpacking AI for Healthcare
PPTX
The Hive Think Tank: Translating IoT into Innovation at Every Level by Prith ...
PDF
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
"Responsible AI", by Charlie Muirhead
Translating a Trillion Points of Data into Therapies, Diagnostics, and New In...
Digital Transformation; Digital Twins for Delivering Business Value in IIoT
Quantum Computing (IBM Q) - Hive Think Tank Event w/ Dr. Bob Sutor - 02.22.18
The Hive Think Tank: Rendezvous Architecture Makes Machine Learning Logistics...
Data Science in the Enterprise
AI in Software for Augmenting Intelligence Across the Enterprise
“ High Precision Analytics for Healthcare: Promises and Challenges” by Sriram...
"The Future of Manufacturing" by Sujeet Chand, SVP&CTO, Rockwell Automation
Social Impact & Ethics of AI by Steve Omohundro
The Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
The Hive Think Tank: Machine Learning Applications in Genomics by Prof. Jian ...
The Hive Think Tank: The Future Of Customer Support - AI Driven Automation
The Hive Think Tank: Talk by Mohandas Pai - India at 2030, How Tech Entrepren...
The Hive Think Tank: The Content Trap - Strategist's Guide to Digital Change
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
The Hive Think Tank: Heron at Twitter
The Hive Think Tank: Unpacking AI for Healthcare
The Hive Think Tank: Translating IoT into Innovation at Every Level by Prith ...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...

Recently uploaded (20)

PDF
How to run a consulting project- client discovery
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
Lecture1 pattern recognition............
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Database Infoormation System (DBIS).pptx
PDF
annual-report-2024-2025 original latest.
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPT
Predictive modeling basics in data cleaning process
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PPTX
modul_python (1).pptx for professional and student
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
Managing Community Partner Relationships
PDF
Microsoft Core Cloud Services powerpoint
How to run a consulting project- client discovery
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Lecture1 pattern recognition............
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Database Infoormation System (DBIS).pptx
annual-report-2024-2025 original latest.
[EN] Industrial Machine Downtime Prediction
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Oracle OFSAA_ The Complete Guide to Transforming Financial Risk Management an...
Galatica Smart Energy Infrastructure Startup Pitch Deck
Predictive modeling basics in data cleaning process
Optimise Shopper Experiences with a Strong Data Estate.pdf
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
modul_python (1).pptx for professional and student
IBA_Chapter_11_Slides_Final_Accessible.pptx
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Managing Community Partner Relationships
Microsoft Core Cloud Services powerpoint

The Hive Think Tank: "Stream Processing Systems" by M.C. Srivas of MapR

  • 1. © 2014 MapR Technologies 1© 2014 MapR Technologies
  • 2. © 2014 MapR Technologies 2 American express American express MapR at the Core of Big Data 700+ CustomersCloud Leadership American Express Wells Fargo UnitedHealth Group
  • 3. © 2015 MapR Technologies 3 The Growing Need For Platform Thinking As it happens Near real time Micro batch Batch Developer Production Aspirational Goal Batch Micro batch New System of Record AND Operational Intelligence Operational analytics Scale Reliability Trust Latency Multi-tenancy Streams
  • 4. © 2015 MapR Technologies 4 Their Strategy: Customer Do-It-Yourself APACHE HADOOP Security YARN Spark Streaming Storm StreamingNoSQL & Search Juju Provisioning & coordination Savanna h ML, Graph Mahout MLLib GraphX EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Workflow & Data Governanc e Pig Cascadin g Spark Batch MapRedu ce v1 & v2 Tez HBase Solr Hive Impala Spark SQL Drill SQL Sentry Oozie ZooKeepe r Sqoop Flume Data Integration & Access HttpFS Hue Search cluster HA duplicate Kafka cluster NAS storage NoSQL cluster Not pre-integrated More expensive Latencies everywhere Unreliability grows
  • 5. © 2015 MapR Technologies 5 BIG STORAGE What Makes A Big Data Platform? Hadoop SQL Search Messaging BIG PROCESSING NoSQL
  • 6. © 2015 MapR Technologies 6 MapR CONVERGED DATA PLATFORM APACHE HADOOP Security YARN Spark Streaming Storm StreamingNoSQL & Search Provisioning & coordination Sahara ML, Graph Mahout MLLib GraphX EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Workflow & Data Governance Pig Spark Batch MapReduce v1 & v2 HBase Solr Hive Impala Spark SQL Drill SQL Sentry Oozie ZooKeeperSqoop Flume Data Integration & Access HttpFS Hue HDFS APIs MapR STORAGE PLATFORM Unlimited Scale Multi Data-center Global Namespace Data Protection Unified Security
  • 7. © 2015 MapR Technologies 7 MapR CONVERGED DATA PLATFORM MapR- DB JSON Search RealTime Streams Apache Drill : ANSI SQL as the Unified Access Layer File System POSIX APACHE HADOOP Security YARN Spark Streaming Storm StreamingNoSQL & Search Provisioning & coordination Sahara ML, Graph Mahout MLLib GraphX EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Workflow & Data Governance Pig Spark Batch MapReduce v1 & v2 HBase Solr Hive Impala Spark SQL Drill SQL Sentry Oozie ZooKeeperSqoop Flume Data Integration & Access HttpFS Hue HDFS APIs MapR STORAGE PLATFORM Unlimited Scale Multi Data-center Global Namespace Data Protection Unified Security
  • 8. © 2015 MapR Technologies 8 MapRCONVERGEDDATAPLATFORM MapR DB Search MapR Streams APACHE HADOOP AND OSS ECOSYSTEM Security YARN Spark Streaming Storm StreamingNoSQL & Search Provisioning & coordination Sahara ML, Graph Mahout MLLib GraphX EXECUTION ENGINES DATA GOVERNANCE AND OPERATIONS Workflow & Data Governance Pig Spark Batch MapReduce v1 & v2 HBase Solr Hive Impala Spark SQL Drill SQL Sentry Oozie ZooKeeperSqoop Flume Data Integration & Access HttpFS Hue File System POSIX Apache Drill: Unified Access Layer MapR STORAGE PLATFORM Unlimited Scale Multi Data-center Global Namespace Data Protection Unified Security DATABASES&APPLICATIONS EMail HPC Document Mgmt Custom Apps Custom Apps
  • 9. © 2015 MapR Technologies 9 MapR Converged Data Platform Open Source Engines & Tools Commercial Engines & Applications Utility-Grade Platform Services DataProcessing Enterprise Storage MapR-FS MapR-DB MapR Streams Database Event Streaming Global Namespace High Availability Data Protection Self-healing Unified Security Real-time Multi-tenancy Search & Others Cloud & Managed Services Custom Apps UnifiedManagementandMonitoring
  • 10. © 2015 MapR Technologies 10 MapR Streams – Publish/Subscribe Listeners Publish billions of messages per sec to a topic in a stream. Reliable delivery to all consumers. Immediately. JSON data format, direct data access from analytics frameworks L Simple, standard API (Kafka). Topi c Stream Tie together geo-dispersed clusters. Worldwide. Topic Producers Consumers
  • 11. © 2014 MapR Technologies Records JSON messages allow SQL queries by Apache Drill Pub1 { “event” : “CLICK”, “url” : “/about/”, “user” : “Will”, “ip” : “192.168.1.1”, “loggedin” : true } /stream/web:events [ {“seq001”: { … } }, {“seq002”: { … } }, … ] SELECT user, ip, url FROM marlin.`/stream/web:events` WHERE event=CLICK user ip url -------------------------------------------------------------------------------------------- Will 192.168.1.1 /about/ Neeraja 192.168.1.2 /logout/ Mitesh 192.168.1.3 /prod01/buy
  • 12. © 2015 MapR Technologies MapR Streams: Global Messaging Fabric ...Edge Aggregation ... Sensors Apps Analytics ● Arbitrary interconnection of thousands of clusters ● Seamless disconnection/reconnection ● Globally synchronized sequence numbers & cursors: Listener & Producer failover ● Global applications ● Data filtering/aggregation at the edge ● Distributed analytics NYC LONDON CHI TOK