© Cloudera, Inc. All rights reserved.
ADDRESSING CHALLENGES WITH
IOT EDGE MANAGEMENT
Dinesh Chandrasekhar
Director of Product Marketing, Data-in-Motion
COMMON IOT USE CASES BY INDUSTRY
Industries: Public Sector, Transportation, Utilities, Healthcare, Manufacturing, Retail
Fleet Management, Connected Cars, Smart Cities, Predictive Analytics, Inventory/Material Tracking
• IoT is a $1.13T market opportunity in 2021.
• Americas - $329B IoT spending. Manufacturing and Transportation are top industries, accounting for 26% of total spending.
• APAC - $500B IoT spending. Manufacturing, Utilities and Transportation are top industries.
• EMEA - $264B IoT spending. Manufacturing is top industry, powered by Industry 4.0 initiatives.
• Worldwide IoT Analytics and Information Management Market = $573M
Top 5 Use cases
Utility Monitoring, Predictive Maintenance, Patient Monitoring, Usage-based Insurance, Asset Tracking / Monitoring, Edge Data Collection
WHAT IS THE EDGE?
WHY IS EDGE DATA COLLECTION IMPORTANT?
Need for real-time insights
ETL process is not real-time
Data provenance
STREAMING REFERENCE ARCHITECTURE
Freightliner Fleet Trucks, Mack Fleet Trucks, Tata Fleet Trucks (each: Truck Sensors, C++ Agent)
Data Flow Apps (Powered by NiFi)
Subscribing Streaming Analytics Apps: Analytics App 1, Analytics App 2, Analytics App 3, Analytics App 4
Micro Services
WHAT ARE EDGE AGENTS?
KEY CHALLENGES AT THE EDGE
Deploying edge agents
Identifying classes of agents
Updates
Device compatibility
Edge storage availability
Low power
WHAT IS AN EDGE MANAGEMENT HUB?
Management and monitoring of agents
Application support
OTA updates
Registry
Eventing
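As a rough illustration of the registry and eventing roles listed above, here is a minimal in-memory sketch. The class `EdgeHub`, its methods, the event name `agent_registered`, and the agent IDs are all hypothetical names chosen for this example; they are not taken from any Cloudera or Apache API.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class EdgeHub:
    """Hypothetical in-memory hub: an agent registry plus a tiny event bus."""

    agents: Dict[str, str] = field(default_factory=dict)  # agent_id -> agent class
    _subscribers: Dict[str, List[Callable[[dict], None]]] = field(
        default_factory=lambda: defaultdict(list)
    )

    def subscribe(self, event_type: str, handler: Callable[[dict], None]) -> None:
        """Register a callback for one event type."""
        self._subscribers[event_type].append(handler)

    def _emit(self, event_type: str, payload: dict) -> None:
        for handler in self._subscribers[event_type]:
            handler(payload)

    def register(self, agent_id: str, agent_class: str) -> None:
        """Record the agent in the registry and notify subscribers."""
        self.agents[agent_id] = agent_class
        self._emit("agent_registered", {"agent_id": agent_id, "agent_class": agent_class})

    def agents_in_class(self, agent_class: str) -> List[str]:
        return sorted(a for a, c in self.agents.items() if c == agent_class)


hub = EdgeHub()
seen = []
hub.subscribe("agent_registered", seen.append)
hub.register("truck-001", "freightliner")
hub.register("truck-002", "mack")
hub.register("truck-003", "freightliner")
```

Keeping the agent class in the registry is what later makes class-scoped operations possible, such as updating one class of trucks without touching the others.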
MANAGING EDGE DATA COLLECTION IS A HARD PROBLEM
Freightliner Fleet Trucks, Mack Fleet Trucks, Tata Fleet Trucks (each: Truck Sensors, C++ Agent)
Data Flow Apps (Powered by NiFi)
Subscribing Streaming Analytics Apps: Analytics App 1, Analytics App 2, Analytics App 3, Analytics App 4
Micro Services
Challenge #1: Developing Edge Flows
How do I develop flows meant to run on the edge using NiFi-like flow-based programming with no code?
Challenge #2: Deploying Flows / Over-the-Air (OTA) Updates
How do I deploy data collection flows to the edge? How do I push updates to only the Freightliner class of trucks?
Challenge #3: Monitoring Agents and Flows
How do I manage hundreds of thousands of edge agents and the flows that they are running?
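One way to picture Challenge #2 is a hub that keeps one current flow per agent class, with agents pulling whatever is current for their own class. The sketch below assumes exactly that model; `FlowCatalog`, `FlowVersion`, `Agent`, and the flow strings are hypothetical and only illustrate the idea of class-scoped updates, not how any particular product implements OTA.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class FlowVersion:
    version: int
    definition: str  # stand-in for a real flow definition


class FlowCatalog:
    """Hypothetical hub-side catalog: one current flow per agent class."""

    def __init__(self) -> None:
        self._current: Dict[str, FlowVersion] = {}

    def publish(self, agent_class: str, definition: str) -> FlowVersion:
        """Publish a new flow for one class, bumping that class's version."""
        previous = self._current.get(agent_class)
        flow = FlowVersion((previous.version if previous else 0) + 1, definition)
        self._current[agent_class] = flow
        return flow

    def current_for(self, agent_class: str) -> Optional[FlowVersion]:
        return self._current.get(agent_class)


class Agent:
    def __init__(self, agent_id: str, agent_class: str) -> None:
        self.agent_id = agent_id
        self.agent_class = agent_class
        self.flow: Optional[FlowVersion] = None

    def check_for_update(self, catalog: FlowCatalog) -> bool:
        """Pull the flow for this agent's class; return True if it changed."""
        latest = catalog.current_for(self.agent_class)
        if latest is None:
            return False
        if self.flow is not None and self.flow.version >= latest.version:
            return False
        self.flow = latest
        return True


catalog = FlowCatalog()
catalog.publish("freightliner", "collect: speed, engine_temp")
catalog.publish("mack", "collect: speed")

freightliner = Agent("truck-001", "freightliner")
mack = Agent("truck-002", "mack")
freightliner.check_for_update(catalog)
mack.check_for_update(catalog)

# Update only the Freightliner class; the Mack agent sees no change.
catalog.publish("freightliner", "collect: speed, engine_temp, brake_wear")
freightliner_updated = freightliner.check_for_update(catalog)
mack_updated = mack.check_for_update(catalog)
```

A pull model like this tolerates agents that are temporarily unreachable: they simply pick up the latest version for their class the next time they check in.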
KEY CHALLENGES WITH EDGE MANAGEMENT
Network availability
Offline storage
Agent heartbeat
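The three items above can be sketched together: the agent buffers readings while the network is down, and the hub flags agents whose heartbeats have stopped. This is a simplified illustration under assumed names (`HeartbeatMonitor`, `StoreAndForward`, the 30-second timeout, and the buffer capacity are all made up for the example), not a description of a specific product.

```python
from collections import deque
from typing import Callable, Deque, Dict, List


class HeartbeatMonitor:
    """Hub side: an agent counts as offline after `timeout_s` without a heartbeat."""

    def __init__(self, timeout_s: float) -> None:
        self.timeout_s = timeout_s
        self._last_seen: Dict[str, float] = {}

    def beat(self, agent_id: str, now: float) -> None:
        self._last_seen[agent_id] = now

    def offline_agents(self, now: float) -> List[str]:
        return sorted(a for a, t in self._last_seen.items() if now - t > self.timeout_s)


class StoreAndForward:
    """Agent side: buffer readings locally while the network is down."""

    def __init__(self, send: Callable[[dict], bool], capacity: int) -> None:
        self._send = send
        # Bounded buffer: when full, the oldest reading is dropped.
        self._buffer: Deque[dict] = deque(maxlen=capacity)

    def submit(self, reading: dict) -> None:
        self._buffer.append(reading)
        self.flush()

    def flush(self) -> None:
        # A reading is removed only after a successful send.
        while self._buffer and self._send(self._buffer[0]):
            self._buffer.popleft()

    @property
    def pending(self) -> int:
        return len(self._buffer)


link = {"up": False}  # simulated network state
delivered: List[dict] = []


def send(reading: dict) -> bool:
    if link["up"]:
        delivered.append(reading)
    return link["up"]


agent = StoreAndForward(send, capacity=100)
agent.submit({"speed": 61})
agent.submit({"speed": 64})
pending_while_offline = agent.pending

link["up"] = True
agent.flush()

monitor = HeartbeatMonitor(timeout_s=30)
monitor.beat("truck-001", now=0)
monitor.beat("truck-002", now=25)
offline = monitor.offline_agents(now=40)
```

The bounded buffer is a deliberate trade-off for devices with limited edge storage: it caps disk or memory use at the cost of losing the oldest readings during a long outage.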
CLOUDERA DATAFLOW
EDGE MANAGEMENT
• Edge data collection powered by Apache MiNiFi
• MiNiFi – smaller footprint than NiFi
• Guaranteed delivery
• Data buffering
• Prioritized queuing
• Flow-specific QoS
• Data provenance
• Designed for extension
• C++ / Java agents
• TensorFlow support
• Designed for IoT
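To make "data buffering", "prioritized queuing" and "guaranteed delivery" a little more concrete, the sketch below combines them in a toy buffer: items are ordered by priority, and an item leaves the buffer only once delivery is acknowledged. `PrioritizedBuffer` and the payload strings are hypothetical, and this is not how MiNiFi itself is implemented; it only illustrates the concepts named on the slide.

```python
import heapq
import itertools
from typing import Callable, List, Tuple


class PrioritizedBuffer:
    """Hypothetical buffer: lower `priority` number goes first; FIFO within a priority."""

    def __init__(self) -> None:
        self._heap: List[Tuple[int, int, str]] = []
        self._seq = itertools.count()  # tie-breaker that preserves arrival order

    def put(self, priority: int, payload: str) -> None:
        heapq.heappush(self._heap, (priority, next(self._seq), payload))

    def drain(self, deliver: Callable[[str], bool]) -> int:
        """Deliver until the buffer is empty or a delivery fails.

        An item is removed only after `deliver` returns True, so a failed
        delivery is retried on the next drain.
        """
        sent = 0
        while self._heap:
            payload = self._heap[0][2]
            if not deliver(payload):
                break
            heapq.heappop(self._heap)
            sent += 1
        return sent

    def __len__(self) -> int:
        return len(self._heap)


buf = PrioritizedBuffer()
buf.put(5, "routine: fuel level")
buf.put(1, "alert: brake failure")
buf.put(5, "routine: tyre pressure")

received: List[str] = []
attempts = {"n": 0}


def flaky_deliver(payload: str) -> bool:
    attempts["n"] += 1
    if attempts["n"] == 2:  # simulate one dropped connection
        return False
    received.append(payload)
    return True


first_pass = buf.drain(flaky_deliver)
second_pass = buf.drain(flaky_deliver)
```

In this run the alert is delivered first even though it arrived second, and the reading whose delivery failed stays buffered until the next drain succeeds.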
INTRODUCING CLOUDERA EDGE MANAGEMENT (CEM)
What is CEM?
• A new product offering: an edge management solution consisting of edge agents + an edge management hub.
• What does CEM do? Manage, Control and Monitor edge agents to collect data from edge devices and push intelligence back to the edge.
• How does CEM do it? Develop, Deploy, Run & Monitor edge flow apps on thousands of edge devices.
CLOUDERA DATAFLOW
FLOW MANAGEMENT
• Web-based user interface
• Highly configurable
• Out-of-the-box data provenance
• Designed for extensibility
• Secure
• NiFi Registry
• DevOps support
• FDLC
• Versioning
• Deployment
280+ PROCESSORS FOR DEEPER ECOSYSTEM INTEGRATION
Hash, Extract, Merge, Duplicate, Scan, GeoEnrich, Replace, Convert, Split, Translate, Route Content, Route Context, Route Text, Control Rate, Distribute Load, Generate Table Fetch, Jolt Transform JSON, Prioritized Delivery, Encrypt, Tail, Evaluate, Execute, Fetch
HTTP, Syslog, Email, HTML, Image, HL7, FTP, UDP, XML, SFTP, AMQP, WebSocket
All Apache project logos are trademarks of the ASF and the respective projects.
CLOUDERA DATAFLOW
Streaming Analytics Reference Architecture
Kafka is Everywhere. Critical Component of Streaming Architectures
Kafka Producers, Kafka Topics, Kafka Topics, Kafka Consumers & Producers, Kafka Consumers
US West Fleet, US Central Fleet, US East Fleet (each: Truck Sensors, C++ Agent)
Data Flow Apps (Powered by NiFi)
Analytics App 1, Analytics App 2, Analytics App 3, Analytics App 4, Analytics App 5
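The producer, topic and consumer roles in this architecture can be illustrated with a toy in-memory log. This is not the Kafka client API: `InMemoryLog`, the topic name `truck-events`, and the group names are hypothetical. The sketch only shows the general idea that producers append to a topic while each consumer group reads at its own pace by tracking its own offset.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


class InMemoryLog:
    """Toy stand-in for a broker: append-only topics, per-consumer-group offsets."""

    def __init__(self) -> None:
        self._topics: Dict[str, List[str]] = defaultdict(list)
        self._offsets: Dict[Tuple[str, str], int] = defaultdict(int)

    def produce(self, topic: str, record: str) -> None:
        self._topics[topic].append(record)

    def poll(self, group: str, topic: str) -> List[str]:
        """Return records this group has not yet read and advance its offset."""
        start = self._offsets[(group, topic)]
        records = self._topics[topic][start:]
        self._offsets[(group, topic)] = start + len(records)
        return records


log = InMemoryLog()
log.produce("truck-events", "us-west: speed=61")
log.produce("truck-events", "us-east: speed=58")

app1_first = log.poll("analytics-app-1", "truck-events")
log.produce("truck-events", "us-central: speed=70")
app1_second = log.poll("analytics-app-1", "truck-events")
app2_first = log.poll("analytics-app-2", "truck-events")
```

Because records are not removed when read, a second analytics app can start later and still see the full history of the topic, independently of the first.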
Cloudera Streams Messaging Manager (SMM)
What is SMM?
• Kafka Management and Monitoring tool
• Cure the “Kafka Blindness”
• Single Monitoring Dashboard for all your Kafka Clusters across 4 entities
  – Broker
  – Producer
  – Topic
  – Consumer
• REST as a First Class Citizen
• Alerting
• Schema Management
• Integration with Schema Registry
CLOUDERA DATAFLOW
STREAMING ANALYTICS
• Pattern matching
• Predictive and Prescriptive Analytics
• Complex Event Processing
• Continuous & Real-time Insights
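As a small example of pattern matching over a stream, the sketch below raises an alert when a truck reports several consecutive over-limit speed readings within a time window. The function name `sustained_speeding`, the event layout, and the thresholds are assumptions made for illustration; a real deployment would typically express such a rule in a streaming engine rather than in plain Python.

```python
from collections import defaultdict, deque
from typing import Deque, Dict, Iterable, Iterator, Tuple

Event = Tuple[str, float, float]  # (truck_id, timestamp_s, speed_mph)


def sustained_speeding(
    events: Iterable[Event], limit: float, count: int, window_s: float
) -> Iterator[Tuple[str, float]]:
    """Yield (truck_id, timestamp) when a truck has `count` consecutive
    readings above `limit` within `window_s` seconds."""
    recent: Dict[str, Deque[float]] = defaultdict(deque)
    for truck_id, ts, speed in events:
        hits = recent[truck_id]
        if speed <= limit:
            hits.clear()  # the run of violations is broken
            continue
        hits.append(ts)
        while hits and ts - hits[0] > window_s:
            hits.popleft()  # forget violations outside the window
        if len(hits) >= count:
            yield truck_id, ts
            hits.clear()  # start a fresh run after alerting


stream = [
    ("truck-001", 0.0, 72.0),
    ("truck-002", 1.0, 80.0),
    ("truck-001", 5.0, 74.0),
    ("truck-002", 6.0, 55.0),   # resets truck-002
    ("truck-001", 9.0, 76.0),   # third consecutive violation within 10 s
    ("truck-002", 11.0, 81.0),
]
alerts = list(sustained_speeding(stream, limit=70.0, count=3, window_s=10.0))
```

State is kept per truck, so interleaved events from different trucks do not interfere with each other's runs.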
3 Kafka Analytics Access Patterns
Streaming Event Storage Substrate: Kafka Topics (Topic A, Topic B, Topic C, Topic D, Topic X)
Streaming Access Pattern
SQL Access Pattern: KAFKA SQL
OLAP Access Pattern: KAFKA OLAP
(Slide badges: "New")
CLOUDERA DATAFLOW
ENTERPRISE SERVICES
• Provisioning
• Management
• Monitoring
• Unified Security
• Single Sign-on
• Audit
• Compliance
• Edge-to-Enterprise Governance
CLOUDERA DATAFLOW
THANK YOU


Editor's Notes

  • #13, #16, #19, #22, #25, #27 (same note on each):
      – Data ingestion, transformation and routing done visually with no code using Apache NiFi & 260+ processors
      – Build streaming apps and analytics from edge to datalake / EDW using builder
      – Enable edge data collection and intelligence through MiNiFi agents
      – Support massive IoT infrastructures
      – Deliver perishable insights with pattern matching and Complex Event Processing (CEP) from real-time streams
      – Manage, monitor, secure and govern streaming data
  • #17, #26 (same note on each):
      – Web-based user interface: design, control, feedback & monitoring
      – Highly configurable: loss tolerant vs guaranteed delivery; low latency vs high throughput; dynamic prioritization; flow can be modified at runtime; back pressure
      – Data provenance: track dataflow from beginning to end
      – Designed for extension: build your own processors
      – Secure: SSL, SSH, HTTPS, etc.