SlideShare a Scribd company logo
Meet the Committers Lab
Preparation
3-May-2023
Š 2023 Cloudera, Inc. All rights reserved.
IMPORTANT NOTES
Guidance
Documentation
Daily Zoom Sessions
Examples
Ready Flows
Slack Channel
Flow Proctors
Shared Environment
Since it is shared environment,
each user has access to every
other users’ flow.
No production data should be
used.
We will stop your design
sessions after 4 hours of
inactivity.
Š 2023 Cloudera, Inc. All rights reserved.
SANDBOX FROM MAY 3, 2023 to MAY 9 MIDNIGHT, 2023
Sandbox will be destroyed at midnight EST May 9, 2023 before
May 10, 2023.
You must complete your item, Save and Download Your Flows
Before Then.
All data and code will be destroyed on the end of the trial
Submit your flow (CRN), video and text via this form.
Š 2023 Cloudera, Inc. All rights reserved.
NAVIGATE IN CHROME TO THE SHARED SANDBOX
https://login.cdpworkshops.cloudera.com/auth/realms/se-workshop-5
/protocol/saml/clients/cdp-sso
Š 2023 Cloudera, Inc. All rights reserved.
REGISTRATION
Click Register
Must use Recent
Chrome Browser
Meet the Committers Webinar_ Lab Preparation
Meet the Committers Webinar_ Lab Preparation
Meet the Committers Webinar_ Lab Preparation
Meet the Committers Webinar_ Lab Preparation
Š 2023 Cloudera, Inc. All rights reserved.
GETTING STARTED - GUIDED USE CASES
● Syslog to Kafka topic
● Reading and Filtering a Syslog Stream
● Writing Critical Syslog Events to Apache Iceberg
● Must use Recent Chrome Browser
Š 2023 Cloudera, Inc. All rights reserved.
BEST IN FLOW COMPETITION - BUILD & DOCUMENT A FLOW
A chance to win a $2,000 Amazon gift card.
A great way to get recognition.
Cloudera public award social media post.
Š 2023 Cloudera, Inc. All rights reserved.
BEST IN FLOW COMPETITION - BUILD & DOCUMENT A FLOW
A chance to win a $2,000 Amazon gift card.
A great way to get recognition.
Cloudera public award social media post.
Š 2023 Cloudera, Inc. All rights reserved.
WHAT TO BUILD?
You can extend or try one of our tutorials
You can extend or use one of our Ready Flows
You can connect to external resources (passwords are visible,
only use public data or examples)
Š 2023 Cloudera, Inc. All rights reserved.
FLOW REQUIREMENTS
The following are the requirements for the flow to be considered eligible for the competition:
1. The flow must be developed using the new DataFlow Designer in the DataFlow Service sandbox.
2. The flow must have at least one “source” data.
3. The flow must have at least one “destination” where the data is delivered
4. The flow must be functional, tested, and working using the Test session feature of the DataFlow Service. The
Data viewer should be used to inspect the data payload within the different flow steps.
5. The flow must be checked into the DataFlow Catalog, deployed using the deployment wizard, and validated that
it is correctly running.
6. Each submitted Flow must include the following additional details:
○ The CRN of the flow was checked into the flow catalog with a detailed description of the flow and use
case.
○ Link to a short blog describing the use case and the flow that was built and deployed using DataFlow
Designer
○ Link to a short video showing the flow running in the Flow Designer with the test session and data
traversing through flow. The Data viewer should be used to inspect the data payload within the different
flow steps.
○ Product feedback on the DataFlow Service.
Š 2023 Cloudera, Inc. All rights reserved.
Criteria Description
Complete Flow Artifacts The submitted flow entry contains all the required artifacts, including Flow CRN in the Catalog, a link to
the blog describing the use case and the flow, and a short video link showing the flow running with
data traversing through the flow.
Adheres to NiFi flow best
practices
Follows NiFi flow design best practices like record-oriented processors, controller services, and
parameters.
Showcases NiFi processing
capabilities
Showcases NiFi processing capabilities including protocol bridging, schema transformation, routing,
filtering, enrichment, compression, etc.
Universal Data Distribution The flow showcases multiple data sources and delivers data to multiple destinations.
Uses the latest NiFi processors
and controllers services
Showcases the latest NiFi processors in the latest Apache NiFi release: 1.20, 1.19, 1.18, 1.17,
including PutSnowflakeInternalStage, PutIceberg, UpdateDeltaLakeTable, Amazon ML Processors:
Amazon Web Services Polly, Textract, Translate, and Transcribe services, etc.
ReadyFlow The flow addresses a common data pipeline use case and can be reused by other users hence a good
candidate to be added to the ReadyFlow gallery.
Deployable The flow should be able to be deployed with minimum effort with the appropriate documentation (e.g.:
description of parameters in the parameter context, the blog details, etc..)
Š 2023 Cloudera, Inc. All rights reserved.
SANDBOX FLOW DEVELOPMENT BEST PRACTICES
Uniquely Name your
processors/ connections
with yourid_
Parameterize connection
information
Don’t use sensitive data
in sandbox
Don’t use or change other
people’s assets, only your
own
Š 2023 Cloudera, Inc. All rights reserved.
Don’t use or change other
people’s assets, only your
own
Reuse components via
Copy and Process Groups
We are here to help reach
out via Slack or Zoom.
SANDBOX FLOW DEVELOPMENT BEST PRACTICES
Š 2023 Cloudera, Inc. All rights reserved.
DAILY ZOOM
https://cloudera.zoom.us/j/964
60893376?pwd=eWZEVDhpZm
pFSDNRejFzMXkvcHpOdz09
Š 2023 Cloudera, Inc. All rights reserved.
SLACK CHANNEL
https://bestinflow.slack.com/join
/shared_invite/zt-1uj1ti8hc-8mnh
mbr_AbOCD7f~A68P0w#/shared
-invite/email
Š 2023 Cloudera, Inc. All rights reserved.
SOURCE CODE AND EXAMPLES
https://github.com/tspannhw/FLaN
K-DataFlows
Š 2023 Cloudera, Inc. All rights reserved.
Submit Your Flow
https://docs.google.com/forms/d/1Ku2KSDFoxJy45jiOWuLRDi9Trpgm-42aaxeAVwy-fpo
Š 2023 Cloudera, Inc. All rights reserved.
ADDITIONAL RESOURCES
Cloudera DataFlow Designer: The Key to Agile Data Pipeline Development
Streaming Data Ingestion into an Open Data Lakehouse Made Easy with
DataFlow Example
Cloudera DataFlow Designer: Kafka to Iceberg in Cloudera Data Warehouse
Serverless NiFi Flows with DataFlow Functions
DataFlow Functions Technical Demo
DataFlow Documentation
23
Š 2023 Cloudera, Inc. All rights reserved.
Marketing
Carolyn
Duby
Field CTO
Field
Meet the Data-In-Motion Team
Tim
Spann
Developer Advocate
Richard Walden
DIM SME Lead
Engineering Product
Chris
Joynt
Product Marketing
Joe
Witt
Engineering Leader
George
Vetticaden
Product Leader
Michael
Kohs
Product Owner for
DataFlow
Pierre
Villard
Product Owner for
DataFlow
Andre
Araujo
Product Owner for
Stream Processing
John Kuchmek
DIM SME Expert
Š 2023 Cloudera, Inc. All rights reserved.
WARNING
“Notwithstanding any contrary terms in the Agreement, Customer
acknowledges that information shared using the Trial Product is
in a shared environment with similarly situated customers. All
information in the shared environment is accessible by all other
customers participating in the trial and such information will not
be deemed Confidential Information.”
https://www.cloudera.com/legal/commercial-terms-and-conditio
ns/cdp-public-cloud-trial-agreement.html
Š 2023 Cloudera, Inc. All rights reserved.
CONTAINER BASED DATAFLOW
Flow Deployment Flow Monitoring
Allows easy flow deployment based
on NiFi 1.20 across CDP
environments (Dev, QA, Prod)
Dene and assign KPIs to your
flows
Easy NiFi version upgrades
Update/Add KPIs, Update
Parameters, Change sizing
conguration
Automatic infrastructure scaling
based on CPU utilization
Central monitoring console for all
your flows across environments
Monitor flow metrics and
infrastructure usage
Define alerts for flows breaching
assigned KPIs
Flow Catalog
Keep track of your flow definitions
and versions in a central catalog
Reuse your existing NiFi flows by
uploading them to the catalog
Discover, search and reuse existing
flows easily
26
Š 2023 Cloudera, Inc. All rights reserved.
FLOW CATALOG
• Central repository for flow
denitions
• Import existing NiFi flows
• Manage flow definitions
• Initiate flow deployments
27
Š 2023 Cloudera, Inc. All rights reserved.
TURNS FLOW
DEFINITIONS
INTO FLOW
DEPLOYMENTS
2.) NiFi Cong
4.) Congure Sizing & Scaling 5.) Dene KPIs
1.) Start Deployment Wizard
3.) Provide Parameters for NiFi
28
Š 2023 Cloudera, Inc. All rights reserved.
KEY
PERFORMANCE
INDICATORS
• Visibility into flow deployments
• Track high level flow
performance
• Track in-depth NiFi component
metrics
• Defined in Deployment Wizard
• Monitoring & Alerts in
Deployment Details
KPI Denition in Deployment Wizard KPI Monitoring
29
Š 2023 Cloudera, Inc. All rights reserved.
DASHBOARD
• Central Monitoring View
• Monitors flow deployments
across CDP environments
• Monitors flow deployment
health & performance
• Drill into flow deployment to
monitor system metrics and
deployment events
30
Š 2023 Cloudera, Inc. All rights reserved.
DEPLOYMENT
MANAGER
• Manage flow deployment
lifecycle
(Suspend/Start/Terminate)
• Add/Edit KPIs
• Change sizing configuration
• Update parameters
• Change NiFi version of the
deployment
• Gateway to NiFi canvas
Meet the Committers Webinar_ Lab Preparation
32
Š 2023 Cloudera, Inc. All rights reserved.
TH N Y U

More Related Content

PDF
Best Practices For Workflow
PDF
AIDevWorldApacheNiFi101
PDF
Unconference Round Table Notes
PDF
The Never Landing Stream with HTAP and Streaming
PDF
OSSFinance_UnlockingFinancialDatawithReal-TimePipelines.pdf
PPTX
Partner Briefing_January 25 (FINAL).pptx
PDF
Cloudera Sandbox Event Guidelines For Workflow
PDF
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023
Best Practices For Workflow
AIDevWorldApacheNiFi101
Unconference Round Table Notes
The Never Landing Stream with HTAP and Streaming
OSSFinance_UnlockingFinancialDatawithReal-TimePipelines.pdf
Partner Briefing_January 25 (FINAL).pptx
Cloudera Sandbox Event Guidelines For Workflow
Future of Data Milwaukee Meetup Streaming Data Pipeline Development 28 June 2023

Similar to Meet the Committers Webinar_ Lab Preparation (20)

PDF
Meetup Streaming Data Pipeline Development
PDF
Implement a Universal Data Distribution Architecture to Manage All Streaming ...
PDF
Using Apache NiFi with Apache Pulsar for Fast Data On-Ramp
PDF
Building Real-time Pipelines with FLaNK_ A Case Study with Transit Data
PDF
Introduction to Apache NiFi 1.10
PDF
Cloudera streaming with flink oct 29, 2020 meetup london
PDF
AIDEVDAY_ Data-in-Motion to Supercharge AI
PDF
GSJUG: Mastering Data Streaming Pipelines 09May2023
PDF
WarsawITDays_ ApacheNiFi202
PDF
BYOP: Custom Processor Development with Apache NiFi
PPTX
Introducing Cloudera DataFlow (CDF) 2.13.19
PDF
Building Real-Time Travel Alerts
PDF
Meetup - Brasil - Data In Motion - 2023 September 19
PDF
Meetup - Brasil - Data In Motion - 2023 September 19
PDF
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
PDF
Budapest Data/ML - Building Modern Data Streaming Apps with NiFi, Flink and K...
PPTX
Spark+flume seattle
PDF
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
PDF
Conf42-Python-Building Apache NiFi 2.0 Python Processors
PDF
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
Meetup Streaming Data Pipeline Development
Implement a Universal Data Distribution Architecture to Manage All Streaming ...
Using Apache NiFi with Apache Pulsar for Fast Data On-Ramp
Building Real-time Pipelines with FLaNK_ A Case Study with Transit Data
Introduction to Apache NiFi 1.10
Cloudera streaming with flink oct 29, 2020 meetup london
AIDEVDAY_ Data-in-Motion to Supercharge AI
GSJUG: Mastering Data Streaming Pipelines 09May2023
WarsawITDays_ ApacheNiFi202
BYOP: Custom Processor Development with Apache NiFi
Introducing Cloudera DataFlow (CDF) 2.13.19
Building Real-Time Travel Alerts
Meetup - Brasil - Data In Motion - 2023 September 19
Meetup - Brasil - Data In Motion - 2023 September 19
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Budapest Data/ML - Building Modern Data Streaming Apps with NiFi, Flink and K...
Spark+flume seattle
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
Conf42-Python-Building Apache NiFi 2.0 Python Processors
26Oct2023_Adding Generative AI to Real-Time Streaming Pipelines_ NYC Meetup
Ad

More from Timothy Spann (20)

PDF
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
PDF
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
PDF
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
PDF
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
PDF
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
PDF
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
PDF
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
PDF
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
PDF
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
PDF
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
PPTX
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
PDF
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
PDF
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
PDF
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
PDF
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
PDF
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
PDF
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
PDF
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
PDF
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
PDF
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
Ad

Recently uploaded (20)

PDF
AI Guide for Business Growth - Arna Softech
PPTX
Computer Software and OS of computer science of grade 11.pptx
PPTX
Advanced SystemCare Ultimate Crack + Portable (2025)
PDF
DuckDuckGo Private Browser Premium APK for Android Crack Latest 2025
PDF
Topaz Photo AI Crack New Download (Latest 2025)
DOCX
How to Use SharePoint as an ISO-Compliant Document Management System
PDF
Visual explanation of Dijkstra's Algorithm using Python
PPTX
Monitoring Stack: Grafana, Loki & Promtail
PPTX
Why Generative AI is the Future of Content, Code & Creativity?
PPTX
GSA Content Generator Crack (2025 Latest)
PPTX
Weekly report ppt - harsh dattuprasad patel.pptx
PDF
Website Design Services for Small Businesses.pdf
PDF
How Tridens DevSecOps Ensures Compliance, Security, and Agility
PDF
DNT Brochure 2025 – ISV Solutions @ D365
PPTX
Computer Software - Technology and Livelihood Education
PDF
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
PPTX
Tech Workshop Escape Room Tech Workshop
PDF
Microsoft Office 365 Crack Download Free
PPTX
Introduction to Windows Operating System
PDF
Cost to Outsource Software Development in 2025
AI Guide for Business Growth - Arna Softech
Computer Software and OS of computer science of grade 11.pptx
Advanced SystemCare Ultimate Crack + Portable (2025)
DuckDuckGo Private Browser Premium APK for Android Crack Latest 2025
Topaz Photo AI Crack New Download (Latest 2025)
How to Use SharePoint as an ISO-Compliant Document Management System
Visual explanation of Dijkstra's Algorithm using Python
Monitoring Stack: Grafana, Loki & Promtail
Why Generative AI is the Future of Content, Code & Creativity?
GSA Content Generator Crack (2025 Latest)
Weekly report ppt - harsh dattuprasad patel.pptx
Website Design Services for Small Businesses.pdf
How Tridens DevSecOps Ensures Compliance, Security, and Agility
DNT Brochure 2025 – ISV Solutions @ D365
Computer Software - Technology and Livelihood Education
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
Tech Workshop Escape Room Tech Workshop
Microsoft Office 365 Crack Download Free
Introduction to Windows Operating System
Cost to Outsource Software Development in 2025

Meet the Committers Webinar_ Lab Preparation

  • 1. Meet the Committers Lab Preparation 3-May-2023
  • 2. Š 2023 Cloudera, Inc. All rights reserved. IMPORTANT NOTES Guidance Documentation Daily Zoom Sessions Examples Ready Flows Slack Channel Flow Proctors Shared Environment Since it is shared environment, each user has access to every other users’ flow. No production data should be used. We will stop your design sessions after 4 hours of inactivity.
  • 3. Š 2023 Cloudera, Inc. All rights reserved. SANDBOX FROM MAY 3, 2023 to MAY 9 MIDNIGHT, 2023 Sandbox will be destroyed at midnight EST May 9, 2023 before May 10, 2023. You must complete your item, Save and Download Your Flows Before Then. All data and code will be destroyed on the end of the trial Submit your flow (CRN), video and text via this form.
  • 4. Š 2023 Cloudera, Inc. All rights reserved. NAVIGATE IN CHROME TO THE SHARED SANDBOX https://login.cdpworkshops.cloudera.com/auth/realms/se-workshop-5 /protocol/saml/clients/cdp-sso
  • 5. Š 2023 Cloudera, Inc. All rights reserved. REGISTRATION Click Register Must use Recent Chrome Browser
  • 10. Š 2023 Cloudera, Inc. All rights reserved. GETTING STARTED - GUIDED USE CASES ● Syslog to Kafka topic ● Reading and Filtering a Syslog Stream ● Writing Critical Syslog Events to Apache Iceberg ● Must use Recent Chrome Browser
  • 11. Š 2023 Cloudera, Inc. All rights reserved. BEST IN FLOW COMPETITION - BUILD & DOCUMENT A FLOW A chance to win a $2,000 Amazon gift card. A great way to get recognition. Cloudera public award social media post.
  • 12. Š 2023 Cloudera, Inc. All rights reserved. BEST IN FLOW COMPETITION - BUILD & DOCUMENT A FLOW A chance to win a $2,000 Amazon gift card. A great way to get recognition. Cloudera public award social media post.
  • 13. Š 2023 Cloudera, Inc. All rights reserved. WHAT TO BUILD? You can extend or try one of our tutorials You can extend or use one of our Ready Flows You can connect to external resources (passwords are visible, only use public data or examples)
  • 14. Š 2023 Cloudera, Inc. All rights reserved. FLOW REQUIREMENTS The following are the requirements for the flow to be considered eligible for the competition: 1. The flow must be developed using the new DataFlow Designer in the DataFlow Service sandbox. 2. The flow must have at least one “source” data. 3. The flow must have at least one “destination” where the data is delivered 4. The flow must be functional, tested, and working using the Test session feature of the DataFlow Service. The Data viewer should be used to inspect the data payload within the different flow steps. 5. The flow must be checked into the DataFlow Catalog, deployed using the deployment wizard, and validated that it is correctly running. 6. Each submitted Flow must include the following additional details: ○ The CRN of the flow was checked into the flow catalog with a detailed description of the flow and use case. ○ Link to a short blog describing the use case and the flow that was built and deployed using DataFlow Designer ○ Link to a short video showing the flow running in the Flow Designer with the test session and data traversing through flow. The Data viewer should be used to inspect the data payload within the different flow steps. ○ Product feedback on the DataFlow Service.
  • 15. Š 2023 Cloudera, Inc. All rights reserved. Criteria Description Complete Flow Artifacts The submitted flow entry contains all the required artifacts, including Flow CRN in the Catalog, a link to the blog describing the use case and the flow, and a short video link showing the flow running with data traversing through the flow. Adheres to NiFi flow best practices Follows NiFi flow design best practices like record-oriented processors, controller services, and parameters. Showcases NiFi processing capabilities Showcases NiFi processing capabilities including protocol bridging, schema transformation, routing, filtering, enrichment, compression, etc. Universal Data Distribution The flow showcases multiple data sources and delivers data to multiple destinations. Uses the latest NiFi processors and controllers services Showcases the latest NiFi processors in the latest Apache NiFi release: 1.20, 1.19, 1.18, 1.17, including PutSnowflakeInternalStage, PutIceberg, UpdateDeltaLakeTable, Amazon ML Processors: Amazon Web Services Polly, Textract, Translate, and Transcribe services, etc. ReadyFlow The flow addresses a common data pipeline use case and can be reused by other users hence a good candidate to be added to the ReadyFlow gallery. Deployable The flow should be able to be deployed with minimum effort with the appropriate documentation (e.g.: description of parameters in the parameter context, the blog details, etc..)
  • 16. Š 2023 Cloudera, Inc. All rights reserved. SANDBOX FLOW DEVELOPMENT BEST PRACTICES Uniquely Name your processors/ connections with yourid_ Parameterize connection information Don’t use sensitive data in sandbox Don’t use or change other people’s assets, only your own
  • 17. Š 2023 Cloudera, Inc. All rights reserved. Don’t use or change other people’s assets, only your own Reuse components via Copy and Process Groups We are here to help reach out via Slack or Zoom. SANDBOX FLOW DEVELOPMENT BEST PRACTICES
  • 18. Š 2023 Cloudera, Inc. All rights reserved. DAILY ZOOM https://cloudera.zoom.us/j/964 60893376?pwd=eWZEVDhpZm pFSDNRejFzMXkvcHpOdz09
  • 19. Š 2023 Cloudera, Inc. All rights reserved. SLACK CHANNEL https://bestinflow.slack.com/join /shared_invite/zt-1uj1ti8hc-8mnh mbr_AbOCD7f~A68P0w#/shared -invite/email
  • 20. Š 2023 Cloudera, Inc. All rights reserved. SOURCE CODE AND EXAMPLES https://github.com/tspannhw/FLaN K-DataFlows
  • 21. Š 2023 Cloudera, Inc. All rights reserved. Submit Your Flow https://docs.google.com/forms/d/1Ku2KSDFoxJy45jiOWuLRDi9Trpgm-42aaxeAVwy-fpo
  • 22. Š 2023 Cloudera, Inc. All rights reserved. ADDITIONAL RESOURCES Cloudera DataFlow Designer: The Key to Agile Data Pipeline Development Streaming Data Ingestion into an Open Data Lakehouse Made Easy with DataFlow Example Cloudera DataFlow Designer: Kafka to Iceberg in Cloudera Data Warehouse Serverless NiFi Flows with DataFlow Functions DataFlow Functions Technical Demo DataFlow Documentation
  • 23. 23 Š 2023 Cloudera, Inc. All rights reserved. Marketing Carolyn Duby Field CTO Field Meet the Data-In-Motion Team Tim Spann Developer Advocate Richard Walden DIM SME Lead Engineering Product Chris Joynt Product Marketing Joe Witt Engineering Leader George Vetticaden Product Leader Michael Kohs Product Owner for DataFlow Pierre Villard Product Owner for DataFlow Andre Araujo Product Owner for Stream Processing John Kuchmek DIM SME Expert
  • 24. Š 2023 Cloudera, Inc. All rights reserved. WARNING “Notwithstanding any contrary terms in the Agreement, Customer acknowledges that information shared using the Trial Product is in a shared environment with similarly situated customers. All information in the shared environment is accessible by all other customers participating in the trial and such information will not be deemed Condential Information.” https://www.cloudera.com/legal/commercial-terms-and-conditio ns/cdp-public-cloud-trial-agreement.html
  • 25. Š 2023 Cloudera, Inc. All rights reserved. CONTAINER BASED DATAFLOW Flow Deployment Flow Monitoring Allows easy flow deployment based on NiFi 1.20 across CDP environments (Dev, QA, Prod) Dene and assign KPIs to your flows Easy NiFi version upgrades Update/Add KPIs, Update Parameters, Change sizing conguration Automatic infrastructure scaling based on CPU utilization Central monitoring console for all your flows across environments Monitor flow metrics and infrastructure usage Dene alerts for flows breaching assigned KPIs Flow Catalog Keep track of your flow denitions and versions in a central catalog Reuse your existing NiFi flows by uploading them to the catalog Discover, search and reuse existing flows easily
  • 26. 26 Š 2023 Cloudera, Inc. All rights reserved. FLOW CATALOG • Central repository for flow denitions • Import existing NiFi flows • Manage flow denitions • Initiate flow deployments
  • 27. 27 Š 2023 Cloudera, Inc. All rights reserved. TURNS FLOW DEFINITIONS INTO FLOW DEPLOYMENTS 2.) NiFi Cong 4.) Congure Sizing & Scaling 5.) Dene KPIs 1.) Start Deployment Wizard 3.) Provide Parameters for NiFi
  • 28. 28 Š 2023 Cloudera, Inc. All rights reserved. KEY PERFORMANCE INDICATORS • Visibility into flow deployments • Track high level flow performance • Track in-depth NiFi component metrics • Dened in Deployment Wizard • Monitoring & Alerts in Deployment Details KPI Denition in Deployment Wizard KPI Monitoring
  • 29. 29 Š 2023 Cloudera, Inc. All rights reserved. DASHBOARD • Central Monitoring View • Monitors flow deployments across CDP environments • Monitors flow deployment health & performance • Drill into flow deployment to monitor system metrics and deployment events
  • 30. 30 Š 2023 Cloudera, Inc. All rights reserved. DEPLOYMENT MANAGER • Manage flow deployment lifecycle (Suspend/Start/Terminate) • Add/Edit KPIs • Change sizing conguration • Update parameters • Change NiFi version of the deployment • Gateway to NiFi canvas
  • 32. 32 Š 2023 Cloudera, Inc. All rights reserved. TH N Y U