SlideShare a Scribd company logo
Connecting the Drops
with Apache NiFi & MiNiFi
Aldrin Piri – Apache NiFi PMC
@aldrinpiri
© Hortonworks Inc. 2011 – 2016. All Rights Reserved2
Agenda
Apache NiFi Fundamentals
Expanding the Reach of NiFi with Apache NiFi - MiNiFi
Evolving the NiFi Ecosystem
Apache NiFi Registry
Community
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Empower users to manage the
collection and flow of data
© Hortonworks Inc. 2011 – 2016. All Rights Reserved4
The Problem at Hand
Producers A.K.A Things
Anything
AND
Everything
Internet!
Consumers
• User
• Storage
• System
• …More Things
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Moving data effectively is hard
Standards: http://xkcd.com/927/
© Hortonworks Inc. 2011 – 2016. All Rights Reserved6
Apache NiFi: A Primer
Key Features and Principles
• Guaranteed delivery
• Data buffering
- Backpressure
- Pressure release
• Prioritized queuing
• Flow specific QoS
- Latency vs. throughput
- Loss tolerance
• Data provenance
• Recovery/recording
a rolling log of fine-grained
history
• Visual command and
control
• Flow templates
• Pluggable/multi-role
security
• Designed for extension
• Clustering
© Hortonworks Inc. 2011 – 2016. All Rights Reserved7
NiFi is based on Flow Based Programming (FBP)
FBP Term NiFi Term Description
Information
Packet
FlowFile Each object moving through the system.
Black Box FlowFile
Processor
Performs the work, doing some combination of data routing, transformation,
or mediation between systems.
Bounded
Buffer
Connection The linkage between processors, acting as queues and allowing various
processes to interact at differing rates.
Scheduler Flow
Controller
Maintains the knowledge of how processes are connected, and manages the
threads and allocations thereof which all processes use.
Subnet Process
Group
A set of processes and their connections, which can receive and send data via
ports. A process group allows creation of entirely new component simply by
composition of its components.
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
NiFi & Data Agnosticism
 NiFi is data agnostic!
 But, NiFi was designed understanding that users
can care about specifics and provides tooling
to interact with specific formats, protocols, etc.
ISO 8601 - http://xkcd.com/1179/
Robustness principle
Be conservative in what you do,
be liberal in what you accept from others“
© Hortonworks Inc. 2011 – 2016. All Rights Reserved9
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2011 – 2016. All Rights Reserved11
Apache NiFi - MiNiFi
 Let me get the key parts of NiFi close to where data begins
 Bidirectional data transfer
 Greater illuminate journey with provenance
 NiFi lives in the data center. Give it an enterprise server or a cluster of
them.
 MiNiFi lives as close to where data is born and is a guest on that device or
system
© Hortonworks Inc. 2011 – 2016. All Rights Reserved12
Apache NiFi - MiNiFi
 Limited computing capability
 Limited power/network
 Restricted software library/platform availability
 No UI
 Physically inaccessible
 Not frequently updated
 Competing standards/protocols
 Scalability
 Privacy & Security
Realities of computing outside the cozy datacenter
© Hortonworks Inc. 2011 – 2016. All Rights Reserved13
Apache NiFi - MiNiFi: Scoping
 Go small: Java – Write once, run anywhere*
– Feature parity and reuse of core NiFi libraries
 Go smaller: C++ – Write once**, run anywhere
 Go smallest: Write n-many times, embed, run anywhere
Language libraries to support tagging, FlowFile format, Site to Site protocol, and
provenance generation without a full processing framework
– Language SDKs, Mobile Platforms
Provide all the key principles of NiFi in varying, smaller footprints
© Hortonworks Inc. 2011 – 2016. All Rights Reserved14
Apache NiFi - MiNiFi: The Differences
 No UI / Declarative configuration
– Supports YAML
– Extensible interface to ingest other formats
 Reduced set of bundled components
 Minimize initial size
Departures from NiFi
© Hortonworks Inc. 2011 – 2016. All Rights Reserved15
Apache NiFi - MiNiFi: Centralized Command & Control (C2)
 Provide flow updates, information and assets to instances where they live
 Act as a gateway to/from network enclaves
 Provide a user interface/experience for design & deploy and monitoring
Extend the reach of user experience and operations
© Hortonworks Inc. 2011 – 2016. All Rights Reserved16
Connecting the Drops
SOURCES
REGIONAL
INFRASTRUCTURE
CORE
INFRASTRUCTURE
© Hortonworks Inc. 2011 – 2016. All Rights Reserved17
Managing data flow for a courier service
Physical Store
Gateway
Server
Mobile Devices
Registers
Server Cluster
Distribution Center
Kafka
Core Data Center at HQ
Server Cluster
Others
Storm / Spark /
Flink / Apex
Kafka
Storm / Spark / Flink / Apex
On Delivery Routes
Trucks Deliverers
Delivery Truck: Creative Stall, https://thenounproject.com/creativestall/
Deliverer: Rigo Peter, https://thenounproject.com/rigo/
Cash Register: Sergey Patutin, https://thenounproject.com/bdesign.by/
Hand Scanner: Eric Pearson, https://thenounproject.com/epearson001/
Client
Libraries
Client
Libraries
MiNiFi
MiNiFi
NiFi NiFi NiFi NiFi NiFi NiFi
Client
Libraries
© Hortonworks Inc. 2011 – 2016. All Rights Reserved18
Evolving the NiFi Platform
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Listening to our community
How can I … How do I ... What about ...
 Version my flows?
 Drive CI/CD processes?
 Migrate flows between environments?
 Provision distributions of NiFi with a set of components?
 Make reference datasets/extensions available to the entirety of my data
flow?
 Certify / Audit / Sign-off on flows as compliant per regulations?
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Capturing the essence of a flow in your organization
 The n-dimensions of data flow
 Consider a flowfile to be a singular event at a given juncture in its processing
 A flow is the directed graph of processing at a given point in time
 With each component’s:
 Configuration
 Version
 Referenced Assets
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Introducing
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
Registry is an enabler
 SDLC
 Manage variables, sensitive properties for environments
 Extension Registry
 Association/tagging of data with the flow that created it
 Enhanced Command and Control of MiNiFi instances
© Hortonworks Inc. 2011 – 2016. All Rights Reserved
The Evolution of Apache NiFi
 Our core substrate for data flow is NiFi & MiNiFi
 Command and Control facilitates operations and management of components
 Registry for common tasks with disparate resources across the NiFi ecosystem
© Hortonworks Inc. 2011 – 2016. All Rights Reserved30
Why the Apache NiFi Ecosystem?
 Moving data is multifaceted in its challenges and these are present in different contexts
at varying scopes
 Provide components and a platform with common tooling and extensions that are
commonly needed but be flexible for extension in all aspects
– Allow organizations to integrate with their existing infrastructure
 Empower folks managing your infrastructure to make changes and reason about issues
that are occurring
– Data Provenance to show context and data’s journey
– User Interface/Experience a key component
© Hortonworks Inc. 2011 – 2016. All Rights Reserved31
Community
© Hortonworks Inc. 2011 – 2016. All Rights Reserved32
Apache NiFi Crash Course
Wednesday, 14 June
11:00 AM – 1:30PM, Room LL21A
• Learn more about NiFi, the community, and work through a hands-on lab
• Seats available on a first come, first served basis
• Make sure you are in possession of the latest version of VirtualBox
© Hortonworks Inc. 2011 – 2016. All Rights Reserved33
Learn, Share at Birds of a Feather
IOT, STREAMING & DATA FLOW
Thursday, June 15
5:50 pm, Ballroom C
© Hortonworks Inc. 2011 – 2016. All Rights Reserved34
Learn more and join us!
Project Sites:
NiFi: https://nifi.apache.org
Subproject MiNiFi: https://nifi.apache.org/minifi/
Subproject Registry: http://nifi.apache.org/registry.html
Subscribe to and collaborate at
dev@nifi.apache.org
users@nifi.apache.org
Submit Ideas or Issues
https://issues.apache.org/jira/browse/NIFI
https://issues.apache.org/jira/browse/MINIFI
Follow us on Twitter
@apachenifi
© Hortonworks Inc. 2011 – 2016. All Rights Reserved35
Thank You

More Related Content

PDF
Intelligently collecting data at the edge—intro to Apache MiNiFi
PPTX
Bytecode manipulation with Javassist for fun and profit
PDF
OpenStack Neutron IPv6 Lessons
PPTX
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
PDF
OpenStack Architecture
PDF
Apache Nifi Crash Course
PDF
Model driven telemetry
PDF
Large scale overlay networks with ovn: problems and solutions
Intelligently collecting data at the edge—intro to Apache MiNiFi
Bytecode manipulation with Javassist for fun and profit
OpenStack Neutron IPv6 Lessons
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
OpenStack Architecture
Apache Nifi Crash Course
Model driven telemetry
Large scale overlay networks with ovn: problems and solutions

What's hot (20)

PPTX
Palo alto NGfw2023.pptx
PPTX
Integrating Apache Spark and NiFi for Data Lakes
PDF
プログラマ目線から見たRDMAのメリットと その応用例について
PDF
about Tcpreplay
PDF
Deploy MPLS Traffic Engineering
PDF
Unlock Value from Big Data with Apache NiFi and Streaming CDC
PDF
464XLAT Tutorial
PPTX
Hadoop Summit Tokyo Apache NiFi Crash Course
PPTX
Canonical data model
PDF
Room 1 - 5 - Thủy Đặng - Load balancing k8s services on baremetal with Cilium...
PPTX
Breakout - Airheads Macau 2013 - Top 10 Tips from Aruba TAC
PDF
Deploying IPv6 on OpenStack
PDF
Brkdcn 2035 multi-x
PDF
NETCONFとYANGの話
PDF
Power Platform Architecture Corrections
PDF
CDNの仕組み(JANOG36)
PDF
Trouble Ticket Integration with Zabbix in Large Environment
PPTX
Seller Presentation - Power Systems Power Virtual Server.PPTX
PDF
Segment Routing Technology Deep Dive and Advanced Use Cases
Palo alto NGfw2023.pptx
Integrating Apache Spark and NiFi for Data Lakes
プログラマ目線から見たRDMAのメリットと その応用例について
about Tcpreplay
Deploy MPLS Traffic Engineering
Unlock Value from Big Data with Apache NiFi and Streaming CDC
464XLAT Tutorial
Hadoop Summit Tokyo Apache NiFi Crash Course
Canonical data model
Room 1 - 5 - Thủy Đặng - Load balancing k8s services on baremetal with Cilium...
Breakout - Airheads Macau 2013 - Top 10 Tips from Aruba TAC
Deploying IPv6 on OpenStack
Brkdcn 2035 multi-x
NETCONFとYANGの話
Power Platform Architecture Corrections
CDNの仕組み(JANOG36)
Trouble Ticket Integration with Zabbix in Large Environment
Seller Presentation - Power Systems Power Virtual Server.PPTX
Segment Routing Technology Deep Dive and Advanced Use Cases
Ad

Similar to Connecting the Drops with Apache NiFi & Apache MiNiFi (20)

PPTX
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
PPTX
State of the Apache NiFi Ecosystem & Community
PDF
Dataflow Management From Edge to Core with Apache NiFi
PDF
Apache Nifi Crash Course
PDF
Dataflow Management From Edge to Core with Apache NiFi
PPTX
Apache NiFi Crash Course Intro
PPTX
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
PPTX
NJ Hadoop Meetup - Apache NiFi Deep Dive
PPTX
The Avant-garde of Apache NiFi
PPTX
The Avant-garde of Apache NiFi
PDF
Devnexus 2018 - Let Your Data Flow with Apache NiFi
PDF
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
PDF
The First Mile -- Edge and IoT Data Collection with Apache NiFi and MiNiFi
PPTX
Apache NiFi Crash Course - San Jose Hadoop Summit
PDF
Intelligently Collecting Data at the Edge – Intro to Apache MiNiFi
PDF
Nifi workshop
PDF
The First Mile – Edge and IoT Data Collection with Apache NiFi and MiNiFi
PPTX
MiNiFi 0.0.1 MeetUp talk
PPTX
Apache NiFi- MiNiFi meetup Slides
PDF
Taking DataFlow Management to the Edge with Apache NiFi/MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
State of the Apache NiFi Ecosystem & Community
Dataflow Management From Edge to Core with Apache NiFi
Apache Nifi Crash Course
Dataflow Management From Edge to Core with Apache NiFi
Apache NiFi Crash Course Intro
Big Data Day LA 2016/ Big Data Track - Building scalable enterprise data flow...
NJ Hadoop Meetup - Apache NiFi Deep Dive
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
Devnexus 2018 - Let Your Data Flow with Apache NiFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile -- Edge and IoT Data Collection with Apache NiFi and MiNiFi
Apache NiFi Crash Course - San Jose Hadoop Summit
Intelligently Collecting Data at the Edge – Intro to Apache MiNiFi
Nifi workshop
The First Mile – Edge and IoT Data Collection with Apache NiFi and MiNiFi
MiNiFi 0.0.1 MeetUp talk
Apache NiFi- MiNiFi meetup Slides
Taking DataFlow Management to the Edge with Apache NiFi/MiNiFi
Ad

More from DataWorks Summit (20)

PPTX
Data Science Crash Course
PPTX
Floating on a RAFT: HBase Durability with Apache Ratis
PPTX
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
PDF
HBase Tales From the Trenches - Short stories about most common HBase operati...
PPTX
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
PPTX
Managing the Dewey Decimal System
PPTX
Practical NoSQL: Accumulo's dirlist Example
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
PPTX
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
PPTX
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
PPTX
Supporting Apache HBase : Troubleshooting and Supportability Improvements
PPTX
Security Framework for Multitenant Architecture
PDF
Presto: Optimizing Performance of SQL-on-Anything Engine
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
PPTX
Extending Twitter's Data Platform to Google Cloud
PPTX
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
PPTX
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
PDF
Computer Vision: Coming to a Store Near You
PPTX
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Data Science Crash Course
Floating on a RAFT: HBase Durability with Apache Ratis
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
HBase Tales From the Trenches - Short stories about most common HBase operati...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Managing the Dewey Decimal System
Practical NoSQL: Accumulo's dirlist Example
HBase Global Indexing to support large-scale data ingestion at Uber
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Security Framework for Multitenant Architecture
Presto: Optimizing Performance of SQL-on-Anything Engine
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Extending Twitter's Data Platform to Google Cloud
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Computer Vision: Coming to a Store Near You
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark

Recently uploaded (20)

DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Encapsulation theory and applications.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Big Data Technologies - Introduction.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
The AUB Centre for AI in Media Proposal.docx
MIND Revenue Release Quarter 2 2025 Press Release
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Encapsulation theory and applications.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Big Data Technologies - Introduction.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Programs and apps: productivity, graphics, security and other tools
Encapsulation_ Review paper, used for researhc scholars
Machine learning based COVID-19 study performance prediction
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Reach Out and Touch Someone: Haptics and Empathic Computing
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
The Rise and Fall of 3GPP – Time for a Sabbatical?

Connecting the Drops with Apache NiFi & Apache MiNiFi

  • 1. Connecting the Drops with Apache NiFi & MiNiFi Aldrin Piri – Apache NiFi PMC @aldrinpiri
  • 2. © Hortonworks Inc. 2011 – 2016. All Rights Reserved2 Agenda Apache NiFi Fundamentals Expanding the Reach of NiFi with Apache NiFi - MiNiFi Evolving the NiFi Ecosystem Apache NiFi Registry Community
  • 3. © Hortonworks Inc. 2011 – 2016. All Rights Reserved Empower users to manage the collection and flow of data
  • 4. © Hortonworks Inc. 2011 – 2016. All Rights Reserved4 The Problem at Hand Producers A.K.A Things Anything AND Everything Internet! Consumers • User • Storage • System • …More Things
  • 5. © Hortonworks Inc. 2011 – 2016. All Rights Reserved Moving data effectively is hard Standards: http://xkcd.com/927/
  • 6. © Hortonworks Inc. 2011 – 2016. All Rights Reserved6 Apache NiFi: A Primer Key Features and Principles • Guaranteed delivery • Data buffering - Backpressure - Pressure release • Prioritized queuing • Flow specific QoS - Latency vs. throughput - Loss tolerance • Data provenance • Recovery/recording a rolling log of fine-grained history • Visual command and control • Flow templates • Pluggable/multi-role security • Designed for extension • Clustering
  • 7. © Hortonworks Inc. 2011 – 2016. All Rights Reserved7 NiFi is based on Flow Based Programming (FBP) FBP Term NiFi Term Description Information Packet FlowFile Each object moving through the system. Black Box FlowFile Processor Performs the work, doing some combination of data routing, transformation, or mediation between systems. Bounded Buffer Connection The linkage between processors, acting as queues and allowing various processes to interact at differing rates. Scheduler Flow Controller Maintains the knowledge of how processes are connected, and manages the threads and allocations thereof which all processes use. Subnet Process Group A set of processes and their connections, which can receive and send data via ports. A process group allows creation of entirely new component simply by composition of its components.
  • 8. © Hortonworks Inc. 2011 – 2016. All Rights Reserved NiFi & Data Agnosticism  NiFi is data agnostic!  But, NiFi was designed understanding that users can care about specifics and provides tooling to interact with specific formats, protocols, etc. ISO 8601 - http://xkcd.com/1179/ Robustness principle Be conservative in what you do, be liberal in what you accept from others“
  • 9. © Hortonworks Inc. 2011 – 2016. All Rights Reserved9
  • 10. © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 11. © Hortonworks Inc. 2011 – 2016. All Rights Reserved11 Apache NiFi - MiNiFi  Let me get the key parts of NiFi close to where data begins  Bidirectional data transfer  Greater illuminate journey with provenance  NiFi lives in the data center. Give it an enterprise server or a cluster of them.  MiNiFi lives as close to where data is born and is a guest on that device or system
  • 12. © Hortonworks Inc. 2011 – 2016. All Rights Reserved12 Apache NiFi - MiNiFi  Limited computing capability  Limited power/network  Restricted software library/platform availability  No UI  Physically inaccessible  Not frequently updated  Competing standards/protocols  Scalability  Privacy & Security Realities of computing outside the cozy datacenter
  • 13. © Hortonworks Inc. 2011 – 2016. All Rights Reserved13 Apache NiFi - MiNiFi: Scoping  Go small: Java – Write once, run anywhere* – Feature parity and reuse of core NiFi libraries  Go smaller: C++ – Write once**, run anywhere  Go smallest: Write n-many times, embed, run anywhere Language libraries to support tagging, FlowFile format, Site to Site protocol, and provenance generation without a full processing framework – Language SDKs, Mobile Platforms Provide all the key principles of NiFi in varying, smaller footprints
  • 14. © Hortonworks Inc. 2011 – 2016. All Rights Reserved14 Apache NiFi - MiNiFi: The Differences  No UI / Declarative configuration – Supports YAML – Extensible interface to ingest other formats  Reduced set of bundled components  Minimize initial size Departures from NiFi
  • 15. © Hortonworks Inc. 2011 – 2016. All Rights Reserved15 Apache NiFi - MiNiFi: Centralized Command & Control (C2)  Provide flow updates, information and assets to instances where they live  Act as a gateway to/from network enclaves  Provide a user interface/experience for design & deploy and monitoring Extend the reach of user experience and operations
  • 16. © Hortonworks Inc. 2011 – 2016. All Rights Reserved16 Connecting the Drops SOURCES REGIONAL INFRASTRUCTURE CORE INFRASTRUCTURE
  • 17. © Hortonworks Inc. 2011 – 2016. All Rights Reserved17 Managing data flow for a courier service Physical Store Gateway Server Mobile Devices Registers Server Cluster Distribution Center Kafka Core Data Center at HQ Server Cluster Others Storm / Spark / Flink / Apex Kafka Storm / Spark / Flink / Apex On Delivery Routes Trucks Deliverers Delivery Truck: Creative Stall, https://thenounproject.com/creativestall/ Deliverer: Rigo Peter, https://thenounproject.com/rigo/ Cash Register: Sergey Patutin, https://thenounproject.com/bdesign.by/ Hand Scanner: Eric Pearson, https://thenounproject.com/epearson001/ Client Libraries Client Libraries MiNiFi MiNiFi NiFi NiFi NiFi NiFi NiFi NiFi Client Libraries
  • 18. © Hortonworks Inc. 2011 – 2016. All Rights Reserved18 Evolving the NiFi Platform
  • 19. © Hortonworks Inc. 2011 – 2016. All Rights Reserved Listening to our community How can I … How do I ... What about ...  Version my flows?  Drive CI/CD processes?  Migrate flows between environments?  Provision distributions of NiFi with a set of components?  Make reference datasets/extensions available to the entirety of my data flow?  Certify / Audit / Sign-off on flows as compliant per regulations?
  • 20. © Hortonworks Inc. 2011 – 2016. All Rights Reserved Capturing the essence of a flow in your organization  The n-dimensions of data flow  Consider a flowfile to be a singular event at a given juncture in its processing  A flow is the directed graph of processing at a given point in time  With each component’s:  Configuration  Version  Referenced Assets
  • 21. © Hortonworks Inc. 2011 – 2016. All Rights Reserved Introducing
  • 22. © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 23. © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 24. © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 25. © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 26. © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 27. © Hortonworks Inc. 2011 – 2016. All Rights Reserved
  • 28. © Hortonworks Inc. 2011 – 2016. All Rights Reserved Registry is an enabler  SDLC  Manage variables, sensitive properties for environments  Extension Registry  Association/tagging of data with the flow that created it  Enhanced Command and Control of MiNiFi instances
  • 29. © Hortonworks Inc. 2011 – 2016. All Rights Reserved The Evolution of Apache NiFi  Our core substrate for data flow is NiFi & MiNiFi  Command and Control facilitates operations and management of components  Registry for common tasks with disparate resources across the NiFi ecosystem
  • 30. © Hortonworks Inc. 2011 – 2016. All Rights Reserved30 Why the Apache NiFi Ecosystem?  Moving data is multifaceted in its challenges and these are present in different contexts at varying scopes  Provide components and a platform with common tooling and extensions that are commonly needed but be flexible for extension in all aspects – Allow organizations to integrate with their existing infrastructure  Empower folks managing your infrastructure to make changes and reason about issues that are occurring – Data Provenance to show context and data’s journey – User Interface/Experience a key component
  • 31. © Hortonworks Inc. 2011 – 2016. All Rights Reserved31 Community
  • 32. © Hortonworks Inc. 2011 – 2016. All Rights Reserved32 Apache NiFi Crash Course Wednesday, 14 June 11:00 AM – 1:30PM, Room LL21A • Learn more about NiFi, the community, and work through a hands-on lab • Seats available on a first come, first served basis • Make sure you are in possession of the latest version of VirtualBox
  • 33. © Hortonworks Inc. 2011 – 2016. All Rights Reserved33 Learn, Share at Birds of a Feather IOT, STREAMING & DATA FLOW Thursday, June 15 5:50 pm, Ballroom C
  • 34. © Hortonworks Inc. 2011 – 2016. All Rights Reserved34 Learn more and join us! Project Sites: NiFi: https://nifi.apache.org Subproject MiNiFi: https://nifi.apache.org/minifi/ Subproject Registry: http://nifi.apache.org/registry.html Subscribe to and collaborate at dev@nifi.apache.org users@nifi.apache.org Submit Ideas or Issues https://issues.apache.org/jira/browse/NIFI https://issues.apache.org/jira/browse/MINIFI Follow us on Twitter @apachenifi
  • 35. © Hortonworks Inc. 2011 – 2016. All Rights Reserved35 Thank You