SlideShare a Scribd company logo
Codeless Generative AI
Pipelines
(GenAI with Milvus)
Tim Spann
Principal Developer Advocate
Timothy Spann
Principal Developer Advocate
https://medium.com/@tspann
https://github.com/tspannhw
This week in Milvus, Towhee, Attu, Apache
NiFi, Apache Flink, Apache Kafka, ML, AI,
Apache Spark, Apache Iceberg, Python,
Java, LLM, GenAI, Vector DB and Open
Source friends.
https://flipstackweekly.com/
FLaNK-AIM Stack Weekly
https://www.meetup.com/unstructured-data-meetup-new-york/
https://www.meetup.com/pro/unstructureddata/
From Unstructured Data to Vector Databases to ML to Generative AI to Deep Learning to Data Science
Unstructured Data Meetup @ New York
DSSML24_tspann_CodelessGenerativeAIPipelines
AGENDA
Introduction
Overview
GenAI Architecture
Streaming Projects
Demos
Resources
Q&A
https://flankworkspace.slack.com/
https://join.slack.com/t/flankworkspac
e/shared_invite/zt-2fycjv241-~NRHZDt
dfwDjlfvXK_Bz0A
Join Our Slack and Interact with LLM
DataFlow Pipelines Can
Help
External Context Ingest
Ingesting, routing, clean, enrich, transforming,
parsing, chunking and vectorizing structured,
unstructured, semistructured, binary data and
documents
Prompt engineering
Crafting and structuring queries to optimize
LLM responses
Context Retrieval
Enhancing LLM with external context such as
Retrieval Augmented Generation (RAG)
Roundtrip Interface
Act as a Discord, REST, Kafka, SQL, Slack bot to
roundtrip discussions
https://medium.com/cloudera-inc/getting-ready-for-apache-nifi-2-0-5a5e6a67f450
NiFi 2.0.0 Features
● Python Integration
● Parameters
● JDK 21+
● JSON Flow Serialization
● Rules Engine for Development Assistance
● Run Process Group as Stateless
● flow.json.gz
https://cwiki.apache.org/confluence/display/NIFI/NiFi+2.0+Release+Goals
Extract Company Names
● Python 3.10+
● Hugging Face, NLP, SpaCY, PyTorch
https://github.com/tspannhw/FLaNK-python-ExtractCompanyName-processor
CaptionImage
● Python 3.10+
● Hugging Face
● Salesforce/blip-image-captioning-large
● Generate Captions for Images
● Adds captions to FlowFile Attributes
● Does not require download or copies of
your images
https://github.com/tspannhw/FLaNK-python-processors
RESNetImageClassification
● Python 3.10+
● Hugging Face
● Transformers
● Pytorch
● Datasets
● microsoft/resnet-50
● Adds classification label to FlowFile
Attributes
● Does not require download or copies of
your images
https://github.com/tspannhw/FLaNK-python-processors
NSFWImageDetection
● Python 3.10+
● Hugging Face
● Transformers
● Falconsai/nsfw_image_detection
● Adds normal and nsfw to FlowFile
Attributes
● Gives score on safety of image
● Does not require download or copies of
your images
https://github.com/tspannhw/FLaNK-python-processors
FacialEmotionsImageDetection
● Python 3.10+
● Hugging Face
● Transformers
● facial_emotions_image_detection
● Image Classification
● Adds labels/scores to FlowFile Attributes
● Does not require download or copies of
your images
https://github.com/tspannhw/FLaNK-python-processors
Let’s do a metamorphosis on your data. Don’t fear changing data.
You don’t need to be a brilliant writer to stream
data.
Franz Kafka was a German-speaking
Bohemian novelist and short-story writer,
widely regarded as one of the major figures
of 20th-century literature. His work fuses
elements of realism and the fantastic.
Wikipedia
YES, FRANZ, IT’S KAFKA
Open Source Edition
•Apache NiFi in
Docker
•Try new features
quickly
•Develop applications
locally
● Docker NiFi
○ docker run --name nifi -p 8443:8443 -d -e
SINGLE_USER_CREDENTIALS_USERNAME=admin -e
SINGLE_USER_CREDENTIALS_PASSWORD=ctsBtRBKHRAx69EqUgh
vvgEvjnaLjFEB apache/nifi:latest
● Licensed under the ASF License
● Unsupported
● NiFi 1.26 and NiFi 2.0.0-M3
https://hub.docker.com/r/apache/nifi
https://medium.com/cloudera-inc/streaming-street-cams-to-yolo-v8-with-python-and-nifi-to-minio-s3-3277e73723ce
Street Cameras
Dziękujemy za uwagę
Zapraszamy do zadawania pytań
oraz oceny wystąpienia pod nagraniem.

More Related Content

PDF
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
PDF
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
PDF
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
PDF
Using the flipn stack for edge ai (flink, nifi, pulsar)
PDF
Using the FLiPN Stack for Edge AI (Flink, NiFi, Pulsar) - Pulsar Summit Asia ...
PDF
Using apache mx net in production deep learning streaming pipelines
PDF
Continuous SQL with Apache Streaming (FLaNK and FLiP)
PDF
Using the FLaNK Stack for edge ai (apache mxnet, apache flink, apache nifi, a...
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
Using the flipn stack for edge ai (flink, nifi, pulsar)
Using the FLiPN Stack for Edge AI (Flink, NiFi, Pulsar) - Pulsar Summit Asia ...
Using apache mx net in production deep learning streaming pipelines
Continuous SQL with Apache Streaming (FLaNK and FLiP)
Using the FLaNK Stack for edge ai (apache mxnet, apache flink, apache nifi, a...

Similar to DSSML24_tspann_CodelessGenerativeAIPipelines (20)

PDF
Full Stack Monitoring with Prometheus and Grafana
PDF
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
PDF
Origins of Serverless
PDF
FLiP Into Trino
PDF
Samsung SDS OpeniT - The possibility of Python
PDF
Building and deploying LLM applications with Apache Airflow
ODP
Intro To Spring Python
PDF
Ai dev world utilizing apache pulsar, apache ni fi and minifi for edgeai io...
PPTX
What is Full Stack Python Development? A Beginner’s Guide
PDF
ApacheCon 2021: Apache NiFi 101- introduction and best practices
PDF
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
PDF
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
PPT
PDF
Deploy Deep Learning Application with Azure Container Instance - Devdays2018
PDF
Real time cloud native open source streaming of any data to apache solr
PDF
The Rise of the DataOps - Dataiku - J On the Beach 2016
PPTX
Ads team12 final_project_presentation
PDF
Learning the basics of Apache NiFi for iot OSS Europe 2020
PDF
Big data made easy with a Spark
PDF
Anaconda and PyData Solutions
Full Stack Monitoring with Prometheus and Grafana
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
Origins of Serverless
FLiP Into Trino
Samsung SDS OpeniT - The possibility of Python
Building and deploying LLM applications with Apache Airflow
Intro To Spring Python
Ai dev world utilizing apache pulsar, apache ni fi and minifi for edgeai io...
What is Full Stack Python Development? A Beginner’s Guide
ApacheCon 2021: Apache NiFi 101- introduction and best practices
Demi Ben-Ari - Monitoring Big Data Systems Done "The Simple Way" - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Codemotion Milan 2017 - D...
Deploy Deep Learning Application with Azure Container Instance - Devdays2018
Real time cloud native open source streaming of any data to apache solr
The Rise of the DataOps - Dataiku - J On the Beach 2016
Ads team12 final_project_presentation
Learning the basics of Apache NiFi for iot OSS Europe 2020
Big data made easy with a Spark
Anaconda and PyData Solutions
Ad

More from Timothy Spann (20)

PDF
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
PDF
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
PDF
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
PDF
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
PDF
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
PDF
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
PPTX
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
PDF
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
PDF
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
PDF
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
PDF
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
PDF
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
PDF
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
PDF
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
PDF
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
PDF
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
PDF
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
PDF
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
PDF
09-18-2024 NYC Meetup Vector Databases 102
PDF
09-26-2024 Conf 42 Kube Native: Unleashing the Potential of Cloud Native Open...
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
09-18-2024 NYC Meetup Vector Databases 102
09-26-2024 Conf 42 Kube Native: Unleashing the Potential of Cloud Native Open...
Ad

Recently uploaded (20)

PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Computer network topology notes for revision
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PDF
Mega Projects Data Mega Projects Data
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
Galatica Smart Energy Infrastructure Startup Pitch Deck
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
Supervised vs unsupervised machine learning algorithms
climate analysis of Dhaka ,Banglades.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
.pdf is not working space design for the following data for the following dat...
STUDY DESIGN details- Lt Col Maksud (21).pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Computer network topology notes for revision
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Data_Analytics_and_PowerBI_Presentation.pptx
Foundation of Data Science unit number two notes
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Clinical guidelines as a resource for EBP(1).pdf
Moving the Public Sector (Government) to a Digital Adoption
Mega Projects Data Mega Projects Data
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn

DSSML24_tspann_CodelessGenerativeAIPipelines