Explaining how GenAI works
(in a noisy pub)
Simon Aubury
@SimonAubury
GenAI
You may have heard of it?
Large language models - 3 things …
Data
A bunch of text
Architecture
The transformer
Training
Pre-training & fine-tuning
1. Data
Let’s compress the internet
“Beer”
Image by Republica from Pixabay
WebVectors Online for "beer"
Image by Christian_Birkholz from Pixabay
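A word-vector lookup like the one for "beer" can be sketched as below. The three-dimensional vectors are invented for illustration (real embeddings are learned from text and have hundreds of dimensions), and the names `VECTORS`, `cosine_similarity` and `nearest` are hypothetical, not part of WebVectors.

```python
import math

# Hypothetical 3-dimensional vectors, invented for illustration only.
# Real embeddings are learned from text, not written by hand.
VECTORS = {
    "beer": [0.9, 0.8, 0.1],
    "ale": [0.8, 0.9, 0.2],
    "wine": [0.7, 0.5, 0.3],
    "bicycle": [0.1, 0.0, 0.9],
}


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: near 1.0 means similar direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest(word, vectors=VECTORS):
    """Rank every other word by similarity to `word`, closest first."""
    target = vectors[word]
    scores = [
        (other, cosine_similarity(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for other, score in nearest("beer"):
        print(f"{other}: {score:.3f}")
```

With these made-up numbers, "ale" lands closest to "beer" and "bicycle" furthest away, which is the kind of neighbourhood a word-vector explorer shows.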
2. Architecture
The transformer
Image by delo from Pixabay
Understanding language can be challenging
The group of friends bought cold beer at the local pub to celebrate their reunion.
context
context
Transformers
Source: Attention Is All You Need https://arxiv.org/abs/1706.03762
Self-attention
Inspired by DeepLearning.AI
[Diagram: the tokens "The group of friends bought cold beer" shown twice, side by side]
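Self-attention over that sentence can be sketched in a few lines. This is a minimal illustration, not the full transformer: it uses the token vectors directly as queries, keys and values, whereas a real model first applies learned projection matrices. `toy_embedding` is a hypothetical stand-in for a learned embedding.

```python
import math

TOKENS = ["The", "group", "of", "friends", "bought", "cold", "beer"]


def toy_embedding(index, dim=4):
    """Hypothetical stand-in for a learned embedding: a fixed pattern per position."""
    return [math.sin((index + 1) * (d + 1)) for d in range(dim)]


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(vectors):
    """Scaled dot-product attention with queries = keys = values = vectors."""
    dim = len(vectors[0])
    weights, outputs = [], []
    for query in vectors:
        # Score this token against every token in the sentence, itself included.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in vectors
        ]
        row = softmax(scores)
        weights.append(row)
        # The output is a weighted blend of all token vectors.
        outputs.append(
            [sum(w * value[d] for w, value in zip(row, vectors)) for d in range(dim)]
        )
    return weights, outputs


if __name__ == "__main__":
    vectors = [toy_embedding(i) for i in range(len(TOKENS))]
    weights, _ = self_attention(vectors)
    # How much "beer" attends to each token in the sentence.
    for token, weight in zip(TOKENS, weights[-1]):
        print(f"{token}: {weight:.2f}")
```

Each row of `weights` says how strongly one token draws on every other token, which is what the lines in an attention diagram depict.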
A transformer + data
= a base model
The base model
Data
Transformer
Base model
Completion
Base model
Completions
Beer is served best cold with friends
Context
Base model + context
= completion
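The completion loop can be sketched with a far simpler model than a transformer. The example below counts which word follows which in a tiny corpus and then repeatedly appends the most likely next word. The counting model is a stand-in chosen for brevity, and `CORPUS`, `train_bigram_model` and `complete` are hypothetical names.

```python
from collections import Counter, defaultdict

# Tiny stand-in corpus built from the sentences on the slides.
# The first sentence is repeated so that its word pairs clearly dominate.
CORPUS = (
    "beer is served best cold with friends . "
    "beer is served best cold with friends . "
    "the group of friends bought cold beer at the local pub ."
)


def train_bigram_model(text):
    """Count, for each word, which words follow it and how often."""
    words = text.split()
    following = defaultdict(Counter)
    for current, nxt in zip(words, words[1:]):
        following[current][nxt] += 1
    return following


def complete(model, context, max_new_words=5):
    """Extend the context one word at a time with the most frequent follower."""
    words = context.lower().split()
    for _ in range(max_new_words):
        candidates = model.get(words[-1])
        if not candidates:
            break  # the model has never seen this word
        next_word = candidates.most_common(1)[0][0]
        if next_word == ".":
            break  # treat the full stop as end of completion
        words.append(next_word)
    return " ".join(words)


if __name__ == "__main__":
    model = train_bigram_model(CORPUS)
    print(complete(model, "Beer is served"))
```

A real base model looks at the whole context rather than only the last word, and samples from a probability distribution instead of always taking the top choice, but the loop has the same shape: context in, next token out, repeat.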
3. Training
Pre-training & fine-tuning
How to train your LLM
Pre-training (once)
1. PBs of text.
2. 1000s of GPUs.
3. Compress the text into a neural network.
4. Pay 💰, wait 📆.
➡ Obtain base model.
Fine-tuning (recurring)
1. 1000s of ideal Q&A responses (human-written).
2. Fine-tune the base model on this data, wait 📆.
3. Obtain assistant model.
4. Evaluate, deploy & monitor.
It is faster to pick than to generate.
Reinforcement Learning from Human Feedback (RLHF)
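Picking between two answers can be turned into a training signal. A common formulation, sketched here under the assumption that a reward model assigns each answer a score, is a pairwise loss that is small when the answer the human picked scores higher than the one they rejected. The scores in the usage example are made up, and `preference_loss` and `mean_preference_loss` are hypothetical names.

```python
import math


def preference_loss(reward_chosen, reward_rejected):
    """Pairwise loss, equal to -log(sigmoid(reward_chosen - reward_rejected))."""
    margin = reward_chosen - reward_rejected
    return math.log(1.0 + math.exp(-margin))


def mean_preference_loss(comparisons):
    """Average the pairwise loss over (chosen_score, rejected_score) pairs."""
    losses = [preference_loss(chosen, rejected) for chosen, rejected in comparisons]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    # Hypothetical reward-model scores for three human comparisons.
    comparisons = [(2.0, 0.5), (1.0, 1.2), (3.0, -1.0)]
    print(f"mean loss: {mean_preference_loss(comparisons):.3f}")
```

Minimising this loss pushes the reward model to agree with human picks. The assistant model is then tuned to produce answers that score well under that reward model.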
Large language models
Data
A bunch of text
Architecture
The transformer
Training
Pre-training & fine-tuning
Retrieval Augmented Generation (RAG)
What drink should I order?
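The retrieval step for a question like this can be sketched as follows: find the most relevant note in a knowledge base, then place it in the prompt ahead of the question. The documents are invented, and word overlap stands in for the embedding similarity search a real system would more likely use. `DOCUMENTS`, `retrieve` and `build_prompt` are hypothetical names.

```python
import re

# Hypothetical knowledge base, invented for illustration.
DOCUMENTS = [
    "Drink of the week: a cold pale ale on tap.",
    "The kitchen closes at nine in the evening.",
    "Quiz night is held every Tuesday.",
]


def tokenize(text):
    """Lower-case the text and return its set of words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def retrieve(question, documents, top_k=1):
    """Return the documents sharing the most words with the question."""
    question_words = tokenize(question)
    ranked = sorted(
        documents,
        key=lambda doc: len(question_words & tokenize(doc)),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question, documents):
    """Put the retrieved text in front of the question as context."""
    context = "\n".join(retrieve(question, documents))
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"


if __name__ == "__main__":
    print(build_prompt("What drink should I order?", DOCUMENTS))
```

The resulting prompt is what gets sent to the model, so the answer can draw on facts the model never saw in training.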
“The entire history of software engineering is one of rising levels of abstraction.”
Grady Booch, IBM chief scientist
Thanks!
@SimonAubury
❤ Presentation template by SlidesCarnival, icons by Flaticon, photos from Unsplash and Pixabay
