Explaining How Real-Time GenAI Works in a Noisy Pub

Explaining how GenAI works
( in a noisy pub )
Simon Aubury
@SimonAubury

GenAI
You may have heard of it?
2

Training
Pre training & ﬁne tuning
Large language models - 3 things …
Data
A bunch of text
Architecture
The transformer
3

1. Data
Let’s compress the internet

5
“Beer”
Image by Republica from Pixabay
WebVectors Online for "beer"

Image by Christian_Birkholz from Pixabay

2. Architecture
The transformer

Understanding language can be challenging
9
The group of friends bought cold beer at the
local pub to celebrate their reunion.
context
context

Transformers
10
Source: Attention Is All You Need https://arxiv.org/abs/1706.03762

Self-attention
11
Inspired by DeepLearning.AI
The
group
of
friends
bought
cold
beer
The
group
of
friends
bought
cold
beer

A transformer + data
= a base model
12
The base model
Data
Transformer
Base model

Completion
Base model
13
Completions
Beer
is
served
best
cold
with
friends
Context
Base model + context
= completion

3. Training
Pre training & ﬁne-tuning

15
How to train your LLM
Pretraining (once)
1. PB’s of text.
2. 1000’s of GPUs.
3. Compress the text into
a neural network,
4. Pay 💰 wait 📆
➡ Obtain base model.
Fine-tuning (recurring)
1. 1000’s ideal Q&A
responses (human)
2. Finetune base model
on this data wait 📆
3. Obtain assistant model
4. Evaluate, deploy &
monitor

󰜄 It is faster to pick than to generate.
16
Reinforcement Learning from Human Feedback
(RLHF)
��
��

Training
Pre training & ﬁne tuning
Large language models
Data
A bunch of text
Architecture
The transformer
17

18
Retrieval Augmented Generation (RAG)
What drink should
I order?

“ The entire history of
software engineering
is one of rising levels
of abstraction.
Grady Booch, IBM chief scientist
19

20
Thanks!
@SimonAubury
❤Presentation template by SlidesCarnival, Icons Flaticon photos Unsplash and Pixabay

Explaining How Real-Time GenAI Works in a Noisy Pub

More Related Content

More from HostedbyConfluent (20)

Recently uploaded (20)

Explaining How Real-Time GenAI Works in a Noisy Pub