Training & Serving Open-Sourced Foundational Models
Agenda
0. Intro
1. Context
2. Basics
3. Advanced
4. Summary
Intro
About me
Tech Lead at Georgian,
Create ML courses.
Previously:
BorealisAI, Neuro, Ring and
Depositphotos
Connect on GitHub and LinkedIn
Context
MLOps stages
Monitoring
Experiments
Data
Pipelines
Deployment
LLM
Just use GPT4
Why?
- Costs
- Speed
- Privacy
- Moat
Compliance challenges
- PII aspects
- Data sent to a third-party API
Over-reliance factor
- Upper-bounded by third-party tech
- No control over quality
Differentiation challenges
- Cannot incorporate your product’s user feedback
- What’s stopping your competitor from doing the same?
LLMs to keep an eye on
Closed:
- GPT-4
- PaLM 2/Gemini
- Claude 3
- Mistral
Open:
- Mistral/Mixtral (7B, 8 x 7B)
- Llama 2 (7B, 13B, 70B)
- Gemma (2B, 7B)
- Flan-T5
Our benchmarking toolkit: LLM-Finetuning-Hub
(Training, Inference, Costs, Time)
Make LLM your LLM
Options you need to use:
- Prompts
- RAG
- Fine-Tuning
*use all
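As a rough illustration of how prompts and RAG combine, here is a minimal sketch. The documents, the word-overlap retriever, and the prompt template are all hypothetical and only stand in for a real vector store and a real LLM call; nothing here comes from the talk itself.

```python
from collections import Counter

# Hypothetical in-memory knowledge base; a real system would use a vector store.
DOCS = [
    "Refunds are processed within 5 business days.",
    "Premium plans include priority support.",
    "Passwords can be reset from the account settings page.",
]


def tokenize(text):
    """Lowercase the text and strip basic punctuation from each word."""
    return [w.strip(".,?!").lower() for w in text.split()]


def retrieve(question, docs, k=1):
    """Toy retriever: rank documents by word overlap with the question."""
    q = Counter(tokenize(question))
    scored = [(sum((q & Counter(tokenize(d))).values()), d) for d in docs]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for score, d in scored[:k] if score > 0]


def build_prompt(question, docs, k=1):
    """Assemble the prompt: instruction, retrieved context, then the question."""
    context = "\n".join(f"- {d}" for d in retrieve(question, docs, k))
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}\nAnswer:"
    )


prompt = build_prompt("How do I reset my password?", DOCS)
```

The resulting prompt would then be sent to whichever model is being served; a fine-tuned model can consume the same prompt, which is one way to read "use all".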
Training
blog post
end2end code
LLM Finetuning Toolkit
Basics
Fundamentals
Source: https://madewithml.com/courses/mlops/api/
Why not FastAPI?
Source: https://madewithml.com/courses/mlops/api/
TGI from Hugging Face
Source: https://github.com/huggingface/text-generation-inference
vLLM
Source: https://github.com/vllm-project/vllm
OpenLLM
Source: https://github.com/bentoml/OpenLLM
DeepSpeed-MII
Source: https://github.com/microsoft/DeepSpeed-MII
Advanced
Benchmarking
Source: https://k6.io/
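The slide points to k6 for load testing. The same idea can be sketched in plain Python: fire concurrent requests, then report throughput and latency percentiles. The `fake_generate` function below is a hypothetical stand-in for an HTTP call to a model server, and the concurrency and request counts are arbitrary.

```python
import statistics
import time
from concurrent.futures import ThreadPoolExecutor


def fake_generate(prompt):
    """Hypothetical stand-in for a request to a model server."""
    time.sleep(0.01)
    return prompt[::-1]


def timed_call(fn, prompt):
    """Return the wall-clock latency of one call, in seconds."""
    start = time.perf_counter()
    fn(prompt)
    return time.perf_counter() - start


def run_load_test(fn, prompts, concurrency=4):
    """Send all prompts with a fixed number of workers and summarize the run."""
    wall_start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        latencies = list(pool.map(lambda p: timed_call(fn, p), prompts))
    wall = time.perf_counter() - wall_start
    cuts = statistics.quantiles(latencies, n=100)  # 99 percentile cut points
    return {
        "requests": len(latencies),
        "throughput_rps": len(latencies) / wall,
        "p50_s": statistics.median(latencies),
        "p95_s": cuts[94],
    }


report = run_load_test(fake_generate, ["hello"] * 20)
```

Swapping `fake_generate` for a real client call would give a crude version of what a dedicated tool like k6 measures; a dedicated tool is still preferable for ramp-up stages and sustained load.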
Benchmarking: Best costs
Source: https://www.philschmid.de/sagemaker-llama-benchmark
Benchmarking: Best throughput
Source: https://www.philschmid.de/sagemaker-llama-benchmark
Benchmarking: Best latency
Source: https://www.philschmid.de/sagemaker-llama-benchmark
Best
cost = $3.50 & 138m
throughput = $4.38 & 37m
latency = $32 & 271m
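To compare the three configurations, the figures can be turned into an implied hourly price and an implied generation speed. This sketch assumes that "m" means minutes and that each figure refers to generating one million tokens; the slide states neither explicitly, so treat both as assumptions.

```python
# Figures from the slide: (total cost in USD, duration in minutes, assumed unit).
SLIDE_RESULTS = {
    "cost": (3.50, 138),
    "throughput": (4.38, 37),
    "latency": (32.0, 271),
}

# Assumption: each run generates this many tokens (not stated on the slide).
ASSUMED_TOKENS = 1_000_000


def implied_hourly_price(total_usd, minutes):
    """Hourly price of the deployment implied by total cost and duration."""
    return total_usd / (minutes / 60)


def implied_tokens_per_second(minutes, n_tokens=ASSUMED_TOKENS):
    """Generation speed implied by the duration for the assumed token count."""
    return n_tokens / (minutes * 60)


summary = {
    name: (
        round(implied_hourly_price(usd, minutes), 2),
        round(implied_tokens_per_second(minutes), 1),
    )
    for name, (usd, minutes) in SLIDE_RESULTS.items()
}
```

Under these assumptions the cheapest setup runs on hardware costing roughly a fifth as much per hour as the other two, while the throughput-optimized setup finishes the same workload several times faster.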
Triton Inference Server
Sources: https://github.com/triton-inference-server/tensorrtllm_backend, https://github.com/triton-inference-server/vllm_backend, https://github.com/NVIDIA/TensorRT-LLM,
https://github.com/triton-inference-server/server, https://github.com/triton-inference-server/python_backend
LoRAX & Multi-Adapters
Source: https://github.com/predibase/lorax
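The idea behind multi-adapter serving can be shown with a toy example: one shared base weight matrix, plus a small low-rank update (LoRA) chosen per request. This is a conceptual sketch only, not LoRAX's API; the matrices and the adapter names `support` and `sales` are made up.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


# Shared base weights, loaded once (toy 2x2 identity).
BASE_W = [[1.0, 0.0], [0.0, 1.0]]

# Hypothetical rank-1 adapters: the update is B @ A, stored as two small matrices.
ADAPTERS = {
    "support": {"A": [[1.0, 1.0]], "B": [[0.5], [0.0]]},
    "sales": {"A": [[1.0, -1.0]], "B": [[0.0], [2.0]]},
}


def forward(x, adapter_id=None):
    """Compute W x, plus B (A x) when the request names an adapter."""
    y = matvec(BASE_W, x)
    if adapter_id is not None:
        adapter = ADAPTERS[adapter_id]
        delta = matvec(adapter["B"], matvec(adapter["A"], x))
        y = [base + extra for base, extra in zip(y, delta)]
    return y


base_out = forward([1.0, 2.0])
support_out = forward([1.0, 2.0], "support")
sales_out = forward([1.0, 2.0], "sales")
```

Because the adapters are tiny compared with the base weights, many fine-tuned variants can share one deployed model and be selected per request.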
SageMaker & Vertex AI
Sources: https://www.philschmid.de/sagemaker-llama-llm, GCP: Serve Open-Source LLMs on Google Cloud
Async inference
Sources: https://www.philschmid.de/sagemaker-huggingface-async-inference, AWS Sagemaker: Asynchronous Inference
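The async pattern (submit a job, get an ID back immediately, fetch the result later) can be sketched in-process with `asyncio`. This is a simplified illustration under assumptions: the class `AsyncInferenceQueue` and the `slow_model` stub are hypothetical, and a managed service would typically persist requests and results in external storage rather than in memory.

```python
import asyncio
import itertools


class AsyncInferenceQueue:
    """Toy job queue: submit returns a job ID, results are fetched later."""

    def __init__(self, model_fn):
        self._model_fn = model_fn
        self._queue = asyncio.Queue()
        self._results = {}
        self._ids = itertools.count(1)

    async def submit(self, payload):
        """Enqueue a request and return its job ID without waiting for output."""
        job_id = next(self._ids)
        await self._queue.put((job_id, payload))
        return job_id

    async def worker(self):
        """Process queued jobs one at a time, storing each result by job ID."""
        while True:
            job_id, payload = await self._queue.get()
            self._results[job_id] = await self._model_fn(payload)
            self._queue.task_done()

    async def result(self, job_id, poll_s=0.01):
        """Poll until the job's result is available, then return it."""
        while job_id not in self._results:
            await asyncio.sleep(poll_s)
        return self._results.pop(job_id)


async def slow_model(prompt):
    """Hypothetical stand-in for a slow model call."""
    await asyncio.sleep(0.05)
    return prompt.upper()


async def demo():
    service = AsyncInferenceQueue(slow_model)
    worker = asyncio.create_task(service.worker())
    job_ids = [await service.submit(p) for p in ["hello", "world"]]
    outputs = [await service.result(j) for j in job_ids]
    worker.cancel()
    return job_ids, outputs


job_ids, outputs = asyncio.run(demo())
```

Decoupling submission from retrieval lets long generations run without holding a client connection open, at the price of the client having to poll or be notified.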
Memory summarization
Source: https://huggingface.co/spaces/hf-accelerate/model-memory-usage
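A back-of-the-envelope version of this kind of estimate multiplies the parameter count by the bytes per parameter. The sketch below covers model weights only; KV cache, activations, and framework overhead come on top, so real requirements are higher. The function name and the dtype table are illustrative, not taken from the linked tool.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1, "int4": 0.5}


def weight_memory_gib(n_params_billion, dtype="float16"):
    """Memory for model weights alone, in GiB."""
    total_bytes = n_params_billion * 1e9 * BYTES_PER_PARAM[dtype]
    return total_bytes / 1024**3


# Llama 2 sizes listed earlier in the deck, in float16.
estimates = {size: round(weight_memory_gib(size), 2) for size in (7, 13, 70)}
```

For example, a 7B model in float16 needs about 13 GiB for weights, and 4-bit quantization brings that down to roughly 3.3 GiB.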
Summary
Costs
Speed
Privacy
Moat
MLOps stages
Monitoring
Experiments
Data
Pipelines
Deployment
Thanks
“Machine Learning in Production” course
15% off promo code: aionlineday2024
Free course preview


Kyryl Truskovskyi: Training and Serving Open-Sourced Foundational Models (UA)