SlideShare a Scribd company logo
Marco De Nittis
marco.denittis [a] gmail.com
LLM-ize your apps 101
whoami
• Independent Cloud Architect
• Trainer
• ❤ cloud, serverless, devops, AI, wasm
• Curious and tinkerer
objectives
• Explore an LLM app with no framework
• Bash scripts / curl (almost)
• Highlight fundamentals behaviours and
building blocks
Old Manticore Inn
in the beginning…
Everyone uses ChatGPT!
We should do either to improve our
business!
We could ask to some professionals
Too expensive!
Alex the waiter could do it!
He has a playstation and plays Fortnite every day
in the beginning…
Ok, we need Python
I hate snakes, no pythons here
And also several cloud servers with
GPUs
Ok, use this, that’s the same
…then
fi
rst try
• Open AI API
• Fair price
• Rest HTTPS calls with auth token
do it!
fi
rst try
• Very easy to implement
• No contextual responses
second try - context
• Prompt engineering
• Adding context with background
informations
• Mixing user text with “prepared
contextual text”
prompt engineering
• Several established techniques
• Role: “you are an helpful restaurant
assistant..
• Shots: providing examples
• Ask for steps
do it!
second try - context
• Tones and background are ok
• No integrations with company
informations
what is a LLM
• Probabilistic engine: most probable text
response from input (according to the
corpus)
• Black box with input and output pipe
• No dynamic/short term memory
• Context
context
• Max input + output size
• Measured in tokens
• Group of letters
• English 1.2 token/word
• Typical size 4k - 32k, and
beyond…
memory with context
• Adding all the informations in each
request
• How "the chat" ChatGPT works
• Limited to context size
RAG + external memory
• Semantic memory
• => retrieve the “most relevant” informations
• Vector database + RAG:
• Retrieval Augmented Generation
• Include in context only the relevants
• Easy to edit the memory
fi
ne tuning
• Enhancing the “cultural baggage” of
the LLM
• Train again the model with custom
speci
fi
c data
• Expensive (not so much)
• Dif
fi
cult to “edit” the data in memory
embeddings
• Place a text in a multidimensional “space of concepts"
• Digest the informations
• text => vector (of
fl
oats)
• High dimensional vector (300+)
• Relevancy => mathematical distance
• A model “calculates” the embeds
• Vector database to store & search
vector database
• A database capable to handle vector of
fl
oats
• Indexing
• Search
• Find distances (several algorithms)
• Ad hoc db: Qdrant, Pinecone, Chroma, Milvus, …
• Almost any common db: postgres, redis, sqlite, …
phases
• Storing and indexing company info in a vector DB
• Embeddings contents
• Una tantum
• Embeddings of user prompt
• Search of relevant infos
• Embed infos in the prompt
phases
do it!
improvements
• Chunking input for large document
• Check for prompt injection
• feedback (or self feedback) for the
response
improvements
• Integrate LLM framework
• Assistant base approach
• An LLM coordinating the work of
other LLMs
• Needs a full-
fl
edged framework
point of attentions
• Control costs
• Input control
• Predictability
• Sustainability
thank you!
Marco De Nittis
marco.denittis [a] gmail.com
https://speakerscore.it/AIHEROES-101
https://github.com/mdnmdn/aiheroes-2023

More Related Content

PPTX
Generative AI.pptx
PPTX
Smartphone Hardware Architecture
PPT
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Enviro...
PDF
IIoT / Industry 4.0 with Apache Kafka, Connect, KSQL, Apache PLC4X
PPTX
introduction Azure OpenAI by Usama wahab khan
PDF
The future of AI is hybrid
PDF
Edge Computing
PPTX
Accelerate Your Digital Transformation Journey with Pimcore
Generative AI.pptx
Smartphone Hardware Architecture
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Enviro...
IIoT / Industry 4.0 with Apache Kafka, Connect, KSQL, Apache PLC4X
introduction Azure OpenAI by Usama wahab khan
The future of AI is hybrid
Edge Computing
Accelerate Your Digital Transformation Journey with Pimcore

What's hot (20)

PDF
Introduction to Cloud Computing and Big Data
PPTX
Anypoint Custom Metrics Mastery
PPTX
API Frenzy: API Strategy 101
PDF
DDS vs AMQP
PDF
The internet of things.pptx
PPTX
Domain specific IoT
PPTX
Introduction to microservices
PDF
Next Gen ADM: The future of application services.
 
PPTX
Telecom API Management and Monetization
PDF
Limitations and Cost of Using Microsoft's Power Platform
PPTX
Datacenter overview
PDF
Understanding the Internet of Things Protocols
PDF
DDS and OPC UA Explained
PPTX
Benefits Of Building Private Cloud
PDF
IoT Communication Protocols
PPTX
AI services in google
PPTX
Top 5 IoT Use Cases
PPTX
Watson IOT Platform
PPTX
Cloud Architecture in the Data Center
Introduction to Cloud Computing and Big Data
Anypoint Custom Metrics Mastery
API Frenzy: API Strategy 101
DDS vs AMQP
The internet of things.pptx
Domain specific IoT
Introduction to microservices
Next Gen ADM: The future of application services.
 
Telecom API Management and Monetization
Limitations and Cost of Using Microsoft's Power Platform
Datacenter overview
Understanding the Internet of Things Protocols
DDS and OPC UA Explained
Benefits Of Building Private Cloud
IoT Communication Protocols
AI services in google
Top 5 IoT Use Cases
Watson IOT Platform
Cloud Architecture in the Data Center
Ad

Similar to Integrate LLM in your applications 101 (20)

PDF
Semantic kernel - Do you need Python to play with LLM?
PPTX
GPT, LLM, RAG, and RAG in Action: Understanding the Future of AI-Powered Info...
PDF
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
PDF
Java Developers - What Lies Ahead in the AI era
PDF
Presentation Session 2 -Context Grounding.pdf
PDF
MemGPT: Introduction to Memory Augmented Chat
PDF
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
PDF
Chat with your data, privately and locally
PPTX
Build With AI Event: GDG on Campus IIE.pptx
PDF
Use Case Patterns for LLM Applications (1).pdf
PDF
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
PDF
Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...
PDF
'The Art & Science of LLM Reliability - Building Trustworthy AI Systems' by M...
PPTX
TechDayPakistan-Slides RAG with Cosmos DB.pptx
PPTX
Cold_Email_Generator_using_LLM_APIS.pptx
PDF
Blending AI in Enterprise Architecture.pdf
PDF
What need to be mastered as AI-Powered Java Developers
PPTX
Journey of Generative AI
PPTX
AI Bots and there ways of development using the code
PPTX
aistudy-240521200530-db141c56 RAG AI.pptx
Semantic kernel - Do you need Python to play with LLM?
GPT, LLM, RAG, and RAG in Action: Understanding the Future of AI-Powered Info...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Java Developers - What Lies Ahead in the AI era
Presentation Session 2 -Context Grounding.pdf
MemGPT: Introduction to Memory Augmented Chat
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
Chat with your data, privately and locally
Build With AI Event: GDG on Campus IIE.pptx
Use Case Patterns for LLM Applications (1).pdf
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
Dell APEX Cloud Platform for Red Hat OpenShift: An easily deployable and powe...
'The Art & Science of LLM Reliability - Building Trustworthy AI Systems' by M...
TechDayPakistan-Slides RAG with Cosmos DB.pptx
Cold_Email_Generator_using_LLM_APIS.pptx
Blending AI in Enterprise Architecture.pdf
What need to be mastered as AI-Powered Java Developers
Journey of Generative AI
AI Bots and there ways of development using the code
aistudy-240521200530-db141c56 RAG AI.pptx
Ad

Recently uploaded (20)

PDF
Encapsulation theory and applications.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Approach and Philosophy of On baking technology
PPTX
Cloud computing and distributed systems.
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
MYSQL Presentation for SQL database connectivity
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPT
Teaching material agriculture food technology
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Encapsulation theory and applications.pdf
Programs and apps: productivity, graphics, security and other tools
Approach and Philosophy of On baking technology
Cloud computing and distributed systems.
The Rise and Fall of 3GPP – Time for a Sabbatical?
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
MYSQL Presentation for SQL database connectivity
MIND Revenue Release Quarter 2 2025 Press Release
Teaching material agriculture food technology
Building Integrated photovoltaic BIPV_UPV.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
sap open course for s4hana steps from ECC to s4
Per capita expenditure prediction using model stacking based on satellite ima...
Understanding_Digital_Forensics_Presentation.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

Integrate LLM in your applications 101

  • 1. Marco De Nittis marco.denittis [a] gmail.com LLM-ize your apps 101
  • 2. whoami • Independent Cloud Architect • Trainer • ❤ cloud, serverless, devops, AI, wasm • Curious and tinkerer
  • 3. objectives • Explore an LLM app with no framework • Bash scripts / curl (almost) • Highlight fundamentals behaviours and building blocks
  • 5. in the beginning… Everyone uses ChatGPT! We should do either to improve our business! We could ask to some professionals Too expensive! Alex the waiter could do it! He has a playstation and plays Fortnite every day
  • 6. in the beginning… Ok, we need Python I hate snakes, no pythons here And also several cloud servers with GPUs Ok, use this, that’s the same
  • 8. fi rst try • Open AI API • Fair price • Rest HTTPS calls with auth token
  • 10. fi rst try • Very easy to implement • No contextual responses
  • 11. second try - context • Prompt engineering • Adding context with background informations • Mixing user text with “prepared contextual text”
  • 12. prompt engineering • Several established techniques • Role: “you are an helpful restaurant assistant.. • Shots: providing examples • Ask for steps
  • 14. second try - context • Tones and background are ok • No integrations with company informations
  • 15. what is a LLM • Probabilistic engine: most probable text response from input (according to the corpus) • Black box with input and output pipe • No dynamic/short term memory • Context
  • 16. context • Max input + output size • Measured in tokens • Group of letters • English 1.2 token/word • Typical size 4k - 32k, and beyond…
  • 17. memory with context • Adding all the informations in each request • How "the chat" ChatGPT works • Limited to context size
  • 18. RAG + external memory • Semantic memory • => retrieve the “most relevant” informations • Vector database + RAG: • Retrieval Augmented Generation • Include in context only the relevants • Easy to edit the memory
  • 19. fi ne tuning • Enhancing the “cultural baggage” of the LLM • Train again the model with custom speci fi c data • Expensive (not so much) • Dif fi cult to “edit” the data in memory
  • 20. embeddings • Place a text in a multidimensional “space of concepts" • Digest the informations • text => vector (of fl oats) • High dimensional vector (300+) • Relevancy => mathematical distance • A model “calculates” the embeds • Vector database to store & search
  • 21. vector database • A database capable to handle vector of fl oats • Indexing • Search • Find distances (several algorithms) • Ad hoc db: Qdrant, Pinecone, Chroma, Milvus, … • Almost any common db: postgres, redis, sqlite, …
  • 22. phases • Storing and indexing company info in a vector DB • Embeddings contents • Una tantum • Embeddings of user prompt • Search of relevant infos • Embed infos in the prompt
  • 25. improvements • Chunking input for large document • Check for prompt injection • feedback (or self feedback) for the response
  • 26. improvements • Integrate LLM framework • Assistant base approach • An LLM coordinating the work of other LLMs • Needs a full- fl edged framework
  • 27. point of attentions • Control costs • Input control • Predictability • Sustainability
  • 28. thank you! Marco De Nittis marco.denittis [a] gmail.com https://speakerscore.it/AIHEROES-101 https://github.com/mdnmdn/aiheroes-2023