Amazon Q and Bedrock,
fully managed vs custom
Alessandra Bilardi
Data & Automation Specialist @ Corley Cloud
>>AI CONF 2025
AI Conf 2025
Amazon Q and Bedrock,
fully managed vs custom
AI Conf 2025
Oltre 500 progetti su AWS
Corley Cloud è una realtà certificata
con innumerevoli riconoscimenti e
un portfolio di centinaia di progetti
AWS sviluppati in diversi ambiti:
cloud native, migrazione, machine
learning & AI, serverless, IoT,
sicurezza e cloudOps.
Advanced Partner AWS
AI Conf 2025
Data & Automation Specialist @ Corley Cloud
alessandra.bilardi@corley.it
corley.it
Alessandra Bilardi
AI Conf 2025
Alessandra Bilardi
Data & Automation Specialist @ Corley Cloud
alessandra.bilardi@corley.it
corley.it
AI Conf 2025
Alessandra Bilardi
Data & Automation Specialist @ Corley Cloud
alessandra.bilardi@corley.it
corley.it
AI Conf 2025
SUMMARY
Machine learning steps and actors
Generative AI with Amazon Q
Generative AI with Amazon Bedrock
Chat bot
AI Conf 2025
Machine learning steps and actors
AI Conf 2025
What are the steps of ML ?
➔ The data may arrive ready for learning,
but often some processing is needed
➔ Model training could be delegated to an
AI system, except for custom steps
➔ Evaluation is a prediction for which we
know the expected values, for which we
can calculate metrics
➔ The prediction works on new data
processed with point 1 with the best
model saved in point 3
Preparation
Training
& Tuning
Testing
& Evaluation
Prediction
(inference)
AI Conf 2025
ML system
AI Conf 2025
ML system
AI Conf 2025
ML system
AI Conf 2025
ML system
AI Conf 2025
ML
AI Conf 2025
ML
AI Conf 2025
Are there other steps or actors in ML ?
➔ Embeddings are objects that contain
information about text, images, videos,
audio or code
➔ The prompt is the text that contains the
behavior that the model must have, the
instructions to follow to respond to the
request posed.
➔ Augmented Generation (AG) techniques
allow us to exploit generalist models by
providing them with instructions (the
prompt), context (an extract of the
embeddings) and a request to obtain a
specific response.
Question
AG
Answer
Embeddings
& Prompt
LM
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Generative AI with Amazon Q
AI Conf 2025
Amazon Q
AI Conf 2025
Amazon Q
AI Conf 2025
Amazon Q
AI Conf 2025
Amazon Q
Business
AI Conf 2025
Amazon Q
Business
1. Embedding (as needed)
AI Conf 2025
Amazon Q
Business
1. Embedding (as needed)
AI Conf 2025
Amazon Q
Business
1. Embedding (as needed)
AI Conf 2025
Amazon Q
Business
1. Embedding (as needed)
2. Request (question)
3. RAG
4. Response (answer)
AI Conf 2025
Amazon Q
Business
1. Embedding (as needed)
2. Request (question)
3. RAG
4. Response (answer)
AI Conf 2025
Amazon Q
Developer
1. Request
AI Conf 2025
Amazon Q
Developer
1. Request
2. Embedding (as needed)
AI Conf 2025
Amazon Q
Developer
1. Request
2. Embedding (as needed)
3. Context
AI Conf 2025
Generative AI with Amazon Bedrock
AI Conf 2025
Amazon Bedrock
1
2 Knowledge base (embedding)
3 Agent
Models
4 Prompt
AI Conf 2025
Models
AI Conf 2025
Models
AI Conf 2025
Models
AI Conf 2025
Models
AI Conf 2025
Models
AI Conf 2025
Models
AI Conf 2025
Models
AI Conf 2025
Knowledge base
AI Conf 2025
Knowledge base
AI Conf 2025
Knowledge base
AI Conf 2025
Knowledge base
AI Conf 2025
Knowledge base
AI Conf 2025
Agent
AI Conf 2025
Agent
AI Conf 2025
Agent
AI Conf 2025
Agent
AI Conf 2025
Agent
AI Conf 2025
Agent
AI Conf 2025
Agent
AI Conf 2025
Agent
AI Conf 2025
Prompt
AI Conf 2025
Prompt
AI Conf 2025
Prompt
AI Conf 2025
Prompt
AI Conf 2025
Prompt
AI Conf 2025
Flows
AI Conf 2025
Flows
AI Conf 2025
Flows
AI Conf 2025
Amazon Bedrock
1
2 Model evaluation
3 Playground
Data automation
4 Prompt routers / caching
AI Conf 2025
Chat bot
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - preparation steps
AI Conf 2025
Use case - Chat bot - production version 1.0
AI Conf 2025
Use case - Chat bot - production version 1.0
AI Conf 2025
Use case - Chat bot - production version 1.0
AI Conf 2025
Use case - Chat bot - production version 1.0
AI Conf 2025
Use case - Chat bot - production version 1.0
AI Conf 2025
Use case - Chat bot - production version 1.1
AI Conf 2025
Use case - Chat bot - production version 2.0
AI Conf 2025
Use case - Chat bot - production version 2.0
AI Conf 2025
Use case - Chat bot - production version 2.0
AI Conf 2025
Use case - Chat bot - production version 2.0
AI Conf 2025
Use case - Chat bot - production version 2.0
AI Conf 2025
Which infrastructure for the
ChatBot ?
AI Conf 2025
Solutions
1. like ChatGPT, max 30s
AI Conf 2025
Solutions
1. like ChatGPT, max 30s
2. extend 30s timeout
AI Conf 2025
Solutions
1. like ChatGPT, max 30s
2. extend 30s timeout
Goals
● ↓ the response time
● ↓ the inference costs
AI Conf 2025
Inference
➔ AWS Lambda
➔ Amazon Fargate
➔ Amazon EC2
AI Conf 2025
Inference
➔ AWS Lambda
➔ Amazon Fargate
➔ Amazon EC2
➔ Amazon SageMaker
AI Conf 2025
Inference
➔ AWS Lambda
➔ Amazon Fargate
➔ Amazon EC2
➔ Amazon SageMaker
➔ Amazon Bedrock / Q
AI Conf 2025
Inference
Needs
➔ GPU
➔ AWS Lambda
➔ Amazon Fargate
➔ Amazon EC2
➔ Amazon SageMaker
➔ Amazon Bedrock / Q
AI Conf 2025
Inference
Needs
➔ GPU
➔ Model loading
✖
✖
➔ AWS Lambda
➔ Amazon Fargate
➔ Amazon EC2
➔ Amazon SageMaker
➔ Amazon Bedrock / Q
✖
AI Conf 2025
Inference
Needs
➔ GPU
➔ Model loading
✖
✖
➔ AWS Lambda
➔ Amazon Fargate
➔ Amazon EC2
➔ Amazon SageMaker
➔ Amazon Bedrock / Q
✖
AI Conf 2025
Comparison of solutions
AI Conf 2025
AWS Services Comparison for a Chatbot
Services Difficulty Embeddings Training $ Inference $
Amazon Q
Business
$0.264 / hour
/ 200MB
$20 / user /
mo
Bedrock
fine tuning $2 / 1000
queries
$0.0079 / 1k
tokens
$30 / hour
Bedrock
on demand
$0.00072 for input / 1k tokens
and for output / 1k tokens
Amazon
SageMaker
$2 / 1000
queries
$0.921 / hour $0.921 / hour
AI Conf 2025
AWS Services Comparison for a Chatbot
Services Difficulty Embeddings Training $ Inference $
Amazon Q
Business
$0.264 / hour
/ 200MB
$20 / user /
mo
Bedrock
fine tuning $2 / 1000
queries
$0.0079 / 1k
tokens
$30 / hour
Bedrock
on demand
$0.00072 for input / 1k tokens
and for output / 1k tokens
Amazon
SageMaker
$2 / 1000
queries
$0.921 / hour $0.921 / hour
AI Conf 2025
AWS Services Comparison for a Chatbot
Services Difficulty Embeddings Training $ Inference $
Amazon Q
Business
$0.264 / hour
/ 200MB
$20 / user /
mo
Bedrock
fine tuning $2 / 1000
queries
$0.0079 / 1k
tokens
$30 / hour
Bedrock
on demand
$0.00072 for input / 1k tokens
and for output / 1k tokens
Amazon
SageMaker
$2 / 1000
queries
$0.921 / hour $0.921 / hour
AI Conf 2025
AWS Services Comparison for a Chatbot
Services Difficulty Embeddings Training $ Inference $
Amazon Q
Business
$0.264 / hour
/ 200MB
$20 / user /
mo
Bedrock
fine tuning $2 / 1000
queries
$0.0079 / 1k
tokens
$30 / hour
Bedrock
on demand
$0.00072 for input / 1k tokens
and for output / 1k tokens
Amazon
SageMaker
$2 / 1000
queries
$0.921 / hour $0.921 / hour
AI Conf 2025
Services Embeddings $ Training $ Inference $
Amazon Q Business 190 20 (user / mo)
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
AI Conf 2025
Services Embeddings $ Training $ Inference $
Amazon Q Business 190 20 (user / mo)
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
AI Conf 2025
Services Embeddings $ Training $ Inference $
Amazon Q Business 190 20 (user / mo)
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
AI Conf 2025
Services Embeddings $ Training $ Inference $
Amazon Q Business 190 20 (user / mo)
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
AI Conf 2025
Services Embeddings $ Training $ Inference $
Amazon Q Business 190
Bedrock fine tuning
2
1.5089 22320
Bedrock on demand 1.0714 (per 1k token)
Amazon SageMaker
2 0.0154
3.68 (serverless)
685.22 (provisioned)
AWS Services Comparison for a Chatbot
Excluded from costs: ML storage, data processing and provisioned concurrency (serverless only)
Example: 1 training of 191011 tokens + 1 request of 20s for every hour, every day for a month
AI Conf 2025
Thanks
for listening!
Thank you!
>>AI CONF 2025
👉 slides & videos: https://www.improove.tech/videos

More Related Content

PDF
APIdays Barcelona 2019 - How to build a social network on Serverless with Yan...
PDF
How to build a social network on serverless
PDF
20201111 AWS Black Belt Online Seminar AWS CodeStar & AWS CodePipeline
PDF
Serverless Design Patterns
PDF
Serveless design patterns
PDF
Visión Microsoft Fabric Data analyst for the era of AI
PPTX
Visión Microsoft Fabric Data analyst for the era of AI
PDF
Serverless Design Patterns (London Dev Community)
APIdays Barcelona 2019 - How to build a social network on Serverless with Yan...
How to build a social network on serverless
20201111 AWS Black Belt Online Seminar AWS CodeStar & AWS CodePipeline
Serverless Design Patterns
Serveless design patterns
Visión Microsoft Fabric Data analyst for the era of AI
Visión Microsoft Fabric Data analyst for the era of AI
Serverless Design Patterns (London Dev Community)

Similar to Amazon Q and Amazon Bedrock, fully managed vs. custom - 2025-06-25 (20)

PDF
How to build a social network on serverless
PDF
Von JavaEE auf Microservice in 6 Monaten - The Good, the Bad, and the wtfs...
PDF
The Future is Now: AI-powered Salesforce Integration Explained, Introducing 2...
PDF
How to build a social network on Serverless (AWS Community Summit)
PDF
How to build a social network on serverless | Yan Cui
PDF
How serverless changes the cost paradigm
PDF
Serverless is a win for businesses, not just developers
PDF
How to build a social network on serverless
PDF
Serverless Design Patterns
PDF
Serveless design patterns (VoxxedDays Luxembourg)
PDF
Build social network in 4 weeks
PDF
Custom processing and modeling with Amazon SageMaker - 2024-06-17
PPTX
Azure Integration in Production with Logic Apps and more
PDF
Serverless Architectural Patterns
PDF
[오픈소스컨설팅]Session 2 1. Future of Team Collaboration
PDF
Building a social network in under 4 weeks with Serverless and GraphQL
PDF
Biz Talk Server Certification
PDF
Lessons from running AppSync in prod
PDF
Getting Started with Serverless Architectures
POTX
devworkshop-10_28_1015-amazon-conference-presentation
How to build a social network on serverless
Von JavaEE auf Microservice in 6 Monaten - The Good, the Bad, and the wtfs...
The Future is Now: AI-powered Salesforce Integration Explained, Introducing 2...
How to build a social network on Serverless (AWS Community Summit)
How to build a social network on serverless | Yan Cui
How serverless changes the cost paradigm
Serverless is a win for businesses, not just developers
How to build a social network on serverless
Serverless Design Patterns
Serveless design patterns (VoxxedDays Luxembourg)
Build social network in 4 weeks
Custom processing and modeling with Amazon SageMaker - 2024-06-17
Azure Integration in Production with Logic Apps and more
Serverless Architectural Patterns
[오픈소스컨설팅]Session 2 1. Future of Team Collaboration
Building a social network in under 4 weeks with Serverless and GraphQL
Biz Talk Server Certification
Lessons from running AppSync in prod
Getting Started with Serverless Architectures
devworkshop-10_28_1015-amazon-conference-presentation
Ad

More from Alessandra Bilardi (20)

PDF
The Art of Data Visualization - 2025-05-31
PDF
Data Management on AWS: from caos to centralized governance - 2025-03-26
PDF
GenAI-powered assistants compared in a real case - 2025-03-18
PDF
Forecasting in AWS - 2025-01-25
PDF
Overview of Hugging Face platform - 2024-10-24
PDF
A gentle introduction to MLSecOps - 2024-10-11
PDF
Custom processing and modeling with Amazon SageMaker - 2024-09-26
PDF
Data scientist vs Cloud engineer: who wins ? - 2024-09-19
PDF
IoT: ingestion, streaming, real-time and interactive data analysis - 2024-05-29
PDF
MLOps vs LLMOps (by workflows and use cases) - 2024-05-21
PDF
How to analyze the data arriving from the IoT? - 2024-05-16
PDF
Overview of the OpenCV library and some use cases - 2024-04-19
PDF
How to move your ML system from local to production - 2024-03-15
PDF
Overview of the Kaggle platform and its competitions
PDF
Forecasting in AWS - 2024-02-01
PDF
From your laptop to all resource that you need - 2023-12-09
PDF
Parallelize data processing - 2023-10-24
PDF
The Fourier transformation - 2023-07-23
PDF
Anomaly Detection and IP Insights - 2023-06-10
PDF
Forecasting in AWS - 2023-05-16
The Art of Data Visualization - 2025-05-31
Data Management on AWS: from caos to centralized governance - 2025-03-26
GenAI-powered assistants compared in a real case - 2025-03-18
Forecasting in AWS - 2025-01-25
Overview of Hugging Face platform - 2024-10-24
A gentle introduction to MLSecOps - 2024-10-11
Custom processing and modeling with Amazon SageMaker - 2024-09-26
Data scientist vs Cloud engineer: who wins ? - 2024-09-19
IoT: ingestion, streaming, real-time and interactive data analysis - 2024-05-29
MLOps vs LLMOps (by workflows and use cases) - 2024-05-21
How to analyze the data arriving from the IoT? - 2024-05-16
Overview of the OpenCV library and some use cases - 2024-04-19
How to move your ML system from local to production - 2024-03-15
Overview of the Kaggle platform and its competitions
Forecasting in AWS - 2024-02-01
From your laptop to all resource that you need - 2023-12-09
Parallelize data processing - 2023-10-24
The Fourier transformation - 2023-07-23
Anomaly Detection and IP Insights - 2023-06-10
Forecasting in AWS - 2023-05-16
Ad

Recently uploaded (20)

PPTX
SAP 2 completion done . PRESENTATION.pptx
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPT
Predictive modeling basics in data cleaning process
PDF
Global Data and Analytics Market Outlook Report
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
SET 1 Compulsory MNH machine learning intro
PDF
Introduction to the R Programming Language
PPTX
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
A Complete Guide to Streamlining Business Processes
PDF
Data Engineering Interview Questions & Answers Data Modeling (3NF, Star, Vaul...
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PPTX
Managing Community Partner Relationships
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PPTX
Steganography Project Steganography Project .pptx
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PDF
Microsoft 365 products and services descrption
SAP 2 completion done . PRESENTATION.pptx
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Predictive modeling basics in data cleaning process
Global Data and Analytics Market Outlook Report
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
[EN] Industrial Machine Downtime Prediction
SET 1 Compulsory MNH machine learning intro
Introduction to the R Programming Language
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
A Complete Guide to Streamlining Business Processes
Data Engineering Interview Questions & Answers Data Modeling (3NF, Star, Vaul...
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
Pilar Kemerdekaan dan Identi Bangsa.pptx
Managing Community Partner Relationships
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
Steganography Project Steganography Project .pptx
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Microsoft 365 products and services descrption

Amazon Q and Amazon Bedrock, fully managed vs. custom - 2025-06-25