SlideShare a Scribd company logo
Google - Build with AI
April 2025 Florian Blaga
Security and Compliance Lead
Our People & Places
Our 400+ best-in-class experts are based across
the EMEA region, with locations including:
Romania
Iași
The Netherlands
Utrecht
UK
London, Manchester, Swindon, and Edinburgh
France
Paris
Belgium
Brussels
Gemini 2.0
What’s New?
And Google has put AI to work for 3B users in Google Workspace
Spelling & Grammar
Suggestions
Smart Reply
Noise Cancellation
Portrait Restore
Smart Compose
Malware Detection and Prevention
2017 2018 2019 2020 2021 2022 2023
2016
Explore
2015
Suggested Actions
Chart Suggestions
Answers
Grammar Suggestions
Formula Acceleration
Formula Suggestions
Smart Compose
Smart Compose
File Suggestions
Write & Re-Write
AI-generated Summaries
Sharing Suggestions
Practice sets
Originality reports Interactive questions for YT videos in
Classroom
Google keeps more people
safe online than anyone else
4 billion
Devices running
Chrome are protected
each day against
malware and social
engineering attacks
2.5 billion
Active Gmail users
protected against
phishing, malware,
and spam
5 billion
Google Safe Browsing
users devices protected
each day from malware
and social engineering .
Petabytes
Of cloud telemetry analyzed
each day by Chronicle and
Security Command Center
(CSPM) for threat detection
and response.
Generative AI is a type of artificial intelligence that
helps humans create content, such as text and
images.
It’ll enable a new era of work, as it can make humans
significantly more productive, collaborative, and
creative.
Generative AI can transform industries and roles by a
multiplying factor that is still unknown, and professionals
that don’t use it will likely be less productive than those
that do.
As a consequence, Google believes organizations that adopt
generative AI will have an opportunity to leapfrog those
who don't.
Generative AI has the potential to change the anatomy of
work, augmenting the capabilities of individual workers by
automating some of their individual activities. Current generative
AI and other technologies have the potential to automate work
activities that absorb 60 to 70 percent of employees’ time
today.”
McKinsey: The economic potential of generative AI: The next productivity frontier
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
What is it Why it matters
Gemini 2.0 Flash
GA
First
production-ready
Gemini 2.0 model
Offers the same 1M context window as 1.5 Flash
but with expanded capabilities (e.g., sophisticated
integration and tool use) and better performance,
exceeding 1.5 Pro on key benchmarks at twice the
speed
Gemini 2.0
Flash-Lite
Preview
Low-cost,
high-performance
Gemini 2.0 model
Offers the same 1M context window as 1.5 Flash
but with improved performance and a lower price
for workloads that blend small- and large-context
inputs and outputs.
Gemini 2.5 Pro
Experimental
Superior intelligence
with a 2M context
window
Ideal for coding, reasoning, complex prompts, and
long context
3 Sizes | Improved Performance | New Output Capabilities
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
ChatBot Arena
#1,6,6
,10
Top models as
blindly evaluated by
users
SEAL Leaderboard
#1
Mode for Visual
Understanding while
being orders of
magnitude cheaper
SEAL Leaderboard
#4
Model for Coding
while being orders of
magnitude cheaper
Vectara
Hallucination
#1,2,4
,5,8
Lowest hallucination
Rate
Features Notes
Gemini 2.0 Flash Gemini 2.0 Flash (Model ID: gemini-2.0-flash-exp) is available as an experimental preview release through
AI Studio, the Gemini Developer API and the Vertex AI Gemini API. Gemini 2.0 Flash offers improvements in
speed, quality, and advanced reasoning capabilities including enhanced understanding, coding, and
instruction following. (Only available in us-central)
Gemini 2.0 Flash Thinking Gemini 2.0 Flash Thinking is an experimental model that's trained to generate the "thinking process" the
model goes through as part of its response.
Google Gen AI SDK The new Google Gen AI SDK provides programmatic access to Gemini 2.0 using both the Developer and
Vertex AI APIs in 4 languages: Python and Go with Java and Javascript coming in the future.
Speech generation
(early access/allowlist)
Gemini 2.0 supports a new multimodal generation capability: text to speech as a private experimental
release.
Image generation
(early access/allowlist)
Gemini 2.0 supports image generation as a private experimental release.
Multimodal Live API The Multimodal Live API enables low-latency, two-way interactions using text, audio, and video input, and
audio and text output.
Tools: Search as a tool Google Search is available as a tool. Using Grounding with Google Search, you can improve the accuracy
and recency of responses from the model.
Tools: Code execution The code execution feature enables the model to generate and run Python code and learn iteratively from
the results until it arrives at a final output.
Gemini 2.0
Portfolio
Walkthrough
Gemini 2.0 Models
Comprehensive Table of Google Gemini AI Models
Feature
Gemini 2.5 Pro
Experimental Gemini 2.0 Flash Gemini 2.0 Flash-Lite Gemini 1.5 Flash Gemini 1.5 Flash-8B Gemini 1.5 Pro Imagen 3
Current release gemini-2.5-pro-exp-03-25
Latest (gemini-2.0-flash),
Stable (-001)
Latest
(gemini-2.0-flash-lite),
Stable (-001)
Latest stable
(gemini-1.5-flash), Stable
(-001, -002)
Latest stable
(gemini-1.5-flash-8b),
Stable (-001)
Latest stable
(gemini-1.5-pro), Stable
(-001, -002) imagen-3.0-generate-002
Current model gemini-2.5-pro-exp-03-25
gemini-2.0-flash, -001,
-exp, -thinking-exp gemini-2.0-flash-lite, -001
gemini-1.5-flash, -latest,
-001, -002
gemini-1.5-flash-8b, -latest,
-001
gemini-1.5-pro, -latest,
-001, -002 imagen-3.0-generate-002
Multimodal inputs Audio, images, video, text Audio, images, video, text Audio, images, video, text Audio, images, video, text Audio, images, video, text Audio, images, video, text Text
Text output Yes (limit 65,536) Yes (limit 8,192) Yes (limit 8,192) Yes (limit 8,192) Yes (limit 8,192) Yes (limit 8,192)
No (input for image
generation)
Image output No Experimental No
No (processes images as
input) No No Yes (up to 4 per prompt)
Audio output No Coming soon No No No No No
Multimodal Live API No Experimental No No No No No
Thinking model Yes
Experimental version
available No No No No No
Context window
Input: 1,048,576; Output:
65,536
Input: 1,048,576; Output:
8,192
Input: 1,048,576; Output:
8,192
Input: 1,048,576; Output:
8,192
Input: 1,048,576; Output:
8,192
Input: 2,097,152; Output:
8,192
Input: N/A; Output: Up to 4
Images
Controlled generationStructured outputs
Structured outputs,
Function calling Structured outputs
JSON mode, JSON
schema
System instructions, JSON
mode, JSON schema
System instructions, JSON
mode, JSON schema Details not available
Function calling Yes Yes No Yes Yes Yes No
Search as a tool Yes (Search grounding) Yes (Search) No Yes (Search) No No No
Code execution Yes Yes No Yes Yes Yes No
Batch prediction Information unavailable Supported on Vertex AI Supported on Vertex AI Supported on Vertex AI Information unavailable Supported on Vertex AI Information unavailable
Tuning No
Supported on Vertex AI
(-001)
Supported on Vertex AI
(-001) Supported Supported
Supported on Vertex AI
(-002)
Supported on Vertex AI
(customization,
fine-tuning)
AI Model Feature Comparison Table
Feature GPT-4.5 GPT-4o
Gemini 2.5 Pro
Experimental Gemini 2.0 Flash Claude 3.7 Sonnet Grok-3 DeepSeek R1 Llama 3.2
Current Release Feb 2025
May 2024 (Image Gen
Mar 2025) Mar 2025 Feb 2025 Feb 2025 Feb 2025 Jan 2025 Sep 2024
Current Model Orion Omni
gemini-2.5-pro-exp-03-
25 gemini-2.0-flash-001
claude-3-7-sonnet-202
50219 Grok-3 671B MoE (37B active) Llama 3.2
Multimodal Inputs:
Text Yes Yes Yes Yes Yes Yes Yes Yes
Multimodal Inputs:
Image Yes Yes Yes Yes Yes Yes Yes Yes
Multimodal Inputs:
Audio No Yes Yes Yes No No Yes Yes
Text Output Yes Yes Yes Yes Yes Yes Yes Yes
Image Output No Yes Yes Yes No Yes (via Aurora) No Yes
Audio Output No Yes Yes Yes No No No Yes
Multimodal Live API No Yes No Yes No No No No
Thinking Model
Scaled Unsupervised
Learning
Generative Pre-trained
Transformer
Thinking Model
(Reasons Through
Thoughts)
Thinking Model
(Reasons Before
Answering) Hybrid Reasoning
Reasoning-Based
(Test-Time Computing)
Reasoning Model
(MoE)
Decoder-Only
Transformer
Context Window 128K 128K 1M (2M Soon) 1M 200K 128K 16K 128K
Controlled Generation Yes (Steerability) Yes Yes Yes
Yes (Adjustable
Reasoning Budget)
Yes (Think/Big Brain
Modes) Yes (Temperature) Yes
Function Calling Yes Yes
Yes (Excluding
Compositional) Yes Yes (via Tools) No Explicit Mention No Yes (Tool Calling)
Search as a Tool Yes Yes Yes Yes Yes Yes No Explicit Mention No Explicit Mention
Code Execution Yes Yes Yes Yes Yes (via Claude Code) Yes No Explicit Mention
Yes (Tool Calling
Implied)
Batch Prediction Yes Yes No Explicit Mention Yes Yes (Preview) No Explicit Mention Yes No Explicit Mention
Tuning Yes Yes Yes Yes No Explicit Mention Yes Yes Yes
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
Online Prediction Gemini 1.5 Flash Gemini 2.0 Flash-Lite Gemini 2.0 Flash
Performance
(Relative to Gemini 1.5 Flash)
Slightly better than 1.5 Flash Much better than 1.5
Flash
Input text / image / video $0.075
$0.15 (long context)
$0.075 $0.10
Input audio $0.075
$0.15 (long context)
$0.075 $1
Output text $0.3
$0.6 (long context)
$0.3 $0.40
Output image NA NA Coming Soon
Output audio NA NA Coming Soon
Search as a Tool free query per
day (QPD)
$35/1K
(standard search)
NA 1,500 QPD Free
(+ $35/1K overage)
Context caching (storage) $1.00 / 1,000,000
tokens per hour
Available March 31,
2025
How does pricing compare between models
The enhanced reasoning
model with 1M context window
improved performance and
explainability. Building on the
speed and performance of 2.0
Flash, 2.0 Flash thinking exp
excels in science and math while
revealing its thought process
to solve complex problems
Progress is quick in this space
Gemini Thinking: Now available in Gemini Advanced & Google AI Studio
Instruction Models
Eg. Gemini 2.0 Flash, GPT-4o, Claude 3.5, Llama 3…
Strengths:
● Faster
● Cost Effective
Weaknesses:
● Less capable at the most complex reasoning
Example Use Cases:
● Summarization & Extraction of Content
● Creative Writing
● RAG & Fact based question answering
Thinking Models
Eg. 2.0 Flash Thinking, O3-mini, O1, Deepseek R1…
Strengths:
● First Principles Thinking & Problem Solving
Weaknesses:
● Much Slower
● More expensive (larger outputs)
● Overkill for basic tasks
Example Use Cases:
● Multi Step Agentic Reasoning
● Novel Problem Solving (eg. Mathematics & Code)
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
Chatbot Arena Overview (Task)
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
Google Cloud
The new ‘googling’ -
structuring your request
Persona
Task
Context
Format
You are a Sales Manager. Write a guide describing
3 sales techniques to use for the Sales Team when
selling a new car to 30 year olds. Limit to bullet
points with a description and conversation example.
From Google’s Prompting Guide
Iterative Prompting
● Limits: Gemini has limited output ~5,000
- 6,000 words of standard text, less with
tables.
● "Continue" Prompt: If the model reaches
the output token limit, simply reply with
"continue" to prompt it to generate the
next part of the response.
● Iteration: Repeat the "continue" prompt
as needed until the model has generated
the complete response
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
Model Armor - Overview
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf
Google
Agentspace.
https://www.youtube.com/watch?v=V-r0WjXJhL8
Proprietary + Confidential
Next ‘25

More Related Content

PDF
5 Things to Consider When Deploying AI in Your Enterprise
PDF
LJC-Unconference-2023-Keynote.pdf
PPTX
Extending the Visual Studio 2010 Code Editor to Visualize Runtime Intelligenc...
PDF
Unlocking the Power of ChatGPT and AI in Testing - NextSteps, presented by Ap...
PDF
Google Reveals Gemini 2.5 Pro I/O: Surpasses GPT-4 in Programming, Supports N...
DOCX
Game Engine Terminology
PDF
Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"
PPTX
Becoming a kinect hacker innovator v2
5 Things to Consider When Deploying AI in Your Enterprise
LJC-Unconference-2023-Keynote.pdf
Extending the Visual Studio 2010 Code Editor to Visualize Runtime Intelligenc...
Unlocking the Power of ChatGPT and AI in Testing - NextSteps, presented by Ap...
Google Reveals Gemini 2.5 Pro I/O: Surpasses GPT-4 in Programming, Supports N...
Game Engine Terminology
Kostiantyn Yelisavenko "Mastering Macro Benchmarking in .NET"
Becoming a kinect hacker innovator v2

Similar to [SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf (20)

PDF
Gemini 2.0 and Vertex AI for Innovation Workshop
PPTX
Taking Your Product Development to the Next Level with Full Stack
PPTX
Practics of Cross Platform Development with Gingee (case study)
PDF
Supercharge Your AI Development with Local LLMs
PPTX
Azure for cs50x Miami
PDF
Canvas real speaker
PDF
Faster deep learning solutions from training to inference - Amitai Armon & Ni...
PDF
GDG Cloud Southlake 31: Santosh Chennuri and Festus Yeboah: Empowering Develo...
KEY
Appcelerator iPhone/iPad Dev Con 2010 San Diego, CA
ZIP
iPhone/iPad Development with Titanium
PDF
Applying AI in Games (GDC2019)
PDF
Arya.ai artificial intelligence platform vinay kumar
PDF
AWS_Meetup_BLR_July_22_Social.pdf
PPTX
Training for New Users
PDF
Von JavaEE auf Microservice in 6 Monaten - The Good, the Bad, and the wtfs...
PPTX
Blugsphere2011 dev
ODP
Actminds Outsourcing Summit 07
DOCX
Lewis brady engine terminology (edited version)
DOCX
Lewis brady engine_terminology (edited version)
DOCX
James Williams_IT110_IP1.doc.docxIT110-1404A-02 Introduction.docx
Gemini 2.0 and Vertex AI for Innovation Workshop
Taking Your Product Development to the Next Level with Full Stack
Practics of Cross Platform Development with Gingee (case study)
Supercharge Your AI Development with Local LLMs
Azure for cs50x Miami
Canvas real speaker
Faster deep learning solutions from training to inference - Amitai Armon & Ni...
GDG Cloud Southlake 31: Santosh Chennuri and Festus Yeboah: Empowering Develo...
Appcelerator iPhone/iPad Dev Con 2010 San Diego, CA
iPhone/iPad Development with Titanium
Applying AI in Games (GDC2019)
Arya.ai artificial intelligence platform vinay kumar
AWS_Meetup_BLR_July_22_Social.pdf
Training for New Users
Von JavaEE auf Microservice in 6 Monaten - The Good, the Bad, and the wtfs...
Blugsphere2011 dev
Actminds Outsourcing Summit 07
Lewis brady engine terminology (edited version)
Lewis brady engine_terminology (edited version)
James Williams_IT110_IP1.doc.docxIT110-1404A-02 Introduction.docx
Ad

Recently uploaded (20)

PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Empathic Computing: Creating Shared Understanding
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
KodekX | Application Modernization Development
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Electronic commerce courselecture one. Pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Modernizing your data center with Dell and AMD
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPT
Teaching material agriculture food technology
Dropbox Q2 2025 Financial Results & Investor Presentation
Unlocking AI with Model Context Protocol (MCP)
Empathic Computing: Creating Shared Understanding
Spectral efficient network and resource selection model in 5G networks
Per capita expenditure prediction using model stacking based on satellite ima...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
KodekX | Application Modernization Development
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Electronic commerce courselecture one. Pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Modernizing your data center with Dell and AMD
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
CIFDAQ's Market Insight: SEC Turns Pro Crypto
The Rise and Fall of 3GPP – Time for a Sabbatical?
Teaching material agriculture food technology
Ad

[SHARED ONLINE] - GDG Cloud Iasi - Build with AI - Sponsored by Qodea.pdf

  • 1. Google - Build with AI April 2025 Florian Blaga Security and Compliance Lead
  • 2. Our People & Places Our 400+ best-in-class experts are based across the EMEA region, with locations including: Romania Iași The Netherlands Utrecht UK London, Manchester, Swindon, and Edinburgh France Paris Belgium Brussels
  • 4. And Google has put AI to work for 3B users in Google Workspace Spelling & Grammar Suggestions Smart Reply Noise Cancellation Portrait Restore Smart Compose Malware Detection and Prevention 2017 2018 2019 2020 2021 2022 2023 2016 Explore 2015 Suggested Actions Chart Suggestions Answers Grammar Suggestions Formula Acceleration Formula Suggestions Smart Compose Smart Compose File Suggestions Write & Re-Write AI-generated Summaries Sharing Suggestions Practice sets Originality reports Interactive questions for YT videos in Classroom
  • 5. Google keeps more people safe online than anyone else 4 billion Devices running Chrome are protected each day against malware and social engineering attacks 2.5 billion Active Gmail users protected against phishing, malware, and spam 5 billion Google Safe Browsing users devices protected each day from malware and social engineering . Petabytes Of cloud telemetry analyzed each day by Chronicle and Security Command Center (CSPM) for threat detection and response.
  • 6. Generative AI is a type of artificial intelligence that helps humans create content, such as text and images. It’ll enable a new era of work, as it can make humans significantly more productive, collaborative, and creative.
  • 7. Generative AI can transform industries and roles by a multiplying factor that is still unknown, and professionals that don’t use it will likely be less productive than those that do. As a consequence, Google believes organizations that adopt generative AI will have an opportunity to leapfrog those who don't.
  • 8. Generative AI has the potential to change the anatomy of work, augmenting the capabilities of individual workers by automating some of their individual activities. Current generative AI and other technologies have the potential to automate work activities that absorb 60 to 70 percent of employees’ time today.” McKinsey: The economic potential of generative AI: The next productivity frontier
  • 11. What is it Why it matters Gemini 2.0 Flash GA First production-ready Gemini 2.0 model Offers the same 1M context window as 1.5 Flash but with expanded capabilities (e.g., sophisticated integration and tool use) and better performance, exceeding 1.5 Pro on key benchmarks at twice the speed Gemini 2.0 Flash-Lite Preview Low-cost, high-performance Gemini 2.0 model Offers the same 1M context window as 1.5 Flash but with improved performance and a lower price for workloads that blend small- and large-context inputs and outputs. Gemini 2.5 Pro Experimental Superior intelligence with a 2M context window Ideal for coding, reasoning, complex prompts, and long context
  • 12. 3 Sizes | Improved Performance | New Output Capabilities
  • 14. ChatBot Arena #1,6,6 ,10 Top models as blindly evaluated by users SEAL Leaderboard #1 Mode for Visual Understanding while being orders of magnitude cheaper SEAL Leaderboard #4 Model for Coding while being orders of magnitude cheaper Vectara Hallucination #1,2,4 ,5,8 Lowest hallucination Rate
  • 15. Features Notes Gemini 2.0 Flash Gemini 2.0 Flash (Model ID: gemini-2.0-flash-exp) is available as an experimental preview release through AI Studio, the Gemini Developer API and the Vertex AI Gemini API. Gemini 2.0 Flash offers improvements in speed, quality, and advanced reasoning capabilities including enhanced understanding, coding, and instruction following. (Only available in us-central) Gemini 2.0 Flash Thinking Gemini 2.0 Flash Thinking is an experimental model that's trained to generate the "thinking process" the model goes through as part of its response. Google Gen AI SDK The new Google Gen AI SDK provides programmatic access to Gemini 2.0 using both the Developer and Vertex AI APIs in 4 languages: Python and Go with Java and Javascript coming in the future. Speech generation (early access/allowlist) Gemini 2.0 supports a new multimodal generation capability: text to speech as a private experimental release. Image generation (early access/allowlist) Gemini 2.0 supports image generation as a private experimental release. Multimodal Live API The Multimodal Live API enables low-latency, two-way interactions using text, audio, and video input, and audio and text output. Tools: Search as a tool Google Search is available as a tool. Using Grounding with Google Search, you can improve the accuracy and recency of responses from the model. Tools: Code execution The code execution feature enables the model to generate and run Python code and learn iteratively from the results until it arrives at a final output.
  • 18. Comprehensive Table of Google Gemini AI Models Feature Gemini 2.5 Pro Experimental Gemini 2.0 Flash Gemini 2.0 Flash-Lite Gemini 1.5 Flash Gemini 1.5 Flash-8B Gemini 1.5 Pro Imagen 3 Current release gemini-2.5-pro-exp-03-25 Latest (gemini-2.0-flash), Stable (-001) Latest (gemini-2.0-flash-lite), Stable (-001) Latest stable (gemini-1.5-flash), Stable (-001, -002) Latest stable (gemini-1.5-flash-8b), Stable (-001) Latest stable (gemini-1.5-pro), Stable (-001, -002) imagen-3.0-generate-002 Current model gemini-2.5-pro-exp-03-25 gemini-2.0-flash, -001, -exp, -thinking-exp gemini-2.0-flash-lite, -001 gemini-1.5-flash, -latest, -001, -002 gemini-1.5-flash-8b, -latest, -001 gemini-1.5-pro, -latest, -001, -002 imagen-3.0-generate-002 Multimodal inputs Audio, images, video, text Audio, images, video, text Audio, images, video, text Audio, images, video, text Audio, images, video, text Audio, images, video, text Text Text output Yes (limit 65,536) Yes (limit 8,192) Yes (limit 8,192) Yes (limit 8,192) Yes (limit 8,192) Yes (limit 8,192) No (input for image generation) Image output No Experimental No No (processes images as input) No No Yes (up to 4 per prompt) Audio output No Coming soon No No No No No Multimodal Live API No Experimental No No No No No Thinking model Yes Experimental version available No No No No No Context window Input: 1,048,576; Output: 65,536 Input: 1,048,576; Output: 8,192 Input: 1,048,576; Output: 8,192 Input: 1,048,576; Output: 8,192 Input: 1,048,576; Output: 8,192 Input: 2,097,152; Output: 8,192 Input: N/A; Output: Up to 4 Images Controlled generationStructured outputs Structured outputs, Function calling Structured outputs JSON mode, JSON schema System instructions, JSON mode, JSON schema System instructions, JSON mode, JSON schema Details not available Function calling Yes Yes No Yes Yes Yes No Search as a tool Yes (Search grounding) Yes (Search) No Yes (Search) No No No Code execution Yes Yes No Yes Yes Yes No Batch prediction Information unavailable Supported on Vertex AI Supported on Vertex AI Supported on Vertex AI Information unavailable Supported on Vertex AI Information unavailable Tuning No Supported on Vertex AI (-001) Supported on Vertex AI (-001) Supported Supported Supported on Vertex AI (-002) Supported on Vertex AI (customization, fine-tuning)
  • 19. AI Model Feature Comparison Table Feature GPT-4.5 GPT-4o Gemini 2.5 Pro Experimental Gemini 2.0 Flash Claude 3.7 Sonnet Grok-3 DeepSeek R1 Llama 3.2 Current Release Feb 2025 May 2024 (Image Gen Mar 2025) Mar 2025 Feb 2025 Feb 2025 Feb 2025 Jan 2025 Sep 2024 Current Model Orion Omni gemini-2.5-pro-exp-03- 25 gemini-2.0-flash-001 claude-3-7-sonnet-202 50219 Grok-3 671B MoE (37B active) Llama 3.2 Multimodal Inputs: Text Yes Yes Yes Yes Yes Yes Yes Yes Multimodal Inputs: Image Yes Yes Yes Yes Yes Yes Yes Yes Multimodal Inputs: Audio No Yes Yes Yes No No Yes Yes Text Output Yes Yes Yes Yes Yes Yes Yes Yes Image Output No Yes Yes Yes No Yes (via Aurora) No Yes Audio Output No Yes Yes Yes No No No Yes Multimodal Live API No Yes No Yes No No No No Thinking Model Scaled Unsupervised Learning Generative Pre-trained Transformer Thinking Model (Reasons Through Thoughts) Thinking Model (Reasons Before Answering) Hybrid Reasoning Reasoning-Based (Test-Time Computing) Reasoning Model (MoE) Decoder-Only Transformer Context Window 128K 128K 1M (2M Soon) 1M 200K 128K 16K 128K Controlled Generation Yes (Steerability) Yes Yes Yes Yes (Adjustable Reasoning Budget) Yes (Think/Big Brain Modes) Yes (Temperature) Yes Function Calling Yes Yes Yes (Excluding Compositional) Yes Yes (via Tools) No Explicit Mention No Yes (Tool Calling) Search as a Tool Yes Yes Yes Yes Yes Yes No Explicit Mention No Explicit Mention Code Execution Yes Yes Yes Yes Yes (via Claude Code) Yes No Explicit Mention Yes (Tool Calling Implied) Batch Prediction Yes Yes No Explicit Mention Yes Yes (Preview) No Explicit Mention Yes No Explicit Mention Tuning Yes Yes Yes Yes No Explicit Mention Yes Yes Yes
  • 21. Online Prediction Gemini 1.5 Flash Gemini 2.0 Flash-Lite Gemini 2.0 Flash Performance (Relative to Gemini 1.5 Flash) Slightly better than 1.5 Flash Much better than 1.5 Flash Input text / image / video $0.075 $0.15 (long context) $0.075 $0.10 Input audio $0.075 $0.15 (long context) $0.075 $1 Output text $0.3 $0.6 (long context) $0.3 $0.40 Output image NA NA Coming Soon Output audio NA NA Coming Soon Search as a Tool free query per day (QPD) $35/1K (standard search) NA 1,500 QPD Free (+ $35/1K overage) Context caching (storage) $1.00 / 1,000,000 tokens per hour Available March 31, 2025 How does pricing compare between models
  • 22. The enhanced reasoning model with 1M context window improved performance and explainability. Building on the speed and performance of 2.0 Flash, 2.0 Flash thinking exp excels in science and math while revealing its thought process to solve complex problems Progress is quick in this space Gemini Thinking: Now available in Gemini Advanced & Google AI Studio
  • 23. Instruction Models Eg. Gemini 2.0 Flash, GPT-4o, Claude 3.5, Llama 3… Strengths: ● Faster ● Cost Effective Weaknesses: ● Less capable at the most complex reasoning Example Use Cases: ● Summarization & Extraction of Content ● Creative Writing ● RAG & Fact based question answering Thinking Models Eg. 2.0 Flash Thinking, O3-mini, O1, Deepseek R1… Strengths: ● First Principles Thinking & Problem Solving Weaknesses: ● Much Slower ● More expensive (larger outputs) ● Overkill for basic tasks Example Use Cases: ● Multi Step Agentic Reasoning ● Novel Problem Solving (eg. Mathematics & Code)
  • 35. The new ‘googling’ - structuring your request Persona Task Context Format You are a Sales Manager. Write a guide describing 3 sales techniques to use for the Sales Team when selling a new car to 30 year olds. Limit to bullet points with a description and conversation example. From Google’s Prompting Guide
  • 36. Iterative Prompting ● Limits: Gemini has limited output ~5,000 - 6,000 words of standard text, less with tables. ● "Continue" Prompt: If the model reaches the output token limit, simply reply with "continue" to prompt it to generate the next part of the response. ● Iteration: Repeat the "continue" prompt as needed until the model has generated the complete response
  • 38. Model Armor - Overview