SlideShare a Scribd company logo
Google BigQuery is the future of Analytics! (Google Developer Conference)
Big Query
Google BigQuery is the future of Analytics!
MD. RASEL RANA
CTO & Scrum Master
LightCastle Partners
/raselrana raselcse10
Data that has three attributes(V’s)
can be ‘Big Data’
Velocity
Variety Volume
A fast, economical, fully managed and cloud
based interactive query service for large-scale
data analytics
BigQueryBig Data
How Big is B-I-G
Youtube
Media data
15+ exabytes (2017)
Inventory &
Customer Data
42 Terabytes (2014)
Gmail only
18.5+ petabytes (2018)
English article
10 + Terabytes
(2013)
Amazon Google Wikipedia
1. Generate big data reports require expensive servers and skilled database administrators
2. Interacting with big data has been expensive, slow and inefficient
3. BigQuery changes all that reducing time and expense to query data
4. Super fast SQL queries - run queries on terabyte data sets in seconds( 4.7TB data took 2.5 sec.)
5. Scalable – i) Store hundreds of terabytes ii) Pay only for what you use
6. Service for interactive analysis of massive datasets:
a) Query billions of rows: seconds to write, seconds to return
b) Uses a SQL style query syntax c) It's a service, accessed by a RESTful API
Why BigQuery
[
{
"mode": "NULLABLE",
"name": "version",
"type": "INTEGER"
},
{
"mode": "NULLABLE",
"name": "amount",
"type": "NUMERIC"
]
Integer: 64 bit signed
Float
String: UTF-8 encoded,
<64KB
Boolean: “true” or “false”
Timestamp: String - YYYY-
MM-DD HH:MM:SS
Numeric - seconds from
UNIX
Schema & Data Types
1. Project: All data in BigQuery belongs inside
a project (Set of users, APIs, authentication,
billing information)
2. Dataset: Holds one or more tables (Lowest
access control
3. Table: Row-column structure that contains
actual data
4. Job: Used to start potentially long running
queries
Project
Big Query
Jobs
Team access
Dataset
Dataset
Table
Table
Project Hierarchy
1. Table name is represented as follows:
Current Project
<dataset>.<table name>
e.g. lightcastle-data-testing:forecasting.sales
Datasets & Tables
BigQuery support following format for data loading
Avro, CSV, TSV, JSON,ORC, Parquet, Cloud Datastore exports, Cloud Firestore exports
Big Query
tool
Web
Browser
API
Big
Query
Data Format & Accessing BigQuery
SELECT extract(year from timestamp) as year, country, sum(amount) as total FROM
`lightcastle-data-testing.forecasting.sales` where version = 1 group by extract(year from
timestamp), country LIMIT 1000;
BigQuery Demo Using Web Interface
Visualization Tools
1. Data Studio
2. Tableau
3. Qlik View
4. Metric Insights
5. Jaspersoft
6. Bime
Analysis Using Google Data Studio
• CSV/JSON must be split into chunks less than 1TB
• Split to smaller files
Easier error recovery
To smaller data unit (day, month instead of year)
• Split tables by dates
Minimize cost of data scanned
Minimize query time
• Denormalize your data
• For Query - Query only the columns(SELECT name) that you need instead of select
all(SELECT *)
A Few Best Practices
• 1,000 import jobs per table per day
• 10,000 import jobs per project per day
• File size (for both CSV and JSON)
1GB for compressed file
1TB for uncompressed
• 10,000 files per import job
• 1TB per import job
BigQuery Data Load
• Use it when you have queries that run more than five seconds
• Major usage in Data Analytics
• BigQuery is good for scenarios where data does not change often
• Retailer using data to forecast product sales
• Ads targeting proper customer sections
• Log analysis is making sense of computer generated records
Use Cases of BigQuery
• Use it when you have queries that run more than five seconds
• Major usage in Data Analytics
• BigQuery is good for scenarios where data does not change often
• Retailer using data to forecast product sales
• Ads targeting proper customer sections
• Log analysis is making sense of computer generated records
Use Cases of BigQuery
BigQuery Job Vacancy (percentage)
BigQuery Pricing Summary
Operation Pricing Details
Active storage $0.020 per GB The first 10 GB is free each month.
Long-term storage $0.010 per GB The first 10 GB is free each month.
BigQuery Storage API $1.10 per TB The BigQuery Storage API is not included in
the free tier.
Streaming Inserts $0.010 per 200 MB You are charged for rows that are successfully
inserted. Individual rows are calculated using a 1
KB minimum size.
Queries (on-demand) $5.00 per TB First 1 TB per month is free
Queries (monthly flat-
rate)
$10,000 per 500 slots You can purchase additional slots in 500 slot
increments.
Get $300 free credit to spend over 12 months
Thank You!

More Related Content

PPTX
Google Developer Group - Cloud Singapore BigQuery Webinar
PDF
How BigQuery broke my heart
PDF
BigQuery for Beginners
PDF
Google BigQuery - Features & Benefits
PDF
Redshift VS BigQuery
PDF
Big query
PDF
Big Query Basics
PDF
Google BigQuery Best Practices
Google Developer Group - Cloud Singapore BigQuery Webinar
How BigQuery broke my heart
BigQuery for Beginners
Google BigQuery - Features & Benefits
Redshift VS BigQuery
Big query
Big Query Basics
Google BigQuery Best Practices

What's hot (20)

PDF
Exploring BigData with Google BigQuery
PDF
Google BigQuery for Everyday Developer
ODP
Big Data Analytics with Google BigQuery. By Javier Ramirez. All your base Co...
PDF
Intro to new Google cloud technologies: Google Storage, Prediction API, BigQuery
PDF
Google Cloud Platform at Vente-Exclusive.com
PPTX
BigQuery for the Big Data win
PDF
TDC2016SP - Trilha BigData
PDF
An overview of BigQuery
PDF
2017 09-27 democratize data products with SQL
PPTX
30 days of google cloud event
PPTX
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
PDF
How Google Does Big Data - DevNexus 2014
PDF
Google and big query
PDF
Google Bigtable
PDF
Google BigQuery
PPTX
Google BigQuery 101 & What’s New
PDF
Self Service Analytics at Twitch
PPTX
Your data layer - Choosing the right database solutions for the future
PPTX
Modern data warehouse
PDF
An indepth look at Google BigQuery Architecture by Felipe Hoffa of Google
Exploring BigData with Google BigQuery
Google BigQuery for Everyday Developer
Big Data Analytics with Google BigQuery. By Javier Ramirez. All your base Co...
Intro to new Google cloud technologies: Google Storage, Prediction API, BigQuery
Google Cloud Platform at Vente-Exclusive.com
BigQuery for the Big Data win
TDC2016SP - Trilha BigData
An overview of BigQuery
2017 09-27 democratize data products with SQL
30 days of google cloud event
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
How Google Does Big Data - DevNexus 2014
Google and big query
Google Bigtable
Google BigQuery
Google BigQuery 101 & What’s New
Self Service Analytics at Twitch
Your data layer - Choosing the right database solutions for the future
Modern data warehouse
An indepth look at Google BigQuery Architecture by Felipe Hoffa of Google
Ad

Similar to Google BigQuery is the future of Analytics! (Google Developer Conference) (19)

PPTX
bigquery.pptx
PDF
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
PDF
Big Query - Women Techmarkers (Ukraine - March 2014)
PDF
Webinar: Faster Big Data Analytics with MongoDB
PDF
Webinar: NoSQL as the New Normal
PPTX
Introduction to Big Data
ODP
BigQuery at AppsFlyer - past, present and future
PDF
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
PDF
SEO for Large/Enterprise Websites - Data & Tech Side
PDF
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
PPTX
Big data by Mithlesh sadh
PDF
Using ClickHouse for Experimentation
PDF
the Data World Distilled
PDF
High-performance database technology for rock-solid IoT solutions
PDF
Elastic Stack: Using data for insight and action
PPTX
Architecting Cloudy Applications
PPTX
Apache IOTDB: a Time Series Database for Industrial IoT
PPT
AWS Summit Berlin 2013 - Big Data Analytics
PPTX
PostgreSQL as a Strategic Tool
 
bigquery.pptx
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
Big Query - Women Techmarkers (Ukraine - March 2014)
Webinar: Faster Big Data Analytics with MongoDB
Webinar: NoSQL as the New Normal
Introduction to Big Data
BigQuery at AppsFlyer - past, present and future
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
SEO for Large/Enterprise Websites - Data & Tech Side
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
Big data by Mithlesh sadh
Using ClickHouse for Experimentation
the Data World Distilled
High-performance database technology for rock-solid IoT solutions
Elastic Stack: Using data for insight and action
Architecting Cloudy Applications
Apache IOTDB: a Time Series Database for Industrial IoT
AWS Summit Berlin 2013 - Big Data Analytics
PostgreSQL as a Strategic Tool
 
Ad

Recently uploaded (20)

PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPT
Teaching material agriculture food technology
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Review of recent advances in non-invasive hemoglobin estimation
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
The AUB Centre for AI in Media Proposal.docx
Understanding_Digital_Forensics_Presentation.pptx
cuic standard and advanced reporting.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation theory and applications.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Dropbox Q2 2025 Financial Results & Investor Presentation
20250228 LYD VKU AI Blended-Learning.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Teaching material agriculture food technology
Mobile App Security Testing_ A Comprehensive Guide.pdf

Google BigQuery is the future of Analytics! (Google Developer Conference)

  • 2. Big Query Google BigQuery is the future of Analytics!
  • 3. MD. RASEL RANA CTO & Scrum Master LightCastle Partners /raselrana raselcse10
  • 4. Data that has three attributes(V’s) can be ‘Big Data’ Velocity Variety Volume A fast, economical, fully managed and cloud based interactive query service for large-scale data analytics BigQueryBig Data
  • 5. How Big is B-I-G Youtube Media data 15+ exabytes (2017) Inventory & Customer Data 42 Terabytes (2014) Gmail only 18.5+ petabytes (2018) English article 10 + Terabytes (2013) Amazon Google Wikipedia
  • 6. 1. Generate big data reports require expensive servers and skilled database administrators 2. Interacting with big data has been expensive, slow and inefficient 3. BigQuery changes all that reducing time and expense to query data 4. Super fast SQL queries - run queries on terabyte data sets in seconds( 4.7TB data took 2.5 sec.) 5. Scalable – i) Store hundreds of terabytes ii) Pay only for what you use 6. Service for interactive analysis of massive datasets: a) Query billions of rows: seconds to write, seconds to return b) Uses a SQL style query syntax c) It's a service, accessed by a RESTful API Why BigQuery
  • 7. [ { "mode": "NULLABLE", "name": "version", "type": "INTEGER" }, { "mode": "NULLABLE", "name": "amount", "type": "NUMERIC" ] Integer: 64 bit signed Float String: UTF-8 encoded, <64KB Boolean: “true” or “false” Timestamp: String - YYYY- MM-DD HH:MM:SS Numeric - seconds from UNIX Schema & Data Types
  • 8. 1. Project: All data in BigQuery belongs inside a project (Set of users, APIs, authentication, billing information) 2. Dataset: Holds one or more tables (Lowest access control 3. Table: Row-column structure that contains actual data 4. Job: Used to start potentially long running queries Project Big Query Jobs Team access Dataset Dataset Table Table Project Hierarchy
  • 9. 1. Table name is represented as follows: Current Project <dataset>.<table name> e.g. lightcastle-data-testing:forecasting.sales Datasets & Tables
  • 10. BigQuery support following format for data loading Avro, CSV, TSV, JSON,ORC, Parquet, Cloud Datastore exports, Cloud Firestore exports Big Query tool Web Browser API Big Query Data Format & Accessing BigQuery
  • 11. SELECT extract(year from timestamp) as year, country, sum(amount) as total FROM `lightcastle-data-testing.forecasting.sales` where version = 1 group by extract(year from timestamp), country LIMIT 1000; BigQuery Demo Using Web Interface
  • 12. Visualization Tools 1. Data Studio 2. Tableau 3. Qlik View 4. Metric Insights 5. Jaspersoft 6. Bime Analysis Using Google Data Studio
  • 13. • CSV/JSON must be split into chunks less than 1TB • Split to smaller files Easier error recovery To smaller data unit (day, month instead of year) • Split tables by dates Minimize cost of data scanned Minimize query time • Denormalize your data • For Query - Query only the columns(SELECT name) that you need instead of select all(SELECT *) A Few Best Practices
  • 14. • 1,000 import jobs per table per day • 10,000 import jobs per project per day • File size (for both CSV and JSON) 1GB for compressed file 1TB for uncompressed • 10,000 files per import job • 1TB per import job BigQuery Data Load
  • 15. • Use it when you have queries that run more than five seconds • Major usage in Data Analytics • BigQuery is good for scenarios where data does not change often • Retailer using data to forecast product sales • Ads targeting proper customer sections • Log analysis is making sense of computer generated records Use Cases of BigQuery • Use it when you have queries that run more than five seconds • Major usage in Data Analytics • BigQuery is good for scenarios where data does not change often • Retailer using data to forecast product sales • Ads targeting proper customer sections • Log analysis is making sense of computer generated records Use Cases of BigQuery
  • 16. BigQuery Job Vacancy (percentage)
  • 17. BigQuery Pricing Summary Operation Pricing Details Active storage $0.020 per GB The first 10 GB is free each month. Long-term storage $0.010 per GB The first 10 GB is free each month. BigQuery Storage API $1.10 per TB The BigQuery Storage API is not included in the free tier. Streaming Inserts $0.010 per 200 MB You are charged for rows that are successfully inserted. Individual rows are calculated using a 1 KB minimum size. Queries (on-demand) $5.00 per TB First 1 TB per month is free Queries (monthly flat- rate) $10,000 per 500 slots You can purchase additional slots in 500 slot increments. Get $300 free credit to spend over 12 months