SlideShare a Scribd company logo
Baltimore
BIGQUERY
Code Labs
DevFest DC - 2017
SELECT tweet.@dineshMD,email.dinesh@acarin.net from Contacts
Dinesh Ganesan
Why do we need Big Data tools?
BigQuery - Show the power
Getting Started
Let’s Play - Analyzing USA Census Data within few seconds.
Loading data into BigQuery
Talking to BigQuery
Q&A
4
Agenda?
Why Big Data?
5
3.	Why	“Big	Data”	Is	a	Big	Deal	-	Harvard	Magazine
Enabled	Biologist	to	compute	the	genome	sequence	within	24	hours	rather	waiting	for	a	full	year.			
-	Reducing	Cost	&	Time
Enabled	NASA	to	provide	Climate	Modeling	as	Service(MERRA)	using	data	since	1979	(	few	hundred	peta	bytes	)	
-	Better	Living	and	New	business	opportunities.
2012	–	100	Billion	Google	Searches	every	month.	Leverages	30	trillion	URLs	for	data	collection
-	Optimal	Knowledge	from	huge	data	size.
2.	MERRA:	MODERN	ERA-RETROSPECTIVE	ANALYSIS	FOR	RESEARCH	AND	APPLICATIONS
1.	Google	Search	-	Wikipedia
Getting Started - GCloud Registration
1.	https://console.cloud.google.com
1
2 3
6
Getting Started - Project Creation
1
3
7
2
4
Refresh the page if you don’t see the project name
Getting Started - Enable API
8
5
Getting Started - Lab
6 7 8
9
Getting Started - Lab
9
10
Query Editor
11
Query Editor
UDF Editor
Query Execution
select year, SUM(record_weight) as Births from
publicdata:samples.natality where (mother_race = 18 and
father_race = 18) group by year order by year;
Asian Indian Kids born
The race of the child. One of the following numbers:
1 - White
2 - Black
3 - American Indian
4 - Chinese
5 - Japanese
6 - Hawaiian
7 - Filipino
9 - Unknown/Other
18 - Asian Indian
28 - Korean
39 - Samoan
48 - Vietnamese
SOURCE DATA Formats
13
CSV
JSON
Avro
Cloud datastore backups
SOURCE DATA Formats
14
http://www.ssa.gov/OACT/babynames/names.zip
https://goo.gl/qZW9sx
Loading data - CSV Direct Upload
15
1
2
3
Loading data - CSV Direct Upload
16
4
Loading data - CSV Direct Upload
17
4
name:string,gender:string,count:integer
Loading Data - GCP Storage
18
gs://ssa-babynames/names/yob2013.txt
How about loading multiple files into a single
table?
Running Queries
19
Standard SQL
Simple Aggregates
20
Select COUNT(GoogleDevs),MAX(GoogleDevs),
STDDEV(GoogleDevs) FROM …
Complex Processing
21
Select … FROM … where
REGEXP_MATCH(url,”Acarin$”) AND user
CONTAINS ‘Mr.Foo’
Nested Select
22
Select count(*) from ( Select devGurus ….) Group
BY Name
Small JOIN
23
Select tbl_school.student_id from tbl_school JOIN
tbl_classroom ON tbl_classroom.student_id =
tbl_shool.student_id
Talking to BigQuery
Accessing data through API
25
https://developers.google.com/api-client-library/
Validating API
26
https://developers.google.com/apis-explorer/#p/bigquery/v2/
Validating API
27
Integrating with Google Sheet
28
https://developers.google.com/apps-script/advanced/bigquery
https://goo.gl/evaRRC
1 2
Q&A
T H A N K Y O U
SELECT @dineshMD from Contact.tweet

More Related Content

PPTX
Big data
PPTX
Big data peresintaion
PDF
Big Data Story - From An Engineer's Perspective
PPTX
Gail Zhou on "Big Data Technology, Strategy, and Applications"
PDF
Big Data on Public Cloud
PDF
Introduction to Big Data by Manouj Bongirr
PDF
Big Data, Big Deal: For Future Big Data Scientists
PPTX
Big data
Big data
Big data peresintaion
Big Data Story - From An Engineer's Perspective
Gail Zhou on "Big Data Technology, Strategy, and Applications"
Big Data on Public Cloud
Introduction to Big Data by Manouj Bongirr
Big Data, Big Deal: For Future Big Data Scientists
Big data

What's hot (20)

PPTX
A Big Data Timeline
PPTX
Data mining with big data
PDF
Big Data presentation Tensing
PDF
Introduction to Big Data
PPTX
The future of mobile and big data
PPTX
BIG DATA & DATA ANALYTICS
PDF
Big Data, Big Opportunities
PPTX
Big data
PPTX
Peter Elleby - Big Data, Big Noise, Big Hope - No Miracles
PDF
Big Data
PPTX
Big Data Analytics
PPTX
BIG DATA
ODP
Identifying sick cannabis with ai defcon 2018
PPT
Big Data
PPTX
PPTX
Presentation on Big Data
PPTX
Overview of bigdata
PPTX
Big data
PPTX
Big data? No. Big Decisions are What You Want
PPTX
Big data, Hadoop and Hive
A Big Data Timeline
Data mining with big data
Big Data presentation Tensing
Introduction to Big Data
The future of mobile and big data
BIG DATA & DATA ANALYTICS
Big Data, Big Opportunities
Big data
Peter Elleby - Big Data, Big Noise, Big Hope - No Miracles
Big Data
Big Data Analytics
BIG DATA
Identifying sick cannabis with ai defcon 2018
Big Data
Presentation on Big Data
Overview of bigdata
Big data
Big data? No. Big Decisions are What You Want
Big data, Hadoop and Hive
Ad

Similar to Biq query devfest2017_slides (20)

PPTX
Big Data By Vijay Bhaskar Semwal
PPTX
Our big data
PPT
Big data and Internet
PPTX
The What, Why and How of Big Data
PDF
BIG Data and Methodology-A review
PDF
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
PPTX
INN530 - Assignment 2, Big data and cloud computing for management
PDF
Key note big data analytics ecosystem strategy
PPTX
big-data-8722-m8RQ3h1.pptx
PPTX
bigdata.pptx
PPTX
Data mining with big data
PPTX
Big data
PPTX
Big data Ppt
PDF
How Google Does Big Data - DevNexus 2014
PDF
Big Data - Gerami
PPTX
Big data Analytics
PDF
Big Data for One Big Family
PPT
Research issues in the big data and its Challenges
PPTX
Big Data Lessons from the Cloud
Big Data By Vijay Bhaskar Semwal
Our big data
Big data and Internet
The What, Why and How of Big Data
BIG Data and Methodology-A review
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
INN530 - Assignment 2, Big data and cloud computing for management
Key note big data analytics ecosystem strategy
big-data-8722-m8RQ3h1.pptx
bigdata.pptx
Data mining with big data
Big data
Big data Ppt
How Google Does Big Data - DevNexus 2014
Big Data - Gerami
Big data Analytics
Big Data for One Big Family
Research issues in the big data and its Challenges
Big Data Lessons from the Cloud
Ad

Recently uploaded (20)

PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Machine learning based COVID-19 study performance prediction
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Empathic Computing: Creating Shared Understanding
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
sap open course for s4hana steps from ECC to s4
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Understanding_Digital_Forensics_Presentation.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
Machine learning based COVID-19 study performance prediction
Mobile App Security Testing_ A Comprehensive Guide.pdf
Encapsulation_ Review paper, used for researhc scholars
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
MYSQL Presentation for SQL database connectivity
Advanced methodologies resolving dimensionality complications for autism neur...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Dropbox Q2 2025 Financial Results & Investor Presentation
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Empathic Computing: Creating Shared Understanding
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
sap open course for s4hana steps from ECC to s4
The Rise and Fall of 3GPP – Time for a Sabbatical?
Understanding_Digital_Forensics_Presentation.pptx

Biq query devfest2017_slides