Data Science with Google Analytics
Is it possible, and how?
Alexandros Papageorgiou
MeasureCamp London. September 2017.
About me
Account strategist @ Google
Back to school - Data analytics
analyst @alex-papageo.com
Consumer behaviour analyst @ WhatClinic.com
Data...data...data
It’s in our menu every day!
Is it data science?
Can it be data science?
(What's data science anyway?)
“Google Analytics is basically for marketers”
(A data scientist at a big internet company, to me)
The landscape back in 2015
A year or so later
Hype ?
Back to the question:
Is DS with GA possible?
Answer: Yes it is. But…
1. Access the Data
Google API + R/Python Libraries.
R
googleAnalyticsR
RGA
RGoogleAnalytics
Python
Pandas ga
google2pandas
2. Initial variable selection
3. Un-sample the data
E.g. day-by-day or hour-by-hour API calls.
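A minimal sketch of the day-by-day idea, assuming the request body shape of the Reporting API v4 (`viewId`, `dateRanges`, `metrics`, `dimensions`, `samplingLevel`). The view ID, the chosen metrics and dimensions, and the helper names `daily_windows` and `build_daily_requests` are hypothetical. Sending the requests is left to whichever client library is used.

```python
from datetime import date, timedelta


def daily_windows(start, end):
    """Yield (day, day) pairs covering start..end inclusive, one per calendar day."""
    day = start
    while day <= end:
        yield day, day
        day += timedelta(days=1)


def build_daily_requests(view_id, start, end, metrics, dimensions):
    """Build one report request body per day.

    A shorter date range means fewer sessions per query, which makes it
    less likely that a single query crosses the sampling threshold.
    """
    bodies = []
    for first, last in daily_windows(start, end):
        bodies.append({
            "viewId": view_id,
            "dateRanges": [{"startDate": first.isoformat(),
                            "endDate": last.isoformat()}],
            "metrics": [{"expression": m} for m in metrics],
            "dimensions": [{"name": d} for d in dimensions],
            "samplingLevel": "LARGE",
        })
    return bodies


# Hypothetical view ID and field selection, for illustration only.
daily_requests = build_daily_requests(
    view_id="123456789",
    start=date(2017, 9, 1),
    end=date(2017, 9, 3),
    metrics=["ga:sessions"],
    dimensions=["ga:date", "ga:sourceMedium"],
)
```

The per-day results can then be appended into one table. Hour-by-hour calls would follow the same pattern with an extra hour filter.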
4. Transform the data
Bring it to atomic/event level, so that every row corresponds to an individual observation:
- User
- Session
- Page
User explorer ?
Query multiple dimensions
Result: Session level data
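A rough sketch of how a multi-dimension export might be collapsed into session-level rows. The field names (`session_id`, `source`, `page`), the sample hits and the `/thank-you` goal page are all assumptions for illustration, not part of the talk.

```python
# Hypothetical hit-level export: one row per pageview, queried with several
# dimensions at once (the session ID is assumed to be a custom dimension).
hits = [
    {"session_id": "s1", "source": "google", "page": "/home"},
    {"session_id": "s1", "source": "google", "page": "/pricing"},
    {"session_id": "s2", "source": "newsletter", "page": "/home"},
    {"session_id": "s1", "source": "google", "page": "/thank-you"},
]


def to_session_rows(hits, goal_page="/thank-you"):
    """Collapse pageview rows into one row per session."""
    sessions = {}
    for hit in hits:
        # Create the session row the first time its ID is seen.
        row = sessions.setdefault(hit["session_id"], {
            "session_id": hit["session_id"],
            "source": hit["source"],
            "pageviews": 0,
            "converted": 0,
        })
        row["pageviews"] += 1
        if hit["page"] == goal_page:
            row["converted"] = 1
    return list(sessions.values())


session_rows = to_session_rows(hits)
```

Each resulting row is one observation with its own features and a conversion label, which is the shape most modelling tools expect.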
Custom dimensions
Supercharging websites with a real-time R API http://code.markedmondson.me/predictClickOpenCPU/supercharge.html#1
Improve Data Collection With Four Custom Dimensions https://www.simoahava.com/analytics/improve-data-collection-with-four-custom-dimensions/
E.g. Cust ID + Timestamp
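With a customer ID and a hit timestamp available as custom dimensions, the rows can be ordered into one page path per customer. The sketch below assumes hypothetical field names (`cust_id`, `timestamp`, `page`) and an ISO-style timestamp format. The real format depends on how the custom dimension is populated.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical export: one row per pageview, arriving in no particular order.
rows = [
    {"cust_id": "A", "timestamp": "2017-09-01T10:00:05", "page": "/pricing"},
    {"cust_id": "B", "timestamp": "2017-09-01T10:00:01", "page": "/home"},
    {"cust_id": "A", "timestamp": "2017-09-01T10:00:00", "page": "/home"},
    {"cust_id": "A", "timestamp": "2017-09-01T10:01:30", "page": "/signup"},
]


def page_paths(rows):
    """Return {customer ID: pages in the order they were viewed}."""
    by_customer = defaultdict(list)
    for row in rows:
        ts = datetime.strptime(row["timestamp"], "%Y-%m-%dT%H:%M:%S")
        by_customer[row["cust_id"]].append((ts, row["page"]))
    # Sorting the (timestamp, page) tuples orders each customer's hits by time.
    return {cust: [page for _, page in sorted(visits)]
            for cust, visits in by_customer.items()}


paths = page_paths(rows)
```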
5. Store the data (recommended)
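One lightweight way to do this is a local SQLite file, so repeated API pulls do not have to be re-run. This is only a sketch: the table layout, column names and sample rows are assumptions, and an in-memory database stands in for a real file.

```python
import sqlite3


def store_sessions(conn, session_rows):
    """Insert session rows, replacing any row that has the same session ID."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS sessions (
               session_id TEXT PRIMARY KEY,
               date       TEXT,
               source     TEXT,
               pageviews  INTEGER,
               converted  INTEGER
           )"""
    )
    conn.executemany(
        "INSERT OR REPLACE INTO sessions "
        "VALUES (:session_id, :date, :source, :pageviews, :converted)",
        session_rows,
    )
    conn.commit()


# Hypothetical session-level rows, for illustration only.
sample = [
    {"session_id": "s1", "date": "2017-09-01", "source": "google",
     "pageviews": 3, "converted": 1},
    {"session_id": "s2", "date": "2017-09-01", "source": "newsletter",
     "pageviews": 1, "converted": 0},
]

conn = sqlite3.connect(":memory:")  # use a file path to persist between runs
store_sessions(conn, sample)
store_sessions(conn, sample)  # re-running the same pull does not duplicate rows
stored = conn.execute("SELECT COUNT(*) FROM sessions").fetchone()[0]
```

Keying the table on the session ID means a day can be re-fetched and re-stored without creating duplicates.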
6. Model & Communicate the data
https://github.com/papageorgiou/dublinR-talk-analytics http://www.alex-papageo.com/research.html
E.g. Decision tree & variable importance for conversion prediction
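To give a feel for what "variable importance" measures, here is a small sketch of the Gini impurity decrease for a single split, the kind of quantity a CART-style tree uses when choosing a variable. It is not the rpart implementation: it scores a multiway split on a categorical feature rather than rpart's binary splits, and the data and feature names are invented.

```python
from collections import defaultdict


def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)


def gini_gain(rows, feature, target="converted"):
    """Impurity decrease from splitting rows on a categorical feature."""
    parent = gini([r[target] for r in rows])
    groups = defaultdict(list)
    for r in rows:
        groups[r[feature]].append(r[target])
    # Weight each branch's impurity by the share of rows it receives.
    weighted = sum(len(g) / len(rows) * gini(g) for g in groups.values())
    return parent - weighted


# Hypothetical session-level data, for illustration only.
sessions = [
    {"returning": "yes", "device": "mobile", "converted": 1},
    {"returning": "yes", "device": "desktop", "converted": 1},
    {"returning": "no", "device": "mobile", "converted": 0},
    {"returning": "no", "device": "desktop", "converted": 0},
]

gains = {f: gini_gain(sessions, f) for f in ("returning", "device")}
```

In this toy data `returning` separates converters perfectly and `device` carries no information, so the first gets the full gain and the second gets none.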
Some ideas
● Decision tree for conversion prediction (rpart)
● Clickstream analysis to predict next page (clickstream)
● Clustering for customer segmentation (base R)
● Association Rules for pages/products frequently visited together (arules)
Enrich data: look for opportunities to join with internal data or external API data
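As an illustration of the next-page idea, the sketch below counts page-to-page transitions (a first-order Markov view of the clickstream) and predicts the most frequent follower. It is a plain-Python stand-in, not the R clickstream package named above, and the paths are invented.

```python
from collections import Counter, defaultdict


def fit_transitions(paths):
    """Count how often each page is followed by each other page."""
    counts = defaultdict(Counter)
    for path in paths:
        for current, following in zip(path, path[1:]):
            counts[current][following] += 1
    return counts


def predict_next(counts, page):
    """Return the most frequent next page, or None for an unseen page."""
    if page not in counts:
        return None
    return counts[page].most_common(1)[0][0]


# Hypothetical page paths, one list per session.
example_paths = [
    ["/home", "/pricing", "/signup"],
    ["/home", "/pricing"],
    ["/home", "/blog"],
]

transitions = fit_transitions(example_paths)
next_after_home = predict_next(transitions, "/home")
```

Dividing each row of counts by its total would give transition probabilities, if a score is needed instead of a single guess.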
Wrapping up
DS with GA not straightforward but possible
Take advantage of GA API
Open source R/Python libraries
Get familiar with 2-3 algorithms and apply them to your data.
Thank you!

