Regression Analysis & Prediction
Devon Jones
Lead Systems Engineer, Knewton

Gabe Hamilton
Software Engineering Mgr, Revionics
Tech Confluence
For those who work downtown, check out our DOSUG-inspired group.
meetup.com/TechConfluence
3rd Wednesday of the month
At lunch: 12:30 - 1:30pm
The Plan
1. Regression Analysis - Devon
2. Google Prediction API - Gabe
3. Applying Regression - Devon
Google Prediction API

[
  { "label": "awesome", "score": 0.98 },
  { "label": "lame", "score": 0.08 }
]

Gabe Hamilton
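
A result shaped like the one on this slide can be reduced to a single answer by taking the highest-scoring label. This is a minimal sketch, assuming the scores arrive as a JSON list of label/score pairs; the actual API response may carry more fields, and `response_text` and `best_label` are illustrative names only.

```python
import json

# Hypothetical response text, mirroring the label/score pairs on the slide.
response_text = """
[
  {"label": "awesome", "score": 0.98},
  {"label": "lame", "score": 0.08}
]
"""


def best_label(scores):
    """Return the label of the entry with the highest score."""
    return max(scores, key=lambda item: item["score"])["label"]


scores = json.loads(response_text)
top = best_label(scores)
print(top)
```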
What kind of Prediction?
Predict an output value based on some input values.
Things like: Sentiment Analysis, Spam Detection, Today's temperature, GDP Growth
DOSUG Intro to google prediction api
How does Google predict things?
Well, it's Google
Through an intensive breeding program, Google has managed to distribute Punxsutawney Phils
throughout its datacenters across the world. Each Phil is kept in a climate-controlled enclosure that
mimics the conditions of a perfectly average February 2nd. A full-scale digital sundial maps your
problem domain onto the shadow matrix of the enclosure, allowing each Phil to fully interact with
your model. The early spring / long winter emergence probability of each Phil is then sorted and
reduced to determine the final result returned by the prediction API.
No Really, How do they do it?
Short answer: I have no idea
Long answer: It's a service, they can do whatever works, swap implementations, run multiple algorithms
Possible Implementations
Regression Analysis
Neural Networks
Support Vector Machine
Monte Carlo Sim
Decision Trees
Evolutionary Algorithms

Basically it is STATISTICS
Types of Prediction you can do
Regression
How do inputs cause an output to vary?
Output is a numeric value: Shopping Cart Size, Stock Price

Classification
Deciding which bucket some input belongs in
Buckets are text values: French, Spanish, English
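
To make the two types concrete, here is a small sketch with made-up data. The regression half fits a one-input least-squares line and returns a number; the classification half is a toy keyword-overlap rule that returns a bucket name. Neither is claimed to be what the Prediction API does internally, and all names and data are invented for illustration.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for a single input: returns (slope, intercept)."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def classify(text, keywords_by_bucket):
    """Pick the bucket whose keyword set overlaps the text the most."""
    words = set(text.lower().split())
    return max(keywords_by_bucket, key=lambda b: len(words & keywords_by_bucket[b]))


# Regression: the output is a numeric value (made-up data: items viewed -> cart size).
slope, intercept = fit_line([1, 2, 3, 4], [2.0, 4.0, 6.0, 8.0])
predicted_cart_size = slope * 5 + intercept

# Classification: the output is a text bucket (made-up keyword lists).
buckets = {
    "French": {"je", "pense", "donc"},
    "Spanish": {"yo", "pienso", "luego"},
    "English": {"i", "think", "therefore"},
}
language = classify("Je pense donc je suis", buckets)

print(predicted_cart_size, language)
```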
What is Classification good for?
Classification
● Sentiment analysis
● Spam detection
● Language categorization
● Tagging
● Assign priority to bugs
● Predict movie ratings
● Message routing decisions
● <Your brilliant idea here>
Getting Started
Hello World page is great
https://developers.google.com/prediction/docs/hello_world
So you have a big pile of data
Time for some cleanup
90% of the development time is data cleanup

Good talk on data driven projects
http://www.slideshare.net/ryanweald/building-data-driven-products-with-ruby-rubyconf-2012
CSV Input file aka Training Set
First column is expected values.
2nd through N columns are input values

"French", "Je pense donc j'essuie", "Paris"
 Output    an input                  more input

No header columns
250MB max file size
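
A training file in this layout can be produced with Python's standard `csv` module. This is a sketch under stated assumptions: the rows are made up, `training_csv` is an illustrative helper, and reading "250MB" as 250 * 1024 * 1024 bytes is a guess about how the limit is counted.

```python
import csv
import io

# 250MB limit from the slide; the exact byte count is an assumption.
MAX_BYTES = 250 * 1024 * 1024

# Made-up training rows: expected value first, then the input values.
rows = [
    ("French", "Je pense donc j'essuie", "Paris"),
    ("Spanish", "Pienso, luego existo", "Madrid"),
    ("English", "I think, therefore I am", "London"),
]


def training_csv(rows):
    """Serialize rows as CSV text: text fields quoted, no header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, quoting=csv.QUOTE_NONNUMERIC, lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()


csv_text = training_csv(rows)
size_ok = len(csv_text.encode("utf-8")) <= MAX_BYTES
print(csv_text)
```

Quoting matters here because input text often contains commas, as in the second and third rows.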
4 Steps to Prediction
1. Create a CSV file of your training data
2. Create a new Project in the Prediction API
   a. requires entering billing info
3. Upload your csv file to Google Storage
4. In Prediction API Browser:
   a. insert a new training set (the csv file)
   b. view your trained set
   c. use trainedmodel.predict to make predictions

See the hello world for details of the method calls
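
Steps 4a and 4c each take a small JSON request body. The sketch below only builds those bodies and makes no network calls. The field names (`id`, `storageDataLocation`, `input.csvInstance`) are assumed from the v1.6 API and should be checked against the hello world page; the model id and bucket path are hypothetical.

```python
import json

# Hypothetical names; substitute your own model id and bucket/object path.
MODEL_ID = "language-identifier"
TRAINING_DATA = "my-bucket/language_id.csv"


def insert_body(model_id, storage_location):
    """Body for step 4a: register the uploaded CSV as a training set (assumed field names)."""
    return {"id": model_id, "storageDataLocation": storage_location}


def predict_body(*inputs):
    """Body for step 4c: input values only, in the same order as the training columns."""
    return {"input": {"csvInstance": list(inputs)}}


print(json.dumps(insert_body(MODEL_ID, TRAINING_DATA), indent=2))
print(json.dumps(predict_body("Je pense donc j'essuie", "Paris"), indent=2))
```

Note that the predict body leaves out the first training column: that is the value the model is being asked to supply.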
Let's make some predictions...
Live demo screenshots: List Models
Live demo screens: Analyze Model
Live demo: Predict Model Category
Live demo: Predict Model Numeric
Storage for datasets
https://storage.cloud.google.com

API Explorer
https://developers.google.com/apis-explorer/#s/prediction/v1.6/
