PredictionIO
Adam Wang
Annie Cheng
Brett Clancy
Erin Chen
Han-Wen Chen
Siyuan Wang
Yiting Wang
Koober
Meet Tim
Tim’s Challenge
Provide an efficient service for customers and taxi drivers by…
• Wait Times
• Revenue
Tim has some questions
How should taxis be allocated among different locations?
What factors drive customers to use taxis more?
But most importantly…
How can the taxi industry utilize past data to predict the future needs of their riders?
Current Solutions
iTaxi
Shiny
TaxiPrediction
Our Solution
Koober. An open-source interactive website for visualizing past and predicting future taxi demand.
Why Koober?
Demand Analysis + Prediction. Empower taxi owners with valuable rider information backed by past trends.
Anyone in the Ride-Sharing Industry. Plan more appropriate routes and make better transportation decisions.
So how does it work?
1. Choose your info need: analysis or prediction.
2. Adjust your input query parameters.
3. Visualize taxi demand on the heat maps.
4. Make more informed taxi resource decisions.
Let’s try it out!
https://koober-dashboard.herokuapp.com/
How does it really work?
Train and test machine learning models.
Display demand on interactive heat maps.
What have we done?
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
Raw Data
NYC Taxi & Limousine Commission:
Yellow Taxi Cab Dataset
Weather Underground API:
• Historical Weather Data
• Current Conditions
• 10-Day Forecast
Task 1
Demo Data Loader
Task 1
Development
• Generate fake traffic data based on user-defined parameters
• Simplify debugging and validating the model prediction result
Production
• Process NYC Taxi Data and integrate with the historical weather data
• Incorporate Kafka to facilitate importing large amounts of training data
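The development-mode idea of generating fake traffic data from user-defined parameters can be sketched as follows. This is only an illustration: the function name `generate_fake_pickups`, its parameters, and the Gaussian scatter around a center point are assumptions, not Koober's actual loader.

```python
import random
from datetime import datetime, timedelta


def generate_fake_pickups(n, center_lat, center_lng, spread_deg, start, days, seed=0):
    """Return n fake pickup events scattered around a center point.

    Hypothetical helper: every parameter name here is illustrative.
    """
    rng = random.Random(seed)  # seeded so debugging runs are reproducible
    events = []
    for _ in range(n):
        # Pick a random moment inside the requested time window.
        offset = timedelta(seconds=rng.uniform(0, days * 24 * 3600))
        events.append({
            "eventTime": (start + offset).isoformat(),
            # Gaussian scatter around the chosen center.
            "lat": rng.gauss(center_lat, spread_deg),
            "lng": rng.gauss(center_lng, spread_deg),
        })
    return events


sample = generate_fake_pickups(
    n=5, center_lat=40.7128, center_lng=-74.0060,
    spread_deg=0.02, start=datetime(2017, 1, 1), days=7,
)
```

Because the generator is seeded, the same parameters always yield the same events, which is what makes a model's predictions easy to validate against a known input.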
Feature Extraction
Task 1
{
   "eventTime": "2017-01-20T18:54:07.000-05:00",
   "lat": 40.713802337646484,
   "lng": -77.0088882446289,
   "temperature": 0,
   "clear": 1,
   "fog": 0,
   "rain": 0,
   "snow": 0,
   "hail": 0,
   "thunder": 0,
   "tornado": 0
}
Features: Event Time, Location, Temperature, Weather
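A record of this shape pairs the event time and location with a temperature and one 0/1 flag per weather condition. A minimal sketch of assembling it is shown here. The field names come from the slide, while the helper `build_event` and its `conditions` argument are assumptions made for illustration.

```python
WEATHER_FLAGS = ("clear", "fog", "rain", "snow", "hail", "thunder", "tornado")


def build_event(event_time, lat, lng, temperature, conditions):
    """Build a flat feature record with one 0/1 flag per weather condition.

    Hypothetical helper; `conditions` is an iterable of active flag names.
    """
    unknown = set(conditions) - set(WEATHER_FLAGS)
    if unknown:
        raise ValueError(f"unknown weather conditions: {sorted(unknown)}")
    record = {
        "eventTime": event_time,
        "lat": lat,
        "lng": lng,
        "temperature": temperature,
    }
    # Expand the set of active conditions into explicit 0/1 columns.
    for flag in WEATHER_FLAGS:
        record[flag] = 1 if flag in conditions else 0
    return record


event = build_event(
    "2017-01-20T18:54:07.000-05:00",
    40.713802337646484,
    -77.0088882446289,
    0,
    {"clear"},
)
```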
Location Clustering
Task 1
Preparator:
• K-Means Location Clustering (200 clusters)
• Each cluster represents a neighborhood geographically
• Demand Aggregation per Unit Time Interval and Location Cluster
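The two preparator steps, clustering pickup locations and then counting demand per time interval and cluster, can be sketched in plain Python. This is a from-scratch illustration under several assumptions: it uses basic Lloyd's k-means with Euclidean distance on raw coordinates, one-hour intervals, and 2 clusters on toy data instead of 200. The helper names are hypothetical and the project's real preparator may differ.

```python
import random
from collections import Counter
from datetime import datetime


def nearest(point, centroids):
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: (point[0] - centroids[i][0]) ** 2
        + (point[1] - centroids[i][1]) ** 2,
    )


def kmeans(points, k, iterations=20, seed=0):
    """Plain Lloyd's k-means on (lat, lng) pairs; returns the k centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        for i, group in enumerate(groups):
            if group:  # keep the old centroid if a cluster emptied out
                centroids[i] = (
                    sum(p[0] for p in group) / len(group),
                    sum(p[1] for p in group) / len(group),
                )
    return centroids


def aggregate_demand(events, centroids):
    """Count pickups per (hour bucket, cluster index)."""
    counts = Counter()
    for e in events:
        hour = datetime.fromisoformat(e["eventTime"]).replace(
            minute=0, second=0, microsecond=0
        )
        counts[(hour, nearest((e["lat"], e["lng"]), centroids))] += 1
    return counts


events = [
    {"eventTime": "2017-01-20T18:05:00", "lat": 40.70, "lng": -74.00},
    {"eventTime": "2017-01-20T18:40:00", "lat": 40.71, "lng": -74.01},
    {"eventTime": "2017-01-20T18:55:00", "lat": 40.80, "lng": -73.95},
    {"eventTime": "2017-01-20T19:10:00", "lat": 40.81, "lng": -73.96},
]
centroids = kmeans([(e["lat"], e["lng"]) for e in events], k=2)
demand = aggregate_demand(events, centroids)
```

Each key of `demand` is an (hour, cluster) pair, so the counts form the per-neighborhood, per-interval targets a regression model could be trained on.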
Models
Task 2
Multiple Models. Supports many classic individual and combined machine learning models.
• Gradient-Boosted Trees
• Linear Regression with Stochastic Gradient Descent
• Neural Network
• Random Forest
• Ridge Regression
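To make two of the listed models concrete, here is a from-scratch sketch of linear regression trained with stochastic gradient descent, with an optional L2 penalty that gives a ridge-style variant. It is not the project's implementation, which presumably relies on a library. The function names, the toy features, and the simple averaging used to illustrate a "combined" model are all assumptions.

```python
import random


def sgd_linear_regression(rows, targets, lr=0.05, l2=0.0, epochs=500, seed=0):
    """Fit y ~ w.x + b by stochastic gradient descent on squared error.

    l2 > 0 adds a ridge-style penalty on the weights (not on the bias).
    """
    rng = random.Random(seed)
    w = [0.0] * len(rows[0])
    b = 0.0
    order = list(range(len(rows)))
    for _ in range(epochs):
        rng.shuffle(order)  # visit samples in a fresh random order each epoch
        for i in order:
            x, y = rows[i], targets[i]
            error = sum(wj * xj for wj, xj in zip(w, x)) + b - y
            w = [wj - lr * (error * xj + l2 * wj) for wj, xj in zip(w, x)]
            b -= lr * error
    return w, b


def predict(model, x):
    """Evaluate a fitted (weights, bias) pair on one feature vector."""
    w, b = model
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def average_prediction(models, x):
    """One simple way to combine models: average their predictions."""
    return sum(predict(m, x) for m in models) / len(models)


# Toy data (illustrative): demand = 3 * rain + 2 * evening + 1.
rows = [(0, 0), (1, 0), (0, 1), (1, 1)]
targets = [1, 4, 3, 6]

plain = sgd_linear_regression(rows, targets)
ridge = sgd_linear_regression(rows, targets, l2=0.1)
combined = average_prediction([plain, ridge], (1, 1))
```

On this noiseless toy data the unpenalized fit recovers the generating coefficients closely, while the penalized fit shrinks the weights toward zero, which is the trade-off ridge regression makes.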
Task 2
PredictionIO - Train
Task 2
PredictionIO - Predict
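A deployed PredictionIO engine answers prediction queries sent as JSON via HTTP POST to its `/queries.json` endpoint, on port 8000 by default. The sketch below only builds such a request without sending it. The helper name `build_predict_request` is hypothetical, and it is an assumption that the query fields mirror the feature record used for training.

```python
import json
import urllib.request


def build_predict_request(query, engine_url="http://localhost:8000"):
    """Build (but do not send) a POST request for a deployed engine."""
    return urllib.request.Request(
        url=engine_url.rstrip("/") + "/queries.json",
        data=json.dumps(query).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Assumed query shape: same fields as the training record.
query = {
    "eventTime": "2017-01-20T18:54:07.000-05:00",
    "lat": 40.713802337646484,
    "lng": -77.0088882446289,
    "temperature": 0,
    "clear": 1, "fog": 0, "rain": 0, "snow": 0,
    "hail": 0, "thunder": 0, "tornado": 0,
}
request = build_predict_request(query)

# To send it against a running engine:
# with urllib.request.urlopen(request) as response:
#     result = json.load(response)
```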
Map Visualization
Task 4
[Architecture diagram: in the Web App, the User Interface collects Time, Location, and Weather into a Predict Query for the PredictionIO Engine; the Predict Result for Each Algorithm is rendered by the Map GL component using Mapbox.]
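Map layers such as heat maps are commonly fed GeoJSON, so one plausible last step is converting prediction results into a GeoJSON FeatureCollection of points. The sketch below assumes each prediction is a dict with `lat`, `lng`, and `demand` keys; that shape and the helper name are illustrative, not taken from the project.

```python
import json


def predictions_to_geojson(predictions):
    """Convert [{'lat', 'lng', 'demand'}, ...] into a GeoJSON FeatureCollection."""
    return {
        "type": "FeatureCollection",
        "features": [
            {
                "type": "Feature",
                # GeoJSON orders coordinates as [longitude, latitude].
                "geometry": {"type": "Point", "coordinates": [p["lng"], p["lat"]]},
                "properties": {"demand": p["demand"]},
            }
            for p in predictions
        ],
    }


predictions = [
    {"lat": 40.705, "lng": -74.005, "demand": 12.5},
    {"lat": 40.805, "lng": -73.955, "demand": 4.0},
]
geojson_text = json.dumps(predictions_to_geojson(predictions))
```

Keeping the predicted demand in `properties` lets the map layer weight each point by it, for example through a heatmap weight expression in Mapbox GL.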
Looking Back
How can the taxi industry utilize past data to predict the future needs of their riders?
Koober. An open-source interactive website for visualizing past and predicting future taxi demand.
• Website
• Data
• Machine Learning
• Visualization
Looking Forward
Training Data
• More Cities
• Uber/Lyft
Models
• Better Tuning
• New Models
Prediction Modes
• Wait Times
• Prices
Our Mentor
James Ward
Engineering and Open Source Ambassador at Salesforce.com
www.jamesward.com
Our Professor
Ross Tate
Assistant Professor in Computer Science at Cornell University
http://www.cs.cornell.edu/~ross/
Contribute at
https://github.com/jamesward/koober
Questions?

Koober PredictionIO Presentation
