SlideShare a Scribd company logo
PredictionIO
Adam Wang
Annie Cheng
Brett Clancy
Erin Chen
Han-Wen Chen
Siyuan Wang
Yiting Wang
Koober–
Koober
Meet Tim
Koober
Tim’s Challenge
+
Provide an efficient service for customers and taxi
drivers by…
Wait Times Revenue
Tim has some questions
How should taxis be allocated
among different locations?
Koober
What factors drive customers
to use taxis more?
But most importantly…
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Current Solutions
Koober
iTaxi
Shiny
TaxiPrediction
Our Solution
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Koober
Why Koober?
Demand Analysis + Prediction. Empower taxi
owners with valuable rider information backed by
past trends.
Koober
Anyone in Ride Sharing Industry. Plan more
appropriate routes and make better transportation
decisions.
So how does it work?
Koober
1. Choose your info need: analysis or prediction.
OR
So how does it work?
Koober
2. Adjust your input query parameters.
So how does it work?
Koober
3. Visualize taxi demand on the heat maps.
So how does it work?
Koober
4. Make more informed taxi resource decisions.
Let’s try it out!
https://koober-dashboard.herokuapp.com/
Koober
How does it really work?
Koober
Train and test machine learning models.
Display demand on interactive heat maps.
What have we done?
Koober
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
Raw Data
NYC Taxi & Limousine Commission:
Yellow Taxi Cab Dataset
Koober
Weather Underground API:
• Historical Weather Data
• Current Conditions
• 10-Day Forecast
Task 1
Demo Data Loader
Koober
Task 1
Development Production
• Generate fake traffic data
based on user-defined
parameters
• Simplify debugging and
validating the model
prediction result
• Process NYC Taxi Data and
integrate with the historical
weather data
• Incorporate Kafka to
facilitate importing large
amount of training data
Feature Extraction
Koober
Task 1
{
   "eventTime": "2017-01-20T18:54:07.000-05:00",
   "lat": 40.713802337646484,
   "lng": -77.0088882446289,
   "temperature": 0,
   "clear": 1,
   "fog": 0,
   "rain": 0,
   "snow": 0,
   "hail": 0,
   "thunder": 0,
   "tornado": 0
}
Weather
Location
Event Time
Temperature
Location Clustering
Koober
Preparator:
• K-Means Location Clustering
(200 clusters)
• Each cluster represents a
neighborhood geographically
• Demand Aggregation per Unit
Time Interval and Location
Cluster
Task 1
What have we done?
Koober
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
Koober
Task 2
Gradient-Boosted Trees
Linear Regression with Stochastic Gradient Descent
Neural Network
Random Forest
Ridge Regression
Multiple Models. Supports many classic individual
and combined machine learning models.
Models
Koober
Task 2
PredictionIO - Train
Koober
Task 2
PredictionIO - Predict
What have we done?
Koober
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
What have we done?
Koober
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
Koober
Task 4
User
Interface
Time
Location
Weather Predict Query
Predict Result
for Each
Algorithm
Map GL
component
Mapbox PredictionIO
Engine
Web App
Map Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Looking Forward
Koober
ModelsTraining Data Prediction Modes
More Cities
Uber/Lyft
Better Tuning
New Models
Wait Times
Prices
Our Mentor
Engineering and Open Source Ambassador at
Salesforce.com
Koober
James Ward
www.jamesward.com
Our Professor
Assistant Professor in Computer Science at
Cornell University
Koober
Ross Tate
http://www.cs.cornell.edu/~ross/
Contribute at
Koober
https://github.com/jamesward/
koober
Questions?
Koober

More Related Content

PDF
Baseride Technologies - solutions for smart transportation & logistics
PPTX
Applying cloud think infrastructure to real time vehicle parameter estimation
PDF
3. Google Maps가 위치정보를 다루는 방식 Places API 이해하기
DOCX
Vehicle detection and speed detection
PPTX
Big Data Pipelines and Machine Learning at Uber
PDF
Cities in Motion: Mapping Singapore’s Night-time Economy through Taxi Data
PPT
4 f ballegeer - seize the real world
Baseride Technologies - solutions for smart transportation & logistics
Applying cloud think infrastructure to real time vehicle parameter estimation
3. Google Maps가 위치정보를 다루는 방식 Places API 이해하기
Vehicle detection and speed detection
Big Data Pipelines and Machine Learning at Uber
Cities in Motion: Mapping Singapore’s Night-time Economy through Taxi Data
4 f ballegeer - seize the real world

Similar to Koober Machine Learning (20)

PDF
Report_SmartSuggest
PDF
Taxi Demand Prediction using Machine Learning.
PDF
Analyzing NYC Transit Data
ODP
Insight_Project_Presentation
PDF
Is it harder to find a taxi when it is raining?
PDF
A Study on New York City Taxi Rides
PDF
Intelligent Transportation System Based On Machine Learning For Vehicle Perce...
PDF
TAXI DEMAND PREDICTION IN REAL TIME
PPTX
NYC Taxi
PDF
Uber rides data analysis using python under goggle co lab
PPTX
Prediction of taxi rides ETA
PDF
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
PPTX
Visualizing Urban Data - Chris Whong
PPTX
Data Analytics Uber using google cloud and dashboard
PPTX
An effective joint prediction model for travel demands and traffic flows
PPTX
Cab travel time prediction using ensemble models
PDF
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
PDF
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
PDF
Finding the right Machine Learning method for predictive modeling
PDF
945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp
Report_SmartSuggest
Taxi Demand Prediction using Machine Learning.
Analyzing NYC Transit Data
Insight_Project_Presentation
Is it harder to find a taxi when it is raining?
A Study on New York City Taxi Rides
Intelligent Transportation System Based On Machine Learning For Vehicle Perce...
TAXI DEMAND PREDICTION IN REAL TIME
NYC Taxi
Uber rides data analysis using python under goggle co lab
Prediction of taxi rides ETA
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Visualizing Urban Data - Chris Whong
Data Analytics Uber using google cloud and dashboard
An effective joint prediction model for travel demands and traffic flows
Cab travel time prediction using ensemble models
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Finding the right Machine Learning method for predictive modeling
945 mpp1 chicago_taxi data research_v1_cyy_33w1xtp
Ad

More from James Ward (8)

PPTX
Kotlin Mullets
PPTX
Introduction to Machine Learning
PPTX
Salesforce Campus Tour - Developer Intro
PPTX
Salesforce Campus Tour - Developer Advanced
PPTX
Salesforce Campus Tour - Declarative
PDF
Integrating Clouds & Humans with Wearable Apps
PPTX
Building Reactive Apps
PPTX
Planet of the AOPs
Kotlin Mullets
Introduction to Machine Learning
Salesforce Campus Tour - Developer Intro
Salesforce Campus Tour - Developer Advanced
Salesforce Campus Tour - Declarative
Integrating Clouds & Humans with Wearable Apps
Building Reactive Apps
Planet of the AOPs
Ad

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Machine learning based COVID-19 study performance prediction
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
Teaching material agriculture food technology
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Machine Learning_overview_presentation.pptx
PPTX
Big Data Technologies - Introduction.pptx
Spectral efficient network and resource selection model in 5G networks
Machine learning based COVID-19 study performance prediction
The AUB Centre for AI in Media Proposal.docx
A comparative analysis of optical character recognition models for extracting...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Teaching material agriculture food technology
MIND Revenue Release Quarter 2 2025 Press Release
Mobile App Security Testing_ A Comprehensive Guide.pdf
Network Security Unit 5.pdf for BCA BBA.
Review of recent advances in non-invasive hemoglobin estimation
Unlocking AI with Model Context Protocol (MCP)
Dropbox Q2 2025 Financial Results & Investor Presentation
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Programs and apps: productivity, graphics, security and other tools
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Encapsulation_ Review paper, used for researhc scholars
Digital-Transformation-Roadmap-for-Companies.pptx
Machine Learning_overview_presentation.pptx
Big Data Technologies - Introduction.pptx

Koober Machine Learning

  • 1. PredictionIO Adam Wang Annie Cheng Brett Clancy Erin Chen Han-Wen Chen Siyuan Wang Yiting Wang Koober–
  • 3. Koober Tim’s Challenge + Provide an efficient service for customers and taxi drivers by… Wait Times Revenue
  • 4. Tim has some questions How should taxis be allocated among different locations? Koober What factors drive customers to use taxis more?
  • 5. But most importantly… Koober How can the taxi industry utilize past data to predict the future needs of their riders?
  • 7. Our Solution Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Koober
  • 8. Why Koober? Demand Analysis + Prediction. Empower taxi owners with valuable rider information backed by past trends. Koober Anyone in Ride Sharing Industry. Plan more appropriate routes and make better transportation decisions.
  • 9. So how does it work? Koober 1. Choose your info need: analysis or prediction. OR
  • 10. So how does it work? Koober 2. Adjust your input query parameters.
  • 11. So how does it work? Koober 3. Visualize taxi demand on the heat maps.
  • 12. So how does it work? Koober 4. Make more informed taxi resource decisions.
  • 13. Let’s try it out! https://koober-dashboard.herokuapp.com/ Koober
  • 14. How does it really work? Koober Train and test machine learning models. Display demand on interactive heat maps.
  • 15. What have we done? Koober Task 1: Generate dataset and extract features. Task 2: Build various machine learning models. Task 3: Develop the website and dashboard interface. Task 4: Integrate Mapbox data visualization.
  • 16. Raw Data NYC Taxi & Limousine Commission: Yellow Taxi Cab Dataset Koober Weather Underground API: • Historical Weather Data • Current Conditions • 10-Day Forecast Task 1
  • 17. Demo Data Loader Koober Task 1 Development Production • Generate fake traffic data based on user-defined parameters • Simplify debugging and validating the model prediction result • Process NYC Taxi Data and integrate with the historical weather data • Incorporate Kafka to facilitate importing large amount of training data
  • 18. Feature Extraction Koober Task 1 {    "eventTime": "2017-01-20T18:54:07.000-05:00",    "lat": 40.713802337646484,    "lng": -77.0088882446289,    "temperature": 0,    "clear": 1,    "fog": 0,    "rain": 0,    "snow": 0,    "hail": 0,    "thunder": 0,    "tornado": 0 } Weather Location Event Time Temperature
  • 19. Location Clustering Koober Preparator: • K-Means Location Clustering (200 clusters) • Each cluster represents a neighborhood geographically • Demand Aggregation per Unit Time Interval and Location Cluster Task 1
  • 20. What have we done? Koober Task 1: Generate dataset and extract features. Task 2: Build various machine learning models. Task 3: Develop the website and dashboard interface. Task 4: Integrate Mapbox data visualization.
  • 21. Koober Task 2 Gradient-Boosted Trees Linear Regression with Stochastic Gradient Descent Neural Network Random Forest Ridge Regression Multiple Models. Supports many classic individual and combined machine learning models. Models
  • 24. What have we done? Koober Task 1: Generate dataset and extract features. Task 2: Build various machine learning models. Task 3: Develop the website and dashboard interface. Task 4: Integrate Mapbox data visualization.
  • 25. What have we done? Koober Task 1: Generate dataset and extract features. Task 2: Build various machine learning models. Task 3: Develop the website and dashboard interface. Task 4: Integrate Mapbox data visualization.
  • 26. Koober Task 4 User Interface Time Location Weather Predict Query Predict Result for Each Algorithm Map GL component Mapbox PredictionIO Engine Web App Map Visualization
  • 27. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 28. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 29. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 30. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 31. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 32. Looking Forward Koober ModelsTraining Data Prediction Modes More Cities Uber/Lyft Better Tuning New Models Wait Times Prices
  • 33. Our Mentor Engineering and Open Source Ambassador at Salesforce.com Koober James Ward www.jamesward.com
  • 34. Our Professor Assistant Professor in Computer Science at Cornell University Koober Ross Tate http://www.cs.cornell.edu/~ross/