SlideShare a Scribd company logo
Copyright © 2018 weiwei1988 All Rights Reserved.
Proposal for
A Demand Prediction Model and
A Dynamic Pricing Model
for Share Bike Business
Utilizing Machine Learning
weiwei1988
Copyright © 2018 weiwei1988 All Rights Reserved.
1. Basic Understandings for the Share Bike Business
2. Analysis on the Influence Factors on demands
3. The Demand Prediction Model Utilizing Machine
Learning
4. The Dynamic Pricing Model Based on the Demand
Prediction Model
Copyright © 2018 weiwei1988 All Rights Reserved.
1. Basic Understandings for the Share Bike Business
2. Analysis on the Influence Factors on demands
3. The Demand Prediction Model Utilizing Machine
Learning
4. The Dynamic Pricing Model Based on the Demand
Prediction Model
Copyright © 2018 weiwei1988 All Rights Reserved.
The “Gobike” is a share bike business provided by
Ford Motor in San Francisco bay area since 2013.
City Stations
Mountain View 7
Palo Alto 5
Redwood City 7
San Francisco 35
San Jose 16
No. of Stations before Aug. 2015
Source: https://www.fordgobike.com/
Copyright © 2018 weiwei1988 All Rights Reserved.
Issue for the share bike business:
The growth of demand has stagnated since Oct. 2014.
No. of Demands by Month [Trips]
Copyright © 2018 weiwei1988 All Rights Reserved.
The cause of growth stagnation:
The No. of trips varies by the station in different cities.
Period for analysis:
Aug. 2013/8~Aug. 2015
For ex: Demands in downtown area like San Francisco has maximum No. of 50 thousand
trips, Palo Alto, on the other hand, had only 2 thousands trips in total.
Other factors such as Month,
Date, Weather also influence
the No. of demands
Copyright © 2018 weiwei1988 All Rights Reserved.
The cause of growth stagnation:
However, No. of docks in each station did not match
the diversified No. of trips in different stations.
Period for analysis:
Aug. 2013/8~Aug. 2015
Copyright © 2018 weiwei1988 All Rights Reserved.
The cause of growth stagnation:
Inflexible price plan was another big problem.
Current Price Plan:
$3 / Trip (Up to 30min)
$9.95 / Day (Up to 30min)
$149/Year(up to 45min for subscribers)
Source: https://www.fordgobike.com/
Copyright © 2018 weiwei1988 All Rights Reserved.
It is necessary to understand the precise demands of
trips to avoid supply and demands un-matching
situation.
No. of demands varies by station,
city, and some other factors.
However, No. of docks and
price plan are inflexible.
Un-matching problems occurs between supply
and demand, which lead to the stagnation of the
business growth.
It is necessary to understand the precise demand
of trips to adjust the no. of docks and price plan.
A Proposal for demand prediction utilizing the
machine learning technique.
Copyright © 2018 weiwei1988 All Rights Reserved.
1. Basic Understandings for the Share Bike Business
2. Analysis on the Influence Factors on demands
3. The Demand Prediction Model Utilizing Machine
Learning
4. The Dynamic Pricing Model Based on the Demand
Prediction Model
Copyright © 2018 weiwei1988 All Rights Reserved.
Influence Factors on demands:
The No. of trips in weekday and weekend have substantial
difference. Weekdays have 2-3 times larger trips.
Weekday: Mon/Tue/Wen/Thu/Fri
Weekend: Sat/SunNo. of trips by day
Period for analysis:
Aug. 2013/8~Aug. 2015
Copyright © 2018 weiwei1988 All Rights Reserved.
Influence Factors on demands:
Influence of weekends also varies by cities:
Weekend trips in downtown area drops substantially.
Period for analysis:
Aug. 2013/8~Aug. 2015
Copyright © 2018 weiwei1988 All Rights Reserved.
Influence Factors on demands:
The No. of trips also depend on the time zone. Moring rush &
evening rush indicate sharp increase on demand.
集計期間:2013/8~2015/8の3年間
Copyright © 2018 weiwei1988 All Rights Reserved.
Influence factors on demands:
No. of trips tend to increase in summer,
but tend to drop in winter.
Period for analysis:
Aug. 2013/8~Aug. 2015
Copyright © 2018 weiwei1988 All Rights Reserved.
Influence factors on demands:
Most of trips concentrate on the days without weather
events. (No_RainForg stands for the NaN of weather events)
Period for analysis:
Aug. 2013/8~Aug. 2015
Copyright © 2018 weiwei1988 All Rights Reserved.
Influence factors on demands:
The mean temperature seems to have some influence on No. of
trips: Higher temperature lead to more demand.
WeekendWeekday
Period for analysis:
Aug. 2013/8~Aug. 2015
Copyright © 2018 weiwei1988 All Rights Reserved.
Influence factors on demands:
Ref:On the other hand, wind speed seems have little
correlation with the demand.
WeekendWeekday
Period for analysis:
Aug. 2013/8~Aug. 2015
Copyright © 2018 weiwei1988 All Rights Reserved.
1. Basic Understandings for the Share Bike Business
2. Analysis on the Influence Factors on demands
3. A Demand Prediction Model Utilizing Machine
Learning
4. The Dynamic Pricing Model Based on the Demand
Prediction Model
Copyright © 2018 weiwei1988 All Rights Reserved.
The Demand Prediction Model:
A demand prediction model based on machine learning was
proposed to forecast the No. of trips at defined station on
defined date, time zone.
Input
Output
Station Location
(LAT, LONG)
Datetime
(Year/Month/Date)
Weekday Flag
(Weekday/Weekend)
Time Zone
(Moring Rush/Noon…)
Weather indicators
(Temperature, Pressure…)
No. of trips at
defined station in
defined date, time
zone.
Set multiple variables such as
station location, datetime, week
flag, time zone and weather
indicators as inputs of tanning
data.
Set No. of trips at
defined station in
defined datetime,
time zone as
output of the
training data.
Use training data to train the selected machine learning
models, and select the model with highest precision.
x1
x2
x3
xn
xn-1
….
y
Prediction
Inputs
Create a demand prediction model for the No. of trips.
1 2
3
4
Regression
Rige Regression
Lasso Regression
Random Forest
Decision Tree
Gradient Boosting
Ada Boosting
Selected Machine Learning Models
Copyright © 2018 weiwei1988 All Rights Reserved.
Explanatory variables Factor/Unit Data Source
Latitude of the Station (LAT) deg. station.csv
Longitude of the Station (LONG) deg. station.csv
Year 2013/2014/2015 trip.csv
Month 1/2/3/4/5/6/7/8/9/10/11/12 trip.csv
Day 1-31 trip.csv
Weekflag Weekday(Mon/Tue/Wed/Thu/Fri)
Weekend(Sat/Sun)
Variable create using conduction branch
from “Weekday” in trip.csv
Timezone Moring Rush (6-10)
Noon(11-15)
Evening Rush(16-20)
Night(21-5 on next day)
Variable create using conduction branch
from “Weekday” in trip.csv
Weather Events No_RainFog(No events: NaN)/
Rain/Fog-Rain/Rain-Thunderstorm
weather.csv*
Mean temperature F weather.csv*
Mean humidity % weather.csv*
Mean wind speed mph weather.csv*
Mean sea level pressure inches weather.csv*
Cloud cover 0/1/2/3/4/5/6/7/8 weather.csv*
Mean visibility miles weather.csv*
Precipitation inches weather.csv*
Preconditions of the tranining:
Explanatory and explained variables
Period for analysis:Aug. 2013/8~Aug. 2015
Explained Variable Factor/Unit Data Source
No. of Trips times Grouped by explanatory variables from
trips.csv
*the NaN of weather.csv were filled
with fillna(method = ‘pad’)
Copyright © 2018 weiwei1988 All Rights Reserved.
Preconditions of the trainning:
Selected Models, Datasets for Tanning and Validation.
• Selected Machine Learning Models
• Regression
• Rige Regression
• Lasso Regression
• Decision Tree
• Datasets for Training
• 80% of random samples from Aug. 2013 to Aug. 2015
• Datasets for Validation
• 20% of random samples from Aug. 2013 to Aug. 2015
• Cross Validation
• 5 times
• Python library for Tanning
• Scikit learn
• Random Forest
• Gradient Boosting
• Ada Boosting
Copyright © 2018 weiwei1988 All Rights Reserved.
Model Validation:
Radom Forest Regression was selected as prediction
model because of the highest precision scores.
Selected Models Mean R2
Negative Mean Squad
Error
Regression 0.24 -40.02
Rige Regression 0.24 -40.02
Lasso Regression 0.00 -52.65
Decision Tree 0.68 -16.68
Random Forest 0.85 -8.22
Gradient Boosting 0.60 -21.05
Ada Boosting 0.40 -31.53
Copyright © 2018 weiwei1988 All Rights Reserved.
Parameter Tuning:
Random forest regressor was tuned with listed parameters
using grid search.
tuned_parameters_rdfr = {
"max_depth": [2,3, None],
"n_estimators":[100, 200, 300],
"max_features": [1, 3, 5],
"min_samples_split": [2, 3, 10],
"min_samples_leaf": [1, 3, 10],
"bootstrap": [True, False],
}
Regressor was tuned using
GridSearchCV method (CV=5)
max_depth=None, max_features=5, max_leaf_nodes=None, min_impurity_decrease=0.0,
min_impurity_split=None, min_samples_leaf=1, min_samples_split=10,
min_weight_fraction_leaf=0.0, n_estimators=300
Copyright © 2018 weiwei1988 All Rights Reserved.
Prediction results:
The No. of trips by each day were relatively well predicated
by the machine learning model for practical usage.
Actual and predicted No. of trips by day (cumulative sum for all stations)
Copyright © 2018 weiwei1988 All Rights Reserved.
Prediction results:
Ref: Results for head 100 datasets.
Actual and predicted No. of trips by day (cumulative sum for all stations)
Copyright © 2018 weiwei1988 All Rights Reserved.
Prediction results:
The No. of trips by each time zone were relatively well
predicated by the machine learning model for practical usage.
Actual and predicted No. of trips by time zone (San Francisco, Weekday)
Copyright © 2018 weiwei1988 All Rights Reserved.
Prediction results:
Ref: Results for head 100 datasets.
Actual and predicted No. of trips by time zone (San Francisco, Weekday)
Copyright © 2018 weiwei1988 All Rights Reserved.
Importance score:
Variables with relatively high importance score are:
Location(Long&Lat), Timezone, Weekflag, Year and Temperature
Importance score of each explanatory variables
Copyright © 2018 weiwei1988 All Rights Reserved.
1. Basic Understandings for the Share Bike Business
2. Analysis on the Influence Factors on demands
3. A Demand Prediction Model Utilizing Machine
Learning
4. A Dynamic Pricing Model Based on the Demand
Prediction Model
Copyright © 2018 weiwei1988 All Rights Reserved.
Dynamic Pricing Model:
A Dynamic pricing model based on the demand prediction
model instead of current inflexible pricing plan was proposed
to solve the demand supply un-matching problem.
Demands
Price
Price
Demands
Price was constant in spite of fluctuate demand,
which lead to demand supply gap.
Current Pricing Plan Proposed Pricing Model
Price will be set based on the demand prediction
results by machine learning, which could balance
the demand and supply.
Price Fluctuation Rate%(t) = (1/P.E.)×Demand Fluctuation Rate%(t)
Model
Example:
Price Fluctuation Rate = Set Price / Current Constant Price
Demand Fluctuation Rate = Predicted Demand / Average Demands
P.E. : Price Elasticity
Copyright © 2018 weiwei1988 All Rights Reserved.
How to benefit customers?
Integrate the dynamic pricing model to the APP,
Provide customers with real-time optimized price.
Price Fluctuation Rate
InputStation Location/
Date/Weather…
1
Get the necessary
data from user app. 2 Predict the demand at the
defined condition
3 Use the dynamic pricing
model to calculate the
optimized price at this point.
4 Offer the price on the app,
allow customers to make
payments at real time.
Current
Price to
offer
Copyright © 2018 weiwei1988 All Rights Reserved.
Ex: Price fluctuation rate to offer.
Price Fluctuation Rate (San Francisco, Weekday) Price Elasticity = 10

More Related Content

PPTX
ibmuditappt IBM PPT UYUIYNJKJKJKUIUIIUIUIOUIO
PDF
Rides Request Demand Forecast- OLA Bike
PPTX
Final presentation MIS 637 A - Rishab Kothari
PPTX
Bike Sharing Demand Prediction dzzdzdPPT.pptx
PDF
Predictive Analysis of Bike Sharing System Using Machine Learning Algorithms
DOCX
Predictive modeling Paper-Team8 V0.1
PPTX
Project_template_AICTE internship 2025 .pptx
PPTX
Project_template_AICTE.pptx TEMPLAYEB DIDI
ibmuditappt IBM PPT UYUIYNJKJKJKUIUIIUIUIOUIO
Rides Request Demand Forecast- OLA Bike
Final presentation MIS 637 A - Rishab Kothari
Bike Sharing Demand Prediction dzzdzdPPT.pptx
Predictive Analysis of Bike Sharing System Using Machine Learning Algorithms
Predictive modeling Paper-Team8 V0.1
Project_template_AICTE internship 2025 .pptx
Project_template_AICTE.pptx TEMPLAYEB DIDI

Similar to A Demand Prediction Model and A Dynamic Pricing Model for Share Bike Business Utilizing Machine Learning (20)

PPTX
Bike Sharing Demand: Akshay Patil
PPTX
Project template_APSSDC-1hjkkkkkkkllollo
PPTX
SENTIMENT ANALYSIS ON PPT AND Project template_.pptx
PPTX
Project template for presenting it before the panel
PPTX
Project template for projects looks like this
PPTX
Project CHAT BOT APPROVED AICTE_SB4C.pptx
PPTX
ProjectTemplate for any project releted to ppt).pptx
PDF
Data mining
DOCX
6101-Project Report
PDF
MLR PPT.pdf seoul bike sharing demand prediction
PDF
Find my ride (wide)
PPTX
Cab travel time prediction using ensemble models
PDF
Taxi Demand Prediction using Machine Learning.
PPTX
Solving Real Life Problems using Data Science Part - 1
PPTX
Predictive maintenance withsensors_in_utilities_
PPTX
Predicting Power Consumption for a Greener Tomorrow: Machine Learning Project...
PDF
Internet of Things trifft auf Customer Intelligence
PDF
Intelligent Systems Project: Bike sharing service modeling
PDF
Trice-SeniorCapstone
PPTX
Writing predictive web services with Azure ML
Bike Sharing Demand: Akshay Patil
Project template_APSSDC-1hjkkkkkkkllollo
SENTIMENT ANALYSIS ON PPT AND Project template_.pptx
Project template for presenting it before the panel
Project template for projects looks like this
Project CHAT BOT APPROVED AICTE_SB4C.pptx
ProjectTemplate for any project releted to ppt).pptx
Data mining
6101-Project Report
MLR PPT.pdf seoul bike sharing demand prediction
Find my ride (wide)
Cab travel time prediction using ensemble models
Taxi Demand Prediction using Machine Learning.
Solving Real Life Problems using Data Science Part - 1
Predictive maintenance withsensors_in_utilities_
Predicting Power Consumption for a Greener Tomorrow: Machine Learning Project...
Internet of Things trifft auf Customer Intelligence
Intelligent Systems Project: Bike sharing service modeling
Trice-SeniorCapstone
Writing predictive web services with Azure ML
Ad

Recently uploaded (20)

PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
Safety Seminar civil to be ensured for safe working.
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
composite construction of structures.pdf
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
OOP with Java - Java Introduction (Basics)
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPT
introduction to datamining and warehousing
PPTX
Construction Project Organization Group 2.pptx
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
Sustainable Sites - Green Building Construction
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Model Code of Practice - Construction Work - 21102022 .pdf
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
CH1 Production IntroductoryConcepts.pptx
Foundation to blockchain - A guide to Blockchain Tech
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Safety Seminar civil to be ensured for safe working.
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
composite construction of structures.pdf
Internet of Things (IOT) - A guide to understanding
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
OOP with Java - Java Introduction (Basics)
Automation-in-Manufacturing-Chapter-Introduction.pdf
introduction to datamining and warehousing
Construction Project Organization Group 2.pptx
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
R24 SURVEYING LAB MANUAL for civil enggi
Sustainable Sites - Green Building Construction
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Ad

A Demand Prediction Model and A Dynamic Pricing Model for Share Bike Business Utilizing Machine Learning

  • 1. Copyright © 2018 weiwei1988 All Rights Reserved. Proposal for A Demand Prediction Model and A Dynamic Pricing Model for Share Bike Business Utilizing Machine Learning weiwei1988
  • 2. Copyright © 2018 weiwei1988 All Rights Reserved. 1. Basic Understandings for the Share Bike Business 2. Analysis on the Influence Factors on demands 3. The Demand Prediction Model Utilizing Machine Learning 4. The Dynamic Pricing Model Based on the Demand Prediction Model
  • 3. Copyright © 2018 weiwei1988 All Rights Reserved. 1. Basic Understandings for the Share Bike Business 2. Analysis on the Influence Factors on demands 3. The Demand Prediction Model Utilizing Machine Learning 4. The Dynamic Pricing Model Based on the Demand Prediction Model
  • 4. Copyright © 2018 weiwei1988 All Rights Reserved. The “Gobike” is a share bike business provided by Ford Motor in San Francisco bay area since 2013. City Stations Mountain View 7 Palo Alto 5 Redwood City 7 San Francisco 35 San Jose 16 No. of Stations before Aug. 2015 Source: https://www.fordgobike.com/
  • 5. Copyright © 2018 weiwei1988 All Rights Reserved. Issue for the share bike business: The growth of demand has stagnated since Oct. 2014. No. of Demands by Month [Trips]
  • 6. Copyright © 2018 weiwei1988 All Rights Reserved. The cause of growth stagnation: The No. of trips varies by the station in different cities. Period for analysis: Aug. 2013/8~Aug. 2015 For ex: Demands in downtown area like San Francisco has maximum No. of 50 thousand trips, Palo Alto, on the other hand, had only 2 thousands trips in total. Other factors such as Month, Date, Weather also influence the No. of demands
  • 7. Copyright © 2018 weiwei1988 All Rights Reserved. The cause of growth stagnation: However, No. of docks in each station did not match the diversified No. of trips in different stations. Period for analysis: Aug. 2013/8~Aug. 2015
  • 8. Copyright © 2018 weiwei1988 All Rights Reserved. The cause of growth stagnation: Inflexible price plan was another big problem. Current Price Plan: $3 / Trip (Up to 30min) $9.95 / Day (Up to 30min) $149/Year(up to 45min for subscribers) Source: https://www.fordgobike.com/
  • 9. Copyright © 2018 weiwei1988 All Rights Reserved. It is necessary to understand the precise demands of trips to avoid supply and demands un-matching situation. No. of demands varies by station, city, and some other factors. However, No. of docks and price plan are inflexible. Un-matching problems occurs between supply and demand, which lead to the stagnation of the business growth. It is necessary to understand the precise demand of trips to adjust the no. of docks and price plan. A Proposal for demand prediction utilizing the machine learning technique.
  • 10. Copyright © 2018 weiwei1988 All Rights Reserved. 1. Basic Understandings for the Share Bike Business 2. Analysis on the Influence Factors on demands 3. The Demand Prediction Model Utilizing Machine Learning 4. The Dynamic Pricing Model Based on the Demand Prediction Model
  • 11. Copyright © 2018 weiwei1988 All Rights Reserved. Influence Factors on demands: The No. of trips in weekday and weekend have substantial difference. Weekdays have 2-3 times larger trips. Weekday: Mon/Tue/Wen/Thu/Fri Weekend: Sat/SunNo. of trips by day Period for analysis: Aug. 2013/8~Aug. 2015
  • 12. Copyright © 2018 weiwei1988 All Rights Reserved. Influence Factors on demands: Influence of weekends also varies by cities: Weekend trips in downtown area drops substantially. Period for analysis: Aug. 2013/8~Aug. 2015
  • 13. Copyright © 2018 weiwei1988 All Rights Reserved. Influence Factors on demands: The No. of trips also depend on the time zone. Moring rush & evening rush indicate sharp increase on demand. 集計期間:2013/8~2015/8の3年間
  • 14. Copyright © 2018 weiwei1988 All Rights Reserved. Influence factors on demands: No. of trips tend to increase in summer, but tend to drop in winter. Period for analysis: Aug. 2013/8~Aug. 2015
  • 15. Copyright © 2018 weiwei1988 All Rights Reserved. Influence factors on demands: Most of trips concentrate on the days without weather events. (No_RainForg stands for the NaN of weather events) Period for analysis: Aug. 2013/8~Aug. 2015
  • 16. Copyright © 2018 weiwei1988 All Rights Reserved. Influence factors on demands: The mean temperature seems to have some influence on No. of trips: Higher temperature lead to more demand. WeekendWeekday Period for analysis: Aug. 2013/8~Aug. 2015
  • 17. Copyright © 2018 weiwei1988 All Rights Reserved. Influence factors on demands: Ref:On the other hand, wind speed seems have little correlation with the demand. WeekendWeekday Period for analysis: Aug. 2013/8~Aug. 2015
  • 18. Copyright © 2018 weiwei1988 All Rights Reserved. 1. Basic Understandings for the Share Bike Business 2. Analysis on the Influence Factors on demands 3. A Demand Prediction Model Utilizing Machine Learning 4. The Dynamic Pricing Model Based on the Demand Prediction Model
  • 19. Copyright © 2018 weiwei1988 All Rights Reserved. The Demand Prediction Model: A demand prediction model based on machine learning was proposed to forecast the No. of trips at defined station on defined date, time zone. Input Output Station Location (LAT, LONG) Datetime (Year/Month/Date) Weekday Flag (Weekday/Weekend) Time Zone (Moring Rush/Noon…) Weather indicators (Temperature, Pressure…) No. of trips at defined station in defined date, time zone. Set multiple variables such as station location, datetime, week flag, time zone and weather indicators as inputs of tanning data. Set No. of trips at defined station in defined datetime, time zone as output of the training data. Use training data to train the selected machine learning models, and select the model with highest precision. x1 x2 x3 xn xn-1 …. y Prediction Inputs Create a demand prediction model for the No. of trips. 1 2 3 4 Regression Rige Regression Lasso Regression Random Forest Decision Tree Gradient Boosting Ada Boosting Selected Machine Learning Models
  • 20. Copyright © 2018 weiwei1988 All Rights Reserved. Explanatory variables Factor/Unit Data Source Latitude of the Station (LAT) deg. station.csv Longitude of the Station (LONG) deg. station.csv Year 2013/2014/2015 trip.csv Month 1/2/3/4/5/6/7/8/9/10/11/12 trip.csv Day 1-31 trip.csv Weekflag Weekday(Mon/Tue/Wed/Thu/Fri) Weekend(Sat/Sun) Variable create using conduction branch from “Weekday” in trip.csv Timezone Moring Rush (6-10) Noon(11-15) Evening Rush(16-20) Night(21-5 on next day) Variable create using conduction branch from “Weekday” in trip.csv Weather Events No_RainFog(No events: NaN)/ Rain/Fog-Rain/Rain-Thunderstorm weather.csv* Mean temperature F weather.csv* Mean humidity % weather.csv* Mean wind speed mph weather.csv* Mean sea level pressure inches weather.csv* Cloud cover 0/1/2/3/4/5/6/7/8 weather.csv* Mean visibility miles weather.csv* Precipitation inches weather.csv* Preconditions of the tranining: Explanatory and explained variables Period for analysis:Aug. 2013/8~Aug. 2015 Explained Variable Factor/Unit Data Source No. of Trips times Grouped by explanatory variables from trips.csv *the NaN of weather.csv were filled with fillna(method = ‘pad’)
  • 21. Copyright © 2018 weiwei1988 All Rights Reserved. Preconditions of the trainning: Selected Models, Datasets for Tanning and Validation. • Selected Machine Learning Models • Regression • Rige Regression • Lasso Regression • Decision Tree • Datasets for Training • 80% of random samples from Aug. 2013 to Aug. 2015 • Datasets for Validation • 20% of random samples from Aug. 2013 to Aug. 2015 • Cross Validation • 5 times • Python library for Tanning • Scikit learn • Random Forest • Gradient Boosting • Ada Boosting
  • 22. Copyright © 2018 weiwei1988 All Rights Reserved. Model Validation: Radom Forest Regression was selected as prediction model because of the highest precision scores. Selected Models Mean R2 Negative Mean Squad Error Regression 0.24 -40.02 Rige Regression 0.24 -40.02 Lasso Regression 0.00 -52.65 Decision Tree 0.68 -16.68 Random Forest 0.85 -8.22 Gradient Boosting 0.60 -21.05 Ada Boosting 0.40 -31.53
  • 23. Copyright © 2018 weiwei1988 All Rights Reserved. Parameter Tuning: Random forest regressor was tuned with listed parameters using grid search. tuned_parameters_rdfr = { "max_depth": [2,3, None], "n_estimators":[100, 200, 300], "max_features": [1, 3, 5], "min_samples_split": [2, 3, 10], "min_samples_leaf": [1, 3, 10], "bootstrap": [True, False], } Regressor was tuned using GridSearchCV method (CV=5) max_depth=None, max_features=5, max_leaf_nodes=None, min_impurity_decrease=0.0, min_impurity_split=None, min_samples_leaf=1, min_samples_split=10, min_weight_fraction_leaf=0.0, n_estimators=300
  • 24. Copyright © 2018 weiwei1988 All Rights Reserved. Prediction results: The No. of trips by each day were relatively well predicated by the machine learning model for practical usage. Actual and predicted No. of trips by day (cumulative sum for all stations)
  • 25. Copyright © 2018 weiwei1988 All Rights Reserved. Prediction results: Ref: Results for head 100 datasets. Actual and predicted No. of trips by day (cumulative sum for all stations)
  • 26. Copyright © 2018 weiwei1988 All Rights Reserved. Prediction results: The No. of trips by each time zone were relatively well predicated by the machine learning model for practical usage. Actual and predicted No. of trips by time zone (San Francisco, Weekday)
  • 27. Copyright © 2018 weiwei1988 All Rights Reserved. Prediction results: Ref: Results for head 100 datasets. Actual and predicted No. of trips by time zone (San Francisco, Weekday)
  • 28. Copyright © 2018 weiwei1988 All Rights Reserved. Importance score: Variables with relatively high importance score are: Location(Long&Lat), Timezone, Weekflag, Year and Temperature Importance score of each explanatory variables
  • 29. Copyright © 2018 weiwei1988 All Rights Reserved. 1. Basic Understandings for the Share Bike Business 2. Analysis on the Influence Factors on demands 3. A Demand Prediction Model Utilizing Machine Learning 4. A Dynamic Pricing Model Based on the Demand Prediction Model
  • 30. Copyright © 2018 weiwei1988 All Rights Reserved. Dynamic Pricing Model: A Dynamic pricing model based on the demand prediction model instead of current inflexible pricing plan was proposed to solve the demand supply un-matching problem. Demands Price Price Demands Price was constant in spite of fluctuate demand, which lead to demand supply gap. Current Pricing Plan Proposed Pricing Model Price will be set based on the demand prediction results by machine learning, which could balance the demand and supply. Price Fluctuation Rate%(t) = (1/P.E.)×Demand Fluctuation Rate%(t) Model Example: Price Fluctuation Rate = Set Price / Current Constant Price Demand Fluctuation Rate = Predicted Demand / Average Demands P.E. : Price Elasticity
  • 31. Copyright © 2018 weiwei1988 All Rights Reserved. How to benefit customers? Integrate the dynamic pricing model to the APP, Provide customers with real-time optimized price. Price Fluctuation Rate InputStation Location/ Date/Weather… 1 Get the necessary data from user app. 2 Predict the demand at the defined condition 3 Use the dynamic pricing model to calculate the optimized price at this point. 4 Offer the price on the app, allow customers to make payments at real time. Current Price to offer
  • 32. Copyright © 2018 weiwei1988 All Rights Reserved. Ex: Price fluctuation rate to offer. Price Fluctuation Rate (San Francisco, Weekday) Price Elasticity = 10