SlideShare a Scribd company logo
CrowdSkippr
Wafa Soofi
Yosemite trip, May 2011
We had a great time…
Though it might have been better if
the scenery had looked less like this
alanak
and
more
like
this.
Gianluca Vegetti
The problem
I want to go hiking at a time/day that
works for me, but that also minimizes
the size of the crowds.
The problem
I want to go hiking at a time/day that
works for me, but that also minimizes
the size of the crowds.
I would like to predict the crowd size for
a specific location and a range of
future dates.
The problem
I want to go hiking at a time/day that
works for me, but that also minimizes
the size of the crowds.
I would like to predict the crowd size for
a specific location and a range of
future dates.
Then I can use that prediction to make
an intelligent choice about when to
take my trip.
How do we predict crowds right now?
Government data
Often aggregated
Not always immediately accessible
Check-ins
Sparse coverage
Prior knowledge/Intuition
Not always validated
There’s another way.
There’s another way.
We can crowdsource this problem!
Demo Soofi
CrowdSkippr: Inner workings
From flickr.com, extract
the total number of
photos taken at a given
time/place).
Extract data on
temperatures from
NOAA.gov for a
given time/place.
Using this information, create a prediction of how
heavy the crowds will be at a given future
time/place.
TM
Gradient Boosting Regression
Predictors
Day of week (Flickr)
Holiday flag (Flickr)
Day of year (Flickr)
Daily temperature (NOAA)
Response
Number of photos taken (Flickr)
(proxy for size of crowd)
Photos (or
visitors) per
month
normalized by
total
Wait:
Is # photos a good proxy for # visitors?
Photos
Visitors
Photos (or
visitors) per
month
normalized by
total
Wait:
Is # photos a good proxy for # visitors?
Photos
Visitors
0
1
2
3
4
5
6
7
0 0.5 1 1.5
x100000
x 10000
R2 =0.89
No. Photos
Num.Visitors
Day of Year
Temperature
Day of Week
Yosemite National Park
Holiday
Relative Feature Importance
0 0.4 0.6 0.8 10.2
Thanks for your time!
I’m Wafa.
For all 28-day windows in a given year,
the median difference between crowd size on predicted and
actual best days is 4.6%.
(On the days that are predicted to have the lowest crowds, the
crowd size is 29% of the worst possible crowds within that
window.)
Validation:
Rocky Mountain National Park
Predicted
crowd size
Actual
crowd size
(test data)

More Related Content

PDF
Breaking bad the cult of not giving 'bad' news
PPT
Discussion 4 ignite presentation
DOCX
Ashutosh Kumar Mishra CV
PDF
Mardi himal trekking
PPT
Políticas públicas de salud david campos solís.
PDF
Informativa sindaco manifesto
PDF
Placevalued4
PDF
Mukesh Ambani Statement at RIL Annual General Meeting
Breaking bad the cult of not giving 'bad' news
Discussion 4 ignite presentation
Ashutosh Kumar Mishra CV
Mardi himal trekking
Políticas públicas de salud david campos solís.
Informativa sindaco manifesto
Placevalued4
Mukesh Ambani Statement at RIL Annual General Meeting

Viewers also liked (9)

PPTX
PPTX
El simbolismo en el arte plástico
PPS
MUNCH, Edvard, Featured Paintings in Detail (2)
PPTX
EPQ Presentation
PPTX
Espresionismoa (Pintura)
PPS
MUNCH, Edvard, Featured Paintings in Detail (1)
PPT
Gustav klimt
PDF
Real-Time Marketing Flipbook
PPTX
Nearfuture
El simbolismo en el arte plástico
MUNCH, Edvard, Featured Paintings in Detail (2)
EPQ Presentation
Espresionismoa (Pintura)
MUNCH, Edvard, Featured Paintings in Detail (1)
Gustav klimt
Real-Time Marketing Flipbook
Nearfuture
Ad

Recently uploaded (20)

PDF
Delhi to Kashmir Tour Package at Best Price.pdf
PPTX
8 - Airport Statistical Forms icon related
PDF
Explore Luxemburry.eu, the ancient of lands in Europe
PDF
Your Complete Guide to Taj Mahal Day Tour From Delhi.
PDF
Travel Agents Email List for Effective Tourism and Hospitality Marketing.pdf
PDF
Villa Oriente Porto Rotondo - Luxury Villlas Sardinia.pdf
PPTX
Enjoy Peaceful Travel Even in Busy Places.pptx
PDF
When is the best time to Visit Kailash Mansarovar.pdf
PPTX
What Can You Discover If You Scrape Booking Sites for Monsoon Flash Sales Hou...
PDF
Hyderabad to Pune Flight – Complete Travel Guide.pdf
PDF
Fly Smart with Copa Airlines LAX Your Guide to Airfare, Comfort, and Top Attr...
PPTX
MALDIVES.pptx.pptx short power point to guide your explanation
PPTX
Quiz- Thursday.pptxaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
PPTX
Multimedia - Dinagsa Festival, Cadiz City
PPTX
Sri Lanka Tour Plan and places that can be visited during your leave
PPTX
Airline API Integration | Flight API Supplier
PDF
Understanding Travel Insurance: Your Safety Net While Exploring the World
PPTX
How Indian Culture Is Perceived Around the World,Infouncle.pptx
PDF
Memorable Outdoor Adventures with Premium River Rafting & Guided Tours
PPTX
Unlocking Travel Insights with Cruise Critic Dataset for Analysis.pptx
Delhi to Kashmir Tour Package at Best Price.pdf
8 - Airport Statistical Forms icon related
Explore Luxemburry.eu, the ancient of lands in Europe
Your Complete Guide to Taj Mahal Day Tour From Delhi.
Travel Agents Email List for Effective Tourism and Hospitality Marketing.pdf
Villa Oriente Porto Rotondo - Luxury Villlas Sardinia.pdf
Enjoy Peaceful Travel Even in Busy Places.pptx
When is the best time to Visit Kailash Mansarovar.pdf
What Can You Discover If You Scrape Booking Sites for Monsoon Flash Sales Hou...
Hyderabad to Pune Flight – Complete Travel Guide.pdf
Fly Smart with Copa Airlines LAX Your Guide to Airfare, Comfort, and Top Attr...
MALDIVES.pptx.pptx short power point to guide your explanation
Quiz- Thursday.pptxaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
Multimedia - Dinagsa Festival, Cadiz City
Sri Lanka Tour Plan and places that can be visited during your leave
Airline API Integration | Flight API Supplier
Understanding Travel Insurance: Your Safety Net While Exploring the World
How Indian Culture Is Perceived Around the World,Infouncle.pptx
Memorable Outdoor Adventures with Premium River Rafting & Guided Tours
Unlocking Travel Insights with Cruise Critic Dataset for Analysis.pptx
Ad

Demo Soofi

  • 2. Yosemite trip, May 2011 We had a great time…
  • 3. Though it might have been better if the scenery had looked less like this alanak
  • 5. The problem I want to go hiking at a time/day that works for me, but that also minimizes the size of the crowds.
  • 6. The problem I want to go hiking at a time/day that works for me, but that also minimizes the size of the crowds. I would like to predict the crowd size for a specific location and a range of future dates.
  • 7. The problem I want to go hiking at a time/day that works for me, but that also minimizes the size of the crowds. I would like to predict the crowd size for a specific location and a range of future dates. Then I can use that prediction to make an intelligent choice about when to take my trip.
  • 8. How do we predict crowds right now? Government data Often aggregated Not always immediately accessible Check-ins Sparse coverage Prior knowledge/Intuition Not always validated
  • 10. There’s another way. We can crowdsource this problem!
  • 12. CrowdSkippr: Inner workings From flickr.com, extract the total number of photos taken at a given time/place). Extract data on temperatures from NOAA.gov for a given time/place. Using this information, create a prediction of how heavy the crowds will be at a given future time/place. TM
  • 13. Gradient Boosting Regression Predictors Day of week (Flickr) Holiday flag (Flickr) Day of year (Flickr) Daily temperature (NOAA) Response Number of photos taken (Flickr) (proxy for size of crowd)
  • 14. Photos (or visitors) per month normalized by total Wait: Is # photos a good proxy for # visitors? Photos Visitors
  • 15. Photos (or visitors) per month normalized by total Wait: Is # photos a good proxy for # visitors? Photos Visitors 0 1 2 3 4 5 6 7 0 0.5 1 1.5 x100000 x 10000 R2 =0.89 No. Photos Num.Visitors
  • 16. Day of Year Temperature Day of Week Yosemite National Park Holiday Relative Feature Importance 0 0.4 0.6 0.8 10.2
  • 17. Thanks for your time! I’m Wafa.
  • 18. For all 28-day windows in a given year, the median difference between crowd size on predicted and actual best days is 4.6%. (On the days that are predicted to have the lowest crowds, the crowd size is 29% of the worst possible crowds within that window.) Validation: Rocky Mountain National Park Predicted crowd size Actual crowd size (test data)

Editor's Notes

  • #10: On the internet, everyone has a cell phone, people leave a digital footprint of themseves wherever they go, let’s see if we can use these footprints to create a prediction of where people will be in the future.