Presented By
Md. Farhan Tanvir(2014-2-60-124)
Kevin Stephen Bishwas (2014-2-60-091)
Nazmul Hasan(2014-2-60-063)
Supervised By
Dr. Mohammad Rezwanul Huq
Assistant Professor
Department Of Computer Science And Engineering
East West University
Clustering-based Location Recommendation System
The world is an over-crowded place
They all want to get our attention
We are overloaded
• Thousands of new places to visit
• Millions of restaurants, hotels, and parks to visit
Can Google Help?
• Yes, but only when we really know what we are looking for
• What if I just want some interesting place to visit?
– By the way, what does “interesting” even mean?
Can Facebook Help?
• Yes, I tend to find my friends’ stuff interesting
• What if I have only a few friends, and the places they visit do not always appeal to me?
Can experts help?
• Yes, but it won’t scale well
– Everyone receives exactly the same advice!
• It is what they like, not what I like!
– As with restaurants, expert approval does not guarantee popularity with the general public.
OK, here is the idea: a Recommendation System
• A recommendation system is an information filtering technique that provides users with information they may be interested in.
• Based on
- Past behavior
- Relations to the user
- Item similarity
- Context
Existing Work
• Ling Li, Ya Zhou, Han Xiong, Cailin Hu, "Collaborative filtering based on user attributes and user ratings for restaurant recommendation," 2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC).
• Zhiyang Jia, Wei Gao, Yuting Yang, Xu Chen, "User-based Collaborative Filtering for Tourist Attraction Recommendations," 2015 IEEE International Conference on Computational Intelligence & Communication Technology.
• Lakshmi Tharun Ponnam, Sreenivasa Deepak Punyasamudram, Siva Nagaraju Nallagulla, Srikanth Yellamati, "Movie Recommender System Using Item Based Collaborative Filtering Technique," 2016 International Conference on Emerging Trends in Engineering, Technology and Science (ICETETS).
Our Proposal
Input Dataset → Data Cleaning → Feature Engineering → Clustering → Find User Preference → Result
Our Dataset
• Foursquare NYC Check-in Dataset
• https://sites.google.com/site/yangdingqi/home/foursquare-dataset
Attributes of our Dataset
1 User ID
2 Venue ID
3 Venue Category ID
4 Venue Category
5 Latitude
6 Longitude
7 Time zone offset
8 UTC time
After data cleaning and feature engineering, the dataset has a different set of attributes.
What do data cleaning and feature engineering involve?
Task 1: Data Cleaning
• Removing home check-ins:
- The dataset did not contain home check-ins for all users, so we removed the home check-ins that were present during cleaning.
Task 1: Data Cleaning (Cont.)
• Replacing multiple categories of a venue:
User Id   Venue Id   Venue Category Id   Venue Category
1         V-1        C001                Bar
1         V-1        C002                Bar
1         V-1        C001                Bar
1         V-1        C002                Bar
1         V-1        C002                Park
Figure: Before replacing
User Id   Venue Id   Venue Category Id   Venue Category
1         V-1        C002                Bar
1         V-1        C002                Bar
1         V-1        C002                Bar
1         V-1        C002                Bar
1         V-1        C002                Bar
Figure: After replacing
Task 1: Data Cleaning (Cont.)
• Replacing sub-category IDs in the category ID column:
User Id   Venue Id   Venue Category Id   Venue Category
1         V-1        C001                Bar
1         V-2        C002                Bar
1         V-3        C001                Bar
1         V-4        C002                Bar
1         V-5        C002                Bar
Figure: Before replacing
User Id   Venue Id   Venue Category Id   Venue Category
1         V-1        C002                Bar
1         V-2        C002                Bar
1         V-3        C002                Bar
1         V-4        C002                Bar
1         V-5        C002                Bar
Figure: After replacing
Task 1: Data Cleaning (Cont.)
• Replacing differing latitude and longitude values of a venue:
Venue Id   Latitude   Longitude
V-1        40         -73
V-1        43         -70
V-1        43         -70
V-1        40         -73
V-1        40         -73
Figure: Before replacing
Venue Id   Latitude   Longitude
V-1        40         -73
V-1        40         -73
V-1        40         -73
V-1        40         -73
V-1        40         -73
Figure: After replacing
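The replacement steps above can be sketched in a few lines of Python. This is only an illustration under a stated assumption: the slides do not say how the surviving value is chosen, so the sketch keeps the most frequent value per venue, which is consistent with the before/after figures. The function name `harmonize_by_venue` and the field names are hypothetical.

```python
from collections import Counter, defaultdict

def harmonize_by_venue(rows, fields, key="venue_id"):
    """Give every row of a venue the same values for `fields`.

    Assumption (not stated in the slides): the most frequent value wins.
    Counter.most_common keeps first-seen order among ties.
    """
    votes = defaultdict(Counter)
    for row in rows:
        votes[row[key]][tuple(row[f] for f in fields)] += 1
    winner = {venue: c.most_common(1)[0][0] for venue, c in votes.items()}
    # Build new rows so the input list is left untouched.
    return [{**row, **dict(zip(fields, winner[row[key]]))} for row in rows]

# Rows mirroring the "before" figures for venue V-1.
rows = [
    {"venue_id": "V-1", "category_id": "C001", "lat": 40, "lon": -73},
    {"venue_id": "V-1", "category_id": "C002", "lat": 43, "lon": -70},
    {"venue_id": "V-1", "category_id": "C001", "lat": 43, "lon": -70},
    {"venue_id": "V-1", "category_id": "C002", "lat": 40, "lon": -73},
    {"venue_id": "V-1", "category_id": "C002", "lat": 40, "lon": -73},
]

cleaned = harmonize_by_venue(rows, fields=("category_id",))
# Latitude and longitude are replaced together so they stay a valid pair.
cleaned = harmonize_by_venue(cleaned, fields=("lat", "lon"))
```

Treating latitude and longitude as one pair avoids mixing the latitude of one reported position with the longitude of another.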
Task 2: Feature Engineering
• Check-in counts:
User Id   Venue Id   Check-In Count
1083      V-1        3
1083      V-2        1
1083      V-3        1
1083      V-4        2
1083      V-5        1
Figure: After adding the Check-In Count attribute
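As a rough sketch of how the Check-In Count attribute could be derived, the raw check-in records can be grouped by user and venue. The helper name `checkin_counts` and the sample records are hypothetical; the records are arranged so that the result matches the figure for user 1083.

```python
from collections import Counter

def checkin_counts(checkins):
    """Count check-ins per (user_id, venue_id) pair."""
    return Counter((user_id, venue_id) for user_id, venue_id in checkins)

# One tuple per raw check-in record (illustrative data).
checkins = [
    (1083, "V-1"), (1083, "V-2"), (1083, "V-1"), (1083, "V-3"),
    (1083, "V-4"), (1083, "V-1"), (1083, "V-4"), (1083, "V-5"),
]

counts = checkin_counts(checkins)
for (user_id, venue_id), n in sorted(counts.items()):
    print(user_id, venue_id, n)
```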
Task 2: Feature Engineering (Cont.)
• Venue distance from the user’s center:
- First, we find the user’s center point by averaging the latitudes and longitudes of the places where the user has previously checked in.
- Then, using this center point, we calculate the distance to each venue with the haversine formula:
$$d = 2r \sin^{-1}\left(\sqrt{\sin^2\left(\frac{\varphi_2 - \varphi_1}{2}\right) + \cos\varphi_1 \cos\varphi_2 \sin^2\left(\frac{\lambda_2 - \lambda_1}{2}\right)}\right)$$
where
• d is the distance between the two points,
• r is the radius of the sphere,
• φ1, φ2 are the latitudes of point 1 and point 2, in radians,
• λ1, λ2 are the longitudes of point 1 and point 2, in radians.
Reference: https://www.movable-type.co.uk/scripts/latlong.html
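A minimal Python sketch of the two steps above (center point, then haversine distance) might look like the following. The function names and the sample coordinates are hypothetical, and the mean Earth radius of 6371 km is an assumption, since the slides only refer to r as the radius of the sphere.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius; the slides only say "r"

def user_center(points):
    """Average the latitudes and longitudes of a user's check-ins (degrees)."""
    lats = [lat for lat, _ in points]
    lons = [lon for _, lon in points]
    return sum(lats) / len(lats), sum(lons) / len(lons)

def haversine_km(lat1, lon1, lat2, lon2, r=EARTH_RADIUS_KM):
    """Great-circle distance between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    # Clamp to 1.0 so rounding error cannot push asin out of its domain.
    return 2 * r * math.asin(math.sqrt(min(1.0, a)))

visited = [(40.0, -73.0), (42.0, -71.0)]  # illustrative check-in coordinates
center = user_center(visited)
distance = haversine_km(center[0], center[1], 40.0, -73.0)
```

Averaging raw latitudes and longitudes is a reasonable approximation for check-ins confined to one city; it would be less accurate for points spread across the globe or across the antimeridian.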
Our Dataset After Feature Engineering
1 User ID
2 Venue ID
3 Venue Category ID
4 Venue Category
5 Latitude
6 Longitude
7 Distance From Center
8 Check In Count
Task 2: Clustering
• We used KNN (k-nearest neighbors) as the clustering algorithm.
• First, we compute the similarity between users using Pearson correlation. We also tried cosine similarity, but Pearson correlation gave better results.
$$\mathrm{sim}(u, v) = \frac{\sum_{i \in I_{uv}} (R_{ui} - \bar{R}_u)(R_{vi} - \bar{R}_v)}{\sqrt{\sum_{i \in I_{uv}} (R_{ui} - \bar{R}_u)^2}\,\sqrt{\sum_{i \in I_{uv}} (R_{vi} - \bar{R}_v)^2}}$$
where
• $R_{ui}$ and $R_{vi}$ are the check-in counts of the i-th item for users u and v respectively,
• $\bar{R}_u$ and $\bar{R}_v$ are the average check-in counts of users u and v respectively,
• $I_{uv}$ denotes the set of items checked in to by both users u and v.
Reference: Collaborative filtering based on user attributes and user ratings for restaurant recommendation
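The similarity above can be sketched in Python as follows. Two points are assumptions rather than details from the slides: the averages are taken over the co-visited venues only, and the similarity is defined as 0.0 when there is no overlap or no variation. The function name `pearson_similarity` is hypothetical; the sample counts come from the Example slide.

```python
import math

def pearson_similarity(counts_u, counts_v):
    """Pearson correlation over venues both users checked in to.

    counts_u, counts_v: dict mapping venue -> check-in count.
    Assumption: user means are computed over the co-visited venues only.
    """
    common = sorted(set(counts_u) & set(counts_v))
    if not common:
        return 0.0
    mean_u = sum(counts_u[i] for i in common) / len(common)
    mean_v = sum(counts_v[i] for i in common) / len(common)
    num = sum((counts_u[i] - mean_u) * (counts_v[i] - mean_v) for i in common)
    den_u = math.sqrt(sum((counts_u[i] - mean_u) ** 2 for i in common))
    den_v = math.sqrt(sum((counts_v[i] - mean_v) ** 2 for i in common))
    if den_u == 0 or den_v == 0:
        return 0.0  # no variation on the shared venues: correlation undefined
    return num / (den_u * den_v)

me = {"Place1": 3, "Place3": 5}
you = {"Place1": 3, "Place3": 5, "Place4": 6}
another_guy = {"Place1": 4, "Place2": 2, "Place4": 1}

sim_me_you = pearson_similarity(me, you)
sim_me_other = pearson_similarity(me, another_guy)
```

With a single shared venue the deviations from the mean are all zero, so the sketch falls back to 0.0 instead of dividing by zero.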
Task 2: Clustering (Cont.)
• After computing the similarities, we take the top n nearest neighbors.
• We then use their check-in counts to predict a check-in count for every place the user has not yet checked in to. The prediction is a weighted average of the neighbors’ check-in counts.
• Finally, we take the places with the highest predicted check-in counts.
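The prediction step can be sketched as below. Several details are assumptions, because the slides only say "weighted average": the weights are taken to be the similarity scores, only neighbors with positive similarity are used, and neighbors who never visited the venue are skipped. The function names are hypothetical, the check-in counts come from the Example slide, and the similarity scores are illustrative placeholder values, not results from the project.

```python
def predict_count(target, venue, counts, sims, n):
    """Similarity-weighted average of the top-n neighbors' counts for `venue`."""
    neighbors = sorted(
        (u for u in sims if u != target and sims[u] > 0),
        key=lambda u: sims[u],
        reverse=True,
    )[:n]
    voters = [u for u in neighbors if venue in counts[u]]
    total = sum(sims[u] for u in voters)
    if total == 0:
        return None  # no neighbor has visited this venue
    return sum(sims[u] * counts[u][venue] for u in voters) / total

def recommend(target, counts, sims, n, top_k):
    """Rank venues the target has not visited by predicted check-in count."""
    all_venues = {v for c in counts.values() for v in c}
    unseen = all_venues - set(counts[target])
    scored = [(v, predict_count(target, v, counts, sims, n)) for v in unseen]
    scored = [(v, p) for v, p in scored if p is not None]
    # Highest prediction first; venue name breaks ties deterministically.
    return sorted(scored, key=lambda vp: (-vp[1], vp[0]))[:top_k]

counts = {
    "Me": {"Place1": 3, "Place3": 5},
    "My Friend": {"Place1": 4, "Place2": 6},
    "You": {"Place1": 3, "Place3": 5, "Place4": 6},
    "Another guy": {"Place1": 4, "Place2": 2, "Place4": 1},
    "Your Friend": {"Place1": 8, "Place4": 3},
}
# Illustrative similarities of each user to "Me" (placeholder values).
sims = {"You": 1.0, "My Friend": 0.9, "Another guy": 0.5, "Your Friend": 0.2}

place4 = predict_count("Me", "Place4", counts, sims, n=2)
top = recommend("Me", counts, sims, n=2, top_k=2)
```

With n=2 the only neighbor who has visited Place4 is "You", so the predicted count equals that user's count.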
Task 3: Find User Preference
• We take the distance of each of the user’s check-ins from the center point and compute the mean distance. If most of the user’s check-ins are farther away than the mean distance, we say the user likes to travel long distances; otherwise, the user prefers short distances. We then sort the recommendations according to this preference.
• Example:
User’s mean check-in distance = 50 km
The user has 50 check-ins.
30 of them are more than 50 km from the center.
Result: the user likes to travel long distances.
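The preference rule can be sketched as follows. The function names are hypothetical, and the sort direction is an assumption: the slides say only that recommendations are sorted by preference, so the sketch puts farther venues first for long-distance users and nearer venues first otherwise. The sample distances are constructed to reproduce the example (50 check-ins, mean 50 km, 30 above the mean).

```python
def prefers_long_distance(distances_km):
    """True if more than half of the check-ins are farther than the mean distance."""
    mean = sum(distances_km) / len(distances_km)
    farther = sum(1 for d in distances_km if d > mean)
    return farther > len(distances_km) / 2

def sort_by_preference(recommendations, likes_long):
    """Sort (venue, distance_km) pairs; farthest first if the user likes long trips."""
    return sorted(recommendations, key=lambda r: r[1], reverse=likes_long)

# 30 check-ins at 70 km and 20 at 20 km give a mean of exactly 50 km.
distances = [70.0] * 30 + [20.0] * 20
likes_long = prefers_long_distance(distances)

recommendations = [("A", 10.0), ("B", 80.0), ("C", 40.0)]  # illustrative venues
ordered = sort_by_preference(recommendations, likes_long)
```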
Example
              Place1   Place2   Place3   Place4
Me            3        -        5        ?
My Friend     4        6        -        -
You           3        -        5        6
Another guy   4        2        -        1
Your Friend   8        -        -        3
What is the probable check-in count for Place4?
Example (Cont.)
              Place1   Place2   Place3   Place4
Me            3        -        5        6
My Friend     4        6        -        -
You           3        -        5        6
Another guy   4        2        -        1
Your Friend   8        -        -        3
Evaluation
• We used sampling and RMSE (root-mean-square error) to evaluate our recommendations.
• Sampling: 10% of the entire dataset was selected randomly, without replacement, to form a sample dataset.
• RMSE: used to evaluate the algorithm. It measures the error between the predicted check-in count and the actual check-in count of a venue for a specific user in the test dataset.
RMSE formula:
$$\mathrm{RMSE} = \sqrt{\frac{\sum_{i=1}^{N} (P_{u,i} - R_{u,i})^2}{N}}$$
Here:
$P_{u,i}$ is the predicted check-in count for user u on venue i
$R_{u,i}$ is the actual check-in count for user u on venue i
N is the total number of venues where the user checked in
Reference: Collaborative filtering based on user attributes and user ratings for restaurant recommendation
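A small Python sketch of the two evaluation ingredients, sampling without replacement and RMSE, is given below. The function names, the fixed random seed, and the sample numbers are illustrative assumptions and not part of the original evaluation.

```python
import math
import random

def sample_without_replacement(records, fraction=0.10, seed=0):
    """Pick a random fraction of the records; no record is chosen twice."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    k = max(1, round(len(records) * fraction))
    return rng.sample(records, k)

def rmse(predicted, actual):
    """Root-mean-square error between predicted and actual check-in counts."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((p - r) ** 2 for p, r in zip(predicted, actual))
    return math.sqrt(squared / len(predicted))

records = list(range(100))        # stand-in for 100 check-in records
sample = sample_without_replacement(records)

error = rmse([2.0, 4.0], [4.0, 4.0])  # sqrt((4 + 0) / 2)
```

A lower RMSE means the predicted check-in counts are closer to the actual ones on the held-out records.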
RMSE Graph
Figure: RMSE graph
Demo
• We created a simple demo in which users enter their ID and the system recommends places for them.
Figure: Input user ID
Figure: Output recommendation
Future Work
• Try a model-based recommendation system
• Add more domains
• Try a triangulation technique to find the user’s center point
