SlideShare a Scribd company logo
Building a Library Recommendation Engine 
www.refme.com 
@getrefme 
Martina Pugliese 
Daniel Pape 
Richard Hanson
What is RefME? 
• award-winning referencing tool! 
• synchronised website + mobile app! 
• easy to use! 
• barcode scanning! 
• many reference types! 
• import from several formats! 
• export in 1000s of styles 
2 / 13
Who uses RefME? 
• 100 000s users! 
• mainly undergraduate students! 
• writing essays, theses, building 
bibliographies! 
• Other tools focus on academic 
market! 
• RefME is much more general 
3 / 13
Our Task 
4 
RefME wants to broaden the appeal of their product by 
offering a tool to build better bibliographies. 
/ 13
Our Task 
4 
RefME wants to broaden the appeal of their product by 
offering a tool to build better bibliographies. 
/ 13 
“take the search out of research”
Our Task 
Task: Develop a recommendation system! 
• suggest relevant references to users! 
• general enough to handle any reference type 
4 
RefME wants to broaden the appeal of their product by 
offering a tool to build better bibliographies. 
/ 13 
“take the search out of research”
The Data 
• small subset of data-base! 
• 21 000 projects with 120 000+ references! 
• references are user-generated and associated via projects! 
• pre-processing: sort/unify identifiers (isbn, doi, url) + retrieve 
subject keywords 
Recommendation! 
Engine Project ID Reference IDs 
5 / 13
Collaborative Filtering Content-based Recommendations 
6 / 13 
Our Recommendation Engine 
“ask your friends” “look at content information”
Collaborative Filtering 
Project Based 
similar projects 
nearest neighbours 
7 / 13
Project Based Reference Based 
similar projects 
nearest neighbours 
interaction patterns 
co-occurrence 
Collaborative Filtering 
7 / 13
Collaborative Filtering 
Project Based Reference Based 
similar projects 
nearest neighbours 
interaction patterns 
co-occurrence 
multiple similarity measures 
for robust results 
7 / 13
Collaborative Filtering 
Project Based Reference Based 
similar projects 
nearest neighbours 
interaction patterns 
co-occurrence 
multiple similarity measures 
for robust results 
recommendations 
7 / 13
Collaborative Filtering Content-based Recommendations 
8 / 13
Content-based Recommendations 
9 / 13
Content-based Recommendations 
no references 
9 / 13
Content-based Recommendations 
no references 
project title 
compare titles with NLP 
retrieve subjects in matching 
projects 
extract references based on 
subject frequency 
9 / 13
Content-based Recommendations 
no references with references 
project title 
compare titles with NLP 
retrieve subjects in matching 
projects 
extract references based on 
subject frequency 
9 / 13
Content-based Recommendations 
no references with references 
project title subject keywords 
compare titles with NLP 
retrieve subjects in matching 
projects 
extract references based on 
subject frequency 
measure similarity 
extract most similar references 
9 / 13
Benefits of two approaches 
Collaborative Filtering Content-based Recommendations 
10 / 13 
independent of item frequency 
good for common topics 
good for items with sufficient 
frequency
Project: “Media Research” 
• Relocating Television: Television in the Digital Context 
• Transmedia Television: Audiences, New Media and Daily Life 
• Television and Its Audience 
• Television Goes Digital. The Economics of Information 
Communication and Entertainment: The Impacts of Digital 
Technology in the 21st Century. 
OUTPUT INPUT 11 / 13 
• Grown Up Digital How the Net Generation is Changing Your World 
• Future Minds: How the Digital Age is Changing Our Minds Why 
This Matters and What We Can Do About It 
• The Television Studies Book 
• Television and Everyday Life
Project: “Group Presentation” 
• Person to Person: Ways of Communicating 
• The Psychology of Interpersonal Perception 
• Interpersonal Communication 
• Body Movement and Interpersonal Communication 
• Intergroup Behaviour 
OUTPUT INPUT 12 / 13 
• The Social Psychology of Everyday 
• Interviews Made Easy How to Get the Psychological Advantage 
• Groups at Work: Theory and Research 
• Theories of Human Communication (with InfoTrac)
13 
Summary 
/ 13 
RefME wants to broaden the appeal of their product by 
offering a tool to build better bibliographies. 
Developed recommendation system! 
• yields good results for any reference type! 
• will be incorporated as premium feature soon
www.refme.com 
@getrefme 
Martina Pugliese 
Daniel Pape 
Richard Hanson
Used Tools 
• Python NLTK libraries! 
• openlibrary.org API! 
• crossref.org API! 
• mahout.apache.org! 
• R, Java

More Related Content

PDF
Customizing Discovery at the University of Michigan
PPTX
Currents steps to be a researcher and faculty
PDF
Fostering Organizational Change through Service and Space Design Strategy
PPTX
How Do Librarians Prefer to Access Collections?
PDF
Psicologia
PDF
Presentación contratación internacional.
PPTX
How to Install SSL Certificate on FileZilla Server
PDF
Breaking the rules - SECR 2012
Customizing Discovery at the University of Michigan
Currents steps to be a researcher and faculty
Fostering Organizational Change through Service and Space Design Strategy
How Do Librarians Prefer to Access Collections?
Psicologia
Presentación contratación internacional.
How to Install SSL Certificate on FileZilla Server
Breaking the rules - SECR 2012

Viewers also liked (15)

PPT
Neural
PDF
The origin: Init (compact version)
PDF
Sales & Marketing Network
PDF
Where to submit_guest_blog_post
PDF
Consejos para disfrutar tus vacaciones.
PPTX
Kinoma Element 味見の会
PPTX
Linked in congratulates pradyp kanabar feb2013
PDF
What if Trump Won?!?
XLS
Dat Afull Thanasis Plhres Eommex
PDF
Atlanta: Tech Center of the Future
PPTX
Start Up - Just Next Door Final Pitch
PPT
Understanding the Basics of Personal Data: Vendors, Users, and You (Web 2.0 NYC)
KEY
Top 10 reasons people love listly and list posts
PDF
5-day process for answering critical business questions
PDF
Whitepaper_Publicism_06122016
Neural
The origin: Init (compact version)
Sales & Marketing Network
Where to submit_guest_blog_post
Consejos para disfrutar tus vacaciones.
Kinoma Element 味見の会
Linked in congratulates pradyp kanabar feb2013
What if Trump Won?!?
Dat Afull Thanasis Plhres Eommex
Atlanta: Tech Center of the Future
Start Up - Just Next Door Final Pitch
Understanding the Basics of Personal Data: Vendors, Users, and You (Web 2.0 NYC)
Top 10 reasons people love listly and list posts
5-day process for answering critical business questions
Whitepaper_Publicism_06122016
Ad

Similar to S2DS final project presentation: Building a recommendation engine for RefME (20)

PPTX
Recommender system
PDF
Mechanical Librarian
PDF
Recommendation engines : Matching items to users
PDF
Recommendation engines matching items to users
PPT
Social Recommender Systems Tutorial - WWW 2011
KEY
Recommender Engines
PPT
Digital Trails Dave King 1 5 10 Part 2 D3
PPTX
Lecture Notes on Recommender System Introduction
PPTX
Mini-training: Personalization & Recommendation Demystified
PDF
Investigation and application of Personalizing Recommender Systems based on A...
PPTX
SNA - Recommendation in social media.pptx
PDF
recommendationsystem1-221109055232-c8b46131.pdf
PPTX
Recommendation system (1).pptx
PDF
AN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONS
PDF
Bv31491493
PDF
Icete content-based filtering with applications on tv viewing data
DOC
WORD
PPTX
Recommended System.pptx
PDF
Collaborative Filtering and Recommender Systems By Navisro Analytics
PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
Recommender system
Mechanical Librarian
Recommendation engines : Matching items to users
Recommendation engines matching items to users
Social Recommender Systems Tutorial - WWW 2011
Recommender Engines
Digital Trails Dave King 1 5 10 Part 2 D3
Lecture Notes on Recommender System Introduction
Mini-training: Personalization & Recommendation Demystified
Investigation and application of Personalizing Recommender Systems based on A...
SNA - Recommendation in social media.pptx
recommendationsystem1-221109055232-c8b46131.pdf
Recommendation system (1).pptx
AN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONS
Bv31491493
Icete content-based filtering with applications on tv viewing data
WORD
Recommended System.pptx
Collaborative Filtering and Recommender Systems By Navisro Analytics
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
Ad

Recently uploaded (20)

PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PPTX
Cell Types and Its function , kingdom of life
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PPTX
master seminar digital applications in india
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
A systematic review of self-coping strategies used by university students to ...
PPTX
Pharma ospi slides which help in ospi learning
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Weekly quiz Compilation Jan -July 25.pdf
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Chinmaya Tiranga quiz Grand Finale.pdf
Cell Types and Its function , kingdom of life
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Microbial diseases, their pathogenesis and prophylaxis
Orientation - ARALprogram of Deped to the Parents.pptx
master seminar digital applications in india
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
FourierSeries-QuestionsWithAnswers(Part-A).pdf
A systematic review of self-coping strategies used by university students to ...
Pharma ospi slides which help in ospi learning
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Microbial disease of the cardiovascular and lymphatic systems
Supply Chain Operations Speaking Notes -ICLT Program
Weekly quiz Compilation Jan -July 25.pdf
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
O5-L3 Freight Transport Ops (International) V1.pdf
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
O7-L3 Supply Chain Operations - ICLT Program
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf

S2DS final project presentation: Building a recommendation engine for RefME

  • 1. Building a Library Recommendation Engine www.refme.com @getrefme Martina Pugliese Daniel Pape Richard Hanson
  • 2. What is RefME? • award-winning referencing tool! • synchronised website + mobile app! • easy to use! • barcode scanning! • many reference types! • import from several formats! • export in 1000s of styles 2 / 13
  • 3. Who uses RefME? • 100 000s users! • mainly undergraduate students! • writing essays, theses, building bibliographies! • Other tools focus on academic market! • RefME is much more general 3 / 13
  • 4. Our Task 4 RefME wants to broaden the appeal of their product by offering a tool to build better bibliographies. / 13
  • 5. Our Task 4 RefME wants to broaden the appeal of their product by offering a tool to build better bibliographies. / 13 “take the search out of research”
  • 6. Our Task Task: Develop a recommendation system! • suggest relevant references to users! • general enough to handle any reference type 4 RefME wants to broaden the appeal of their product by offering a tool to build better bibliographies. / 13 “take the search out of research”
  • 7. The Data • small subset of data-base! • 21 000 projects with 120 000+ references! • references are user-generated and associated via projects! • pre-processing: sort/unify identifiers (isbn, doi, url) + retrieve subject keywords Recommendation! Engine Project ID Reference IDs 5 / 13
  • 8. Collaborative Filtering Content-based Recommendations 6 / 13 Our Recommendation Engine “ask your friends” “look at content information”
  • 9. Collaborative Filtering Project Based similar projects nearest neighbours 7 / 13
  • 10. Project Based Reference Based similar projects nearest neighbours interaction patterns co-occurrence Collaborative Filtering 7 / 13
  • 11. Collaborative Filtering Project Based Reference Based similar projects nearest neighbours interaction patterns co-occurrence multiple similarity measures for robust results 7 / 13
  • 12. Collaborative Filtering Project Based Reference Based similar projects nearest neighbours interaction patterns co-occurrence multiple similarity measures for robust results recommendations 7 / 13
  • 13. Collaborative Filtering Content-based Recommendations 8 / 13
  • 15. Content-based Recommendations no references 9 / 13
  • 16. Content-based Recommendations no references project title compare titles with NLP retrieve subjects in matching projects extract references based on subject frequency 9 / 13
  • 17. Content-based Recommendations no references with references project title compare titles with NLP retrieve subjects in matching projects extract references based on subject frequency 9 / 13
  • 18. Content-based Recommendations no references with references project title subject keywords compare titles with NLP retrieve subjects in matching projects extract references based on subject frequency measure similarity extract most similar references 9 / 13
  • 19. Benefits of two approaches Collaborative Filtering Content-based Recommendations 10 / 13 independent of item frequency good for common topics good for items with sufficient frequency
  • 20. Project: “Media Research” • Relocating Television: Television in the Digital Context • Transmedia Television: Audiences, New Media and Daily Life • Television and Its Audience • Television Goes Digital. The Economics of Information Communication and Entertainment: The Impacts of Digital Technology in the 21st Century. OUTPUT INPUT 11 / 13 • Grown Up Digital How the Net Generation is Changing Your World • Future Minds: How the Digital Age is Changing Our Minds Why This Matters and What We Can Do About It • The Television Studies Book • Television and Everyday Life
  • 21. Project: “Group Presentation” • Person to Person: Ways of Communicating • The Psychology of Interpersonal Perception • Interpersonal Communication • Body Movement and Interpersonal Communication • Intergroup Behaviour OUTPUT INPUT 12 / 13 • The Social Psychology of Everyday • Interviews Made Easy How to Get the Psychological Advantage • Groups at Work: Theory and Research • Theories of Human Communication (with InfoTrac)
  • 22. 13 Summary / 13 RefME wants to broaden the appeal of their product by offering a tool to build better bibliographies. Developed recommendation system! • yields good results for any reference type! • will be incorporated as premium feature soon
  • 23. www.refme.com @getrefme Martina Pugliese Daniel Pape Richard Hanson
  • 24. Used Tools • Python NLTK libraries! • openlibrary.org API! • crossref.org API! • mahout.apache.org! • R, Java