SlideShare a Scribd company logo
Requirements for Processing Datasets
     for Recommender Systems
 Preliminary Experiences from Three Case
                 Studies

             Giannis Stoitsis
             University of Alcala, Spain
          Agro-Know Technologies, Greece

           RecSys Challenge 2012, Dublin
the learning case
• technology-enhanced learning investigates how
  information and communication technologies can
  be used to support learning and teaching, and
  competence development throughout life.
• various levels/contexts
  –   school
  –   higher education and research
  –   vocational education and training
  –   adult education
recommend resources in moodle
recommend resources in learning portal
handling multiple, diverse sets &
              streams
• various types of social data
• different schemas and formats
• multiple languages and dimensions




       Single criteria            Multi-criteria
why?
• support various usage and recommendation
  scenarios
• combining data from various sources may
  boost the way recommender work in
  education
  – bigger data
  – federated recommender systems
  – open science platform
a European social data infrastructure
              for learning

                                                                      …portals…




                 Meta     Social              Meta         Social                 Meta   Social
     Social      data                                                             data    Data
                           Data               data          Data
      Data




     API                  API                API                                         API
   Federated            Aggregation of metadata, social and usage data
Recommendation
    services


                                                     Resolution
                                                      services
                                     Social                         Metadata
                                     Data                            per URI

                                   Anonymised
Requirements for Processing Datasets for Recommender Systems
challenges
•   define common metadata schema
•   harvest/crawl social data
•   transform each social data schema
•   uri resolution
•   scalability
•   anonymised approach
•   develop item-based non personalized
    algorithms that can perform well
our open science case study
web app for testing neighborhood-based recommendation
      algorithms with multi-criteria rating dataset

                                           Export data
                                            (sql, csv)
     I need
                                                         Refine
     more!!!                     Login
                                                          data
                                         Transfom
                          Import          dataset
                        dataset (sql,
                         csv, xml)         Create
                          Prepare          dataset
                          dataset               Data
                                            characteristics
                               Visualize
                               dataset
                                             Visualize
           RecSys             Export          results
         researcher/          results
          developer
architecture

   Web UI                                                 Developers

                                      API
Components

                Refine and                        Prepare/p
  Import                       Visualize                        Evaluate
                transform                           rocess



                                      API
Cloud/Grid infra

            Monte Carlo      Social     Social   Social    Recommender
                             Data       Data     Data
             Simulator                                        services
experience from Mendeley case
experience from multi-criteria rating
   dataset from a teachers portal
                                               e.g. integration in classroom,
                                            relevance to topics, ability to help
                                                       students learn




                 Size of the neighborhood    Correlation Weight Threshold value
DEMO

More Related Content

PDF
Recommender Systems and Active Learning
PDF
Online recommendations at scale using matrix factorisation
PPTX
Multi Criteria Recommender Systems - Overview
PDF
Recommendation Engine Demystified
PDF
Recommender system algorithm and architecture
PPT
Amazon Item-to-Item Recommendations
PDF
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
PDF
Food, agriculture and open data – the world is changing
Recommender Systems and Active Learning
Online recommendations at scale using matrix factorisation
Multi Criteria Recommender Systems - Overview
Recommendation Engine Demystified
Recommender system algorithm and architecture
Amazon Item-to-Item Recommendations
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Food, agriculture and open data – the world is changing

Viewers also liked (18)

PDF
Text Mining to Correct Missing CRM Information by Jonathan Sedar
PDF
Text mining to correct missing CRM information: a practical data science project
PPT
Datamining for crm
PPTX
Recommender Systems: Advances in Collaborative Filtering
PDF
Customer relationship management_dwm_ankita_dubey
PDF
Ranking Related News Predictions
PPT
How to apply CRM using data mining techniques.
PDF
Recommender.system.presentation.pjug.01.21.2014
PDF
Solving the AL Chicken-and-Egg Corpus and Model Problem
PDF
Customer Relationship Management in Ireland Managing your Customers for Busin...
PPT
Recommendation techniques
PDF
Your own recommendation engine with neo4j and reco4php - DPC16
PDF
Summary of a Recommender Systems Survey paper
PPTX
Profile injection attack detection in recommender system
PPTX
Recommendation Engine Project Presentation
PPT
Data mining
PDF
Tutorial: Context-awareness In Information Retrieval and Recommender Systems
PPTX
Recommendation Engine Powered by Hadoop - Pranab Ghosh
Text Mining to Correct Missing CRM Information by Jonathan Sedar
Text mining to correct missing CRM information: a practical data science project
Datamining for crm
Recommender Systems: Advances in Collaborative Filtering
Customer relationship management_dwm_ankita_dubey
Ranking Related News Predictions
How to apply CRM using data mining techniques.
Recommender.system.presentation.pjug.01.21.2014
Solving the AL Chicken-and-Egg Corpus and Model Problem
Customer Relationship Management in Ireland Managing your Customers for Busin...
Recommendation techniques
Your own recommendation engine with neo4j and reco4php - DPC16
Summary of a Recommender Systems Survey paper
Profile injection attack detection in recommender system
Recommendation Engine Project Presentation
Data mining
Tutorial: Context-awareness In Information Retrieval and Recommender Systems
Recommendation Engine Powered by Hadoop - Pranab Ghosh
Ad

Similar to Requirements for Processing Datasets for Recommender Systems (20)

PPTX
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
PPTX
Linked Data as a Service
PPTX
Building a Data Discovery Network for Sustainability Science
PPTX
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
PPTX
Everything Self-Service:Linked Data Applications with the Information Workbench
PPTX
Introduction to Microsoft SQL Server 2008 R2 Analysis Service
PPTX
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
PDF
LeaderQuest SharePoint Business Intelligence Presentation
PDF
20130117 - Big Data Architectures
PPT
Metadata-powered dissemination of content
PDF
Eclipse day Sydney 2014 BIG data presentation
PPTX
Autoservicio de inteligencia de negocios
PDF
STI Summit 2011 - Digital Worlds
PPT
Revisiting the Multi-Criteria Recommender System of a Learning Portal
PDF
Enterprise Sharepoint Portal
PPTX
Future.ready().watson dataplatform 01
PDF
BI Dashboards with SQL Server 2008 R2
PPTX
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
PPTX
Machine Learning Models in Production
PPTX
Big Data SE vs. SE for Big Data
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
Linked Data as a Service
Building a Data Discovery Network for Sustainability Science
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
Everything Self-Service:Linked Data Applications with the Information Workbench
Introduction to Microsoft SQL Server 2008 R2 Analysis Service
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
LeaderQuest SharePoint Business Intelligence Presentation
20130117 - Big Data Architectures
Metadata-powered dissemination of content
Eclipse day Sydney 2014 BIG data presentation
Autoservicio de inteligencia de negocios
STI Summit 2011 - Digital Worlds
Revisiting the Multi-Criteria Recommender System of a Learning Portal
Enterprise Sharepoint Portal
Future.ready().watson dataplatform 01
BI Dashboards with SQL Server 2008 R2
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
Machine Learning Models in Production
Big Data SE vs. SE for Big Data
Ad

More from Stoitsis Giannis (15)

PDF
Agroknow and FREME presentation @Linda workshop-20-11-2015
PDF
The Open Data Stakeholders’ Ecosystem
PDF
Open Data in the agrifood sector
PDF
Open-data-in-agrifood-sector-challenges-opportunities
PPTX
How internet and open data transforms the agricultural sector (in greek)
PDF
Facilitating regional growth through they use of open agricultural data
PPTX
City to-farm agro-know
PPTX
Open data: Showcases from agricultural domain
PDF
How e-infrastructure can contribute to Linked Germplasm Data
PPTX
Open Data Working Group - Agricultural Showcase
PDF
Intro to-technologies-Green-City-Hackathon-Athens
PPT
Ag infra kream-presentation-7-6-2013
PPTX
Cetaf ISTC Meeting: Natural-Europe Presentation
PPTX
E services for learning in agriculture-stevia-event-dec-2012
PPTX
Organic.lingua presentation cer_organic
Agroknow and FREME presentation @Linda workshop-20-11-2015
The Open Data Stakeholders’ Ecosystem
Open Data in the agrifood sector
Open-data-in-agrifood-sector-challenges-opportunities
How internet and open data transforms the agricultural sector (in greek)
Facilitating regional growth through they use of open agricultural data
City to-farm agro-know
Open data: Showcases from agricultural domain
How e-infrastructure can contribute to Linked Germplasm Data
Open Data Working Group - Agricultural Showcase
Intro to-technologies-Green-City-Hackathon-Athens
Ag infra kream-presentation-7-6-2013
Cetaf ISTC Meeting: Natural-Europe Presentation
E services for learning in agriculture-stevia-event-dec-2012
Organic.lingua presentation cer_organic

Recently uploaded (20)

PDF
Weekly quiz Compilation Jan -July 25.pdf
PDF
Complications of Minimal Access Surgery at WLH
PDF
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PPTX
Introduction to Building Materials
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PPTX
Unit 4 Skeletal System.ppt.pptxopresentatiom
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
Cell Types and Its function , kingdom of life
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
Hazard Identification & Risk Assessment .pdf
PDF
1_English_Language_Set_2.pdf probationary
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
Indian roads congress 037 - 2012 Flexible pavement
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
Supply Chain Operations Speaking Notes -ICLT Program
Weekly quiz Compilation Jan -July 25.pdf
Complications of Minimal Access Surgery at WLH
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
Orientation - ARALprogram of Deped to the Parents.pptx
Introduction to Building Materials
Final Presentation General Medicine 03-08-2024.pptx
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
Unit 4 Skeletal System.ppt.pptxopresentatiom
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Cell Types and Its function , kingdom of life
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Chinmaya Tiranga quiz Grand Finale.pdf
Hazard Identification & Risk Assessment .pdf
1_English_Language_Set_2.pdf probationary
Final Presentation General Medicine 03-08-2024.pptx
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
Paper A Mock Exam 9_ Attempt review.pdf.
Indian roads congress 037 - 2012 Flexible pavement
LDMMIA Reiki Yoga Finals Review Spring Summer
Supply Chain Operations Speaking Notes -ICLT Program

Requirements for Processing Datasets for Recommender Systems

  • 1. Requirements for Processing Datasets for Recommender Systems Preliminary Experiences from Three Case Studies Giannis Stoitsis University of Alcala, Spain Agro-Know Technologies, Greece RecSys Challenge 2012, Dublin
  • 2. the learning case • technology-enhanced learning investigates how information and communication technologies can be used to support learning and teaching, and competence development throughout life. • various levels/contexts – school – higher education and research – vocational education and training – adult education
  • 4. recommend resources in learning portal
  • 5. handling multiple, diverse sets & streams • various types of social data • different schemas and formats • multiple languages and dimensions Single criteria Multi-criteria
  • 6. why? • support various usage and recommendation scenarios • combining data from various sources may boost the way recommender work in education – bigger data – federated recommender systems – open science platform
  • 7. a European social data infrastructure for learning …portals… Meta Social Meta Social Meta Social Social data data Data Data data Data Data API API API API Federated Aggregation of metadata, social and usage data Recommendation services Resolution services Social Metadata Data per URI Anonymised
  • 9. challenges • define common metadata schema • harvest/crawl social data • transform each social data schema • uri resolution • scalability • anonymised approach • develop item-based non personalized algorithms that can perform well
  • 10. our open science case study
  • 11. web app for testing neighborhood-based recommendation algorithms with multi-criteria rating dataset Export data (sql, csv) I need Refine more!!! Login data Transfom Import dataset dataset (sql, csv, xml) Create Prepare dataset dataset Data characteristics Visualize dataset Visualize RecSys Export results researcher/ results developer
  • 12. architecture Web UI Developers API Components Refine and Prepare/p Import Visualize Evaluate transform rocess API Cloud/Grid infra Monte Carlo Social Social Social Recommender Data Data Data Simulator services
  • 14. experience from multi-criteria rating dataset from a teachers portal e.g. integration in classroom, relevance to topics, ability to help students learn Size of the neighborhood Correlation Weight Threshold value
  • 15. DEMO

Editor's Notes

  • #4: smirti.bhagat@technicolor.com
  • #8: Example of using Recommendation API: recommend(itemURI,limit_of_resources), recommend(itemURI,user_tags) Example of social data API provided by the aggregator: get_tags(itemURI), get_reviews(itemURI) etc
  • #13: Here we present the architecture of such an environment and the proposed software stackMonte Carlo will be a separate component that can run also on the Grid and that will br provided through an API. The API will be documented.