SlideShare a Scribd company logo
Learning Knowledge Rich User Models from the Semantic Web Gunnar Aastrand Grimnes User Modeling 2003 – Doctoral Consortium 24th June, 2003
Presentation Overview Motivation Part I - Preliminary Experiments Part II - Agentcities & GraniteNights The Future
Motivation The Semantic Web should: Facilitate learning from the Web. Facilitate reuse of learning outcomes. Hypothesis :  Learning from data annotated with semantic  mark-up should outperform learning from  traditional (HTML) Web. Goals: The learned user model should be expressed in a  Semantic Web Language. Such a learned model should be re-usable across  domains and applications.
Part I - Preliminary Experiments Compare performance of learning from plain text and from semantic meta-data. Using traditional ML algorithms as baseline approach: Naïve Bayes K-Nearest Neighbour  Explore application of more knowledge intensive approaches, such as ILP (Progol). An Empirical Investigation of Learning From the Semantic Web , Pete Edwards,  Gunnar  AA.  Grimnes  and  Alun Preece  – Presented at Semantic Web Mining Workshop at ECML/PKDD, Helsinki, 2002
Issues Datasets in a Semantic Web language were very hard to come by. We used two datasets: ITTalks (Seminars described using HTML vs. DAML+OIL). Citeseer (Full text of Academic Papers vs. BibTex converted to RDF).  How does RDF map to an instance representation suitable for learning?
Results Largely negative.  K Nearest Neighbour on plain-text had best accuracy.  …  but: 10 lines of RDF vs. 6000 words of full-text paper.  Reasons for failure: Shallow and artificial RDF. Statistical methods used. Progol results were the most interesting, although it generated lots of non-compressing rules. % Classifying Machine Learning papers: inClass(A,’ML’) :-  publisher(A,'Morgan Kaufmann'),  booktitleword(A,learning).
Part II Agentcities @ Aberdeen  &  GraniteNights
GraniteNights  Raison d’être:  Agentcities Agent Technology Competition. Need a Semantic Web framework for learning user models. Bring together different people/research areas in the department: agents, learning, scheduling, constraints, etc.  Proof that RDF is usable! GraniteNights  - A Multi-Agent Visit Scheduler Utilising Semantic Web Technology ,  Gunnar  AA.  Grimnes , Stuart  Chalmers , Pete Edwards and  Alun Preece Accepted for CIA2003
GraniteNights - Example
GraniteNights - Architecture
Query By Example RDQL too complicated to write by hand. Query by example is very intuitive. Internal conversion to RDQL. Could be “smarter” than RDQL. <q: Query > <q: template > <akt: Academic > <akt: family-name > Brown </akt: family-name > </akt: Academic > </q: template > </q: Query > SELECT  ?x   WHERE ( ?x ,  ?y ,  ?z ),  (  ?x , < rdf # type >, < akt # Academic > ),  (  ?x , < akt # family-name >, &quot; Brown &quot; )
QbEx with constraints <q: Query > <q: template > <r: R estaurant > <r: type  rdf: resource =“ r#Tandoori &quot; /> <r: open-time > <cif: V ariable  rdf: ID =&quot; x &quot;> <cif: varname > x </cif: varname > </cif: V ariable > </r: open-time > </r: R estaurant > </q: template > <q: constraints > <cif: C omparison > <cif: comparison O perator > &gt; </cif: comparison O perator > <cif: comparison O p1 > <cif: V ariable  rdf: about =&quot; #x &quot;/> </cif: comparison O p1 > <cif: comparison O p2 > <cif: I ntegerconst > <cif: constant V alue > 1900 </cif: constant V alue >  .. . . Give me a Restaurant that is open after X,  where X > 1900. i.e. Give me a restaurant open after 7 pm.
GraniteNights Profiling <ep: User  rdf: about =“ profileagent#gunnar ”    ep: name =“ gunnar ” ep: pword =“ **** ”>  <ep: preference > <q: Query > <q: template > <pub: EnglishPub >  <pub: servesBeer  rdf: resource =“ #flowers ”/> </pub: EnglishPub > ... <ep: interactions >  <rdf: Seq ><rdf: li >  <ep: Interaction  ep: timestamp =“ 20030508T135013 ”> <ep: pref >  <q: Query > <q: template > <pub: EnglishPub >  <pub: servesBeer  rdf: resource =“ #flowers ”/> </pub: EnglishPub > ... <pub: EnglishPub > <pub: servesBeer  rdf: resource =“ #hobgoblin ”/> ... <pub: EnglishPub >  <pub: servesBeer  rdf: resource =“ #flowers ”/> ...
GraniteNights Profiling II  Current implementation:  Most frequently specified constraint.  Possible improvements:  Super/Sub-class inference in the ontology, i.e.  Flowers  and  Hobgoblin  are both sub-classes of  Real Ale .  Combination of constraints important,  i.e.Pete likes  Lager  when eating  Curry , but  Ale  for his occasional pub-visit. Requires more sophisticated techniques than counting.
The Future – RDF Issues Must strike a balance: RDF purely for representation: Not very interoperable.  Not very ground-breaking. Using RDF data-model: RDF expressive power is limited. Extend using Rule-ML? OWL? Lots of unsolved RDF problems for free: Anonymous nodes / graph isomorphism Software support is immature. Inference
The Future – Problems Future work on GraniteNights:  Real users for real user modelling:  Substantial software engineering efforts required to make a system that attracts users. Simulating user interactions is an option.  Adding features: Reviews / Recommendations (This pub has good guinness)
The Future – Plans Near future plans: Further empirical investigations:  Inductive logic programming contd. Case-based reasoning. User modelling in a broader scope: User roles, commitments etc.
Questions ?
Rule-ML Example <rule: Imp ><rule: head > <rule: Atom > <rule: rel  rdf: resource  =“ ~ggrimnes/dev/exp#inClass ”/> <rule: atomArg > <rule: Var  rdf: about =“ #A ”> <rule: varName > A </rule: varName > </rule: Var > </rule: atomArg > <rule: atomArg > ML </rule: atomArg > </rule: Atom > </rule: head > <rule: body > <rule: And > <rule: arg > <rule: Atom > <rule: rel  rdf: resource =“ ~ggrimnes/dev/exp#booktitleword ”/> <rule: atomArg > <rule: Var  rdf: about =“ #A ”> <rule: varName > A </rule: varName > </rule: Var > </rule: atomArg > <rule: atomArg > learning </rule: atomArg > </rule: Atom > </rule: arg > <rule: arg > <rule: Atom > <rule: rel  rdf: resource =“ ~ggrimnes/dev/exp#publisher# ”/> <rule: atomArg > <rule: Var  rdf:about=“ #A ”> <rule: varName > A </rule: varName > </rule: Var > </rule: atomArg > <rule: atomArg > Morgan Kaufman </rule: atomArg >   </rule: Atom > </rule: arg > </rule: And > </rule: body > </rule: Imp >
Agentcities & the Evening Scenario EU funded – 5 th  F.W. In Aberdeen since January’02. WeatherAgent online since February’02. Evening Scenario City Nodes Tourist Information Recommendations

More Related Content

PPT
Learning Knowledge Rich User Models from the Semantic Web
PPTX
PhD Projects in Python With Source Code
PDF
Learning Commonalities in RDF and SPARQL
PDF
Summarization and Abstraction using deep learning
PDF
The Magic of Text Summarization using Deep Networks
PPTX
The Magic of Image processing using Neural Networks
PPTX
The magic of machine translation 20 july 2017
PPT
XML Training Presentation
Learning Knowledge Rich User Models from the Semantic Web
PhD Projects in Python With Source Code
Learning Commonalities in RDF and SPARQL
Summarization and Abstraction using deep learning
The Magic of Text Summarization using Deep Networks
The Magic of Image processing using Neural Networks
The magic of machine translation 20 july 2017
XML Training Presentation

Viewers also liked (13)

DOC
Introducing Evolutionary Component Capabilities in Agent ...
DOCX
Table of Contents
PDF
MACHINE LEARNING METHODS FOR THE
DOCX
June 16, 2008
PPTX
pptx - Psuedo Random Generator for Halfspaces
DOCX
บทคัดย่อ
DOC
BALANCING BOARD MACHINES
PDF
EL MODELO DE NEGOCIO DE YOUTUBE
DOCX
Chua Benh Dau Khop Goi
DOCX
Resume_Akhilesh
PDF
Ativa Brochure (EN) - Scrollable
PDF
resume
DOC
Ceyhun Niftəliyev-IS ANKETI
Introducing Evolutionary Component Capabilities in Agent ...
Table of Contents
MACHINE LEARNING METHODS FOR THE
June 16, 2008
pptx - Psuedo Random Generator for Halfspaces
บทคัดย่อ
BALANCING BOARD MACHINES
EL MODELO DE NEGOCIO DE YOUTUBE
Chua Benh Dau Khop Goi
Resume_Akhilesh
Ativa Brochure (EN) - Scrollable
resume
Ceyhun Niftəliyev-IS ANKETI
Ad

Similar to UM03 - Learning Know.. (20)

PPTX
Knowledge Graph Introduction
PPT
Web Data Extraction Como2010
PDF
Enabling Citizen-empowered Apps over Linked Data
PPT
A Semantic Multimedia Web (Part 2)
PDF
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
PPTX
Semantic Web and Related Work at W3C
PPT
Aidan's PhD Viva
ODP
Gist od2-feb-2011
PDF
Fosdem 2013 petra selmer flexible querying of graph data
PPTX
Inferring Web Citations using Social Data and SPARQL Rules
PPTX
Knowledge Representation, Semantic Web
PPTX
A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...
PPTX
A year on the Semantic Web @ W3C
PPTX
Knowledge mangement
PPTX
20100614 ISWSA Keynote
PPTX
RDF-Gen: Generating RDF from streaming and archival data
PDF
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
PDF
Citizen-centric Linked Data Services for Smarter Cities
ODP
Machine Learning & Embeddings for Large Knowledge Graphs
PPT
What to do when one size does not fit all?!
Knowledge Graph Introduction
Web Data Extraction Como2010
Enabling Citizen-empowered Apps over Linked Data
A Semantic Multimedia Web (Part 2)
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Semantic Web and Related Work at W3C
Aidan's PhD Viva
Gist od2-feb-2011
Fosdem 2013 petra selmer flexible querying of graph data
Inferring Web Citations using Social Data and SPARQL Rules
Knowledge Representation, Semantic Web
A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...
A year on the Semantic Web @ W3C
Knowledge mangement
20100614 ISWSA Keynote
RDF-Gen: Generating RDF from streaming and archival data
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
Citizen-centric Linked Data Services for Smarter Cities
Machine Learning & Embeddings for Large Knowledge Graphs
What to do when one size does not fit all?!
Ad

More from butest (20)

DOC
1. MPEG I.B.P frame之不同
PDF
LESSONS FROM THE MICHAEL JACKSON TRIAL
PPT
Timeline: The Life of Michael Jackson
DOCX
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
PDF
LESSONS FROM THE MICHAEL JACKSON TRIAL
PPTX
Com 380, Summer II
PPT
PPT
DOCX
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
DOC
MICHAEL JACKSON.doc
PPTX
Social Networks: Twitter Facebook SL - Slide 1
PPT
Facebook
DOCX
Executive Summary Hare Chevrolet is a General Motors dealership ...
DOC
Welcome to the Dougherty County Public Library's Facebook and ...
DOC
NEWS ANNOUNCEMENT
DOC
C-2100 Ultra Zoom.doc
DOC
MAC Printing on ITS Printers.doc.doc
DOC
Mac OS X Guide.doc
DOC
hier
DOC
WEB DESIGN!
DOC
Download
1. MPEG I.B.P frame之不同
LESSONS FROM THE MICHAEL JACKSON TRIAL
Timeline: The Life of Michael Jackson
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
LESSONS FROM THE MICHAEL JACKSON TRIAL
Com 380, Summer II
PPT
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
MICHAEL JACKSON.doc
Social Networks: Twitter Facebook SL - Slide 1
Facebook
Executive Summary Hare Chevrolet is a General Motors dealership ...
Welcome to the Dougherty County Public Library's Facebook and ...
NEWS ANNOUNCEMENT
C-2100 Ultra Zoom.doc
MAC Printing on ITS Printers.doc.doc
Mac OS X Guide.doc
hier
WEB DESIGN!
Download

UM03 - Learning Know..

  • 1. Learning Knowledge Rich User Models from the Semantic Web Gunnar Aastrand Grimnes User Modeling 2003 – Doctoral Consortium 24th June, 2003
  • 2. Presentation Overview Motivation Part I - Preliminary Experiments Part II - Agentcities & GraniteNights The Future
  • 3. Motivation The Semantic Web should: Facilitate learning from the Web. Facilitate reuse of learning outcomes. Hypothesis : Learning from data annotated with semantic mark-up should outperform learning from traditional (HTML) Web. Goals: The learned user model should be expressed in a Semantic Web Language. Such a learned model should be re-usable across domains and applications.
  • 4. Part I - Preliminary Experiments Compare performance of learning from plain text and from semantic meta-data. Using traditional ML algorithms as baseline approach: Naïve Bayes K-Nearest Neighbour Explore application of more knowledge intensive approaches, such as ILP (Progol). An Empirical Investigation of Learning From the Semantic Web , Pete Edwards, Gunnar AA. Grimnes and Alun Preece – Presented at Semantic Web Mining Workshop at ECML/PKDD, Helsinki, 2002
  • 5. Issues Datasets in a Semantic Web language were very hard to come by. We used two datasets: ITTalks (Seminars described using HTML vs. DAML+OIL). Citeseer (Full text of Academic Papers vs. BibTex converted to RDF). How does RDF map to an instance representation suitable for learning?
  • 6. Results Largely negative. K Nearest Neighbour on plain-text had best accuracy. … but: 10 lines of RDF vs. 6000 words of full-text paper. Reasons for failure: Shallow and artificial RDF. Statistical methods used. Progol results were the most interesting, although it generated lots of non-compressing rules. % Classifying Machine Learning papers: inClass(A,’ML’) :- publisher(A,'Morgan Kaufmann'), booktitleword(A,learning).
  • 7. Part II Agentcities @ Aberdeen & GraniteNights
  • 8. GraniteNights Raison d’être: Agentcities Agent Technology Competition. Need a Semantic Web framework for learning user models. Bring together different people/research areas in the department: agents, learning, scheduling, constraints, etc. Proof that RDF is usable! GraniteNights - A Multi-Agent Visit Scheduler Utilising Semantic Web Technology , Gunnar AA. Grimnes , Stuart Chalmers , Pete Edwards and Alun Preece Accepted for CIA2003
  • 11. Query By Example RDQL too complicated to write by hand. Query by example is very intuitive. Internal conversion to RDQL. Could be “smarter” than RDQL. <q: Query > <q: template > <akt: Academic > <akt: family-name > Brown </akt: family-name > </akt: Academic > </q: template > </q: Query > SELECT ?x WHERE ( ?x , ?y , ?z ), ( ?x , < rdf # type >, < akt # Academic > ), ( ?x , < akt # family-name >, &quot; Brown &quot; )
  • 12. QbEx with constraints <q: Query > <q: template > <r: R estaurant > <r: type rdf: resource =“ r#Tandoori &quot; /> <r: open-time > <cif: V ariable rdf: ID =&quot; x &quot;> <cif: varname > x </cif: varname > </cif: V ariable > </r: open-time > </r: R estaurant > </q: template > <q: constraints > <cif: C omparison > <cif: comparison O perator > &gt; </cif: comparison O perator > <cif: comparison O p1 > <cif: V ariable rdf: about =&quot; #x &quot;/> </cif: comparison O p1 > <cif: comparison O p2 > <cif: I ntegerconst > <cif: constant V alue > 1900 </cif: constant V alue > .. . . Give me a Restaurant that is open after X, where X > 1900. i.e. Give me a restaurant open after 7 pm.
  • 13. GraniteNights Profiling <ep: User rdf: about =“ profileagent#gunnar ” ep: name =“ gunnar ” ep: pword =“ **** ”> <ep: preference > <q: Query > <q: template > <pub: EnglishPub > <pub: servesBeer rdf: resource =“ #flowers ”/> </pub: EnglishPub > ... <ep: interactions > <rdf: Seq ><rdf: li > <ep: Interaction ep: timestamp =“ 20030508T135013 ”> <ep: pref > <q: Query > <q: template > <pub: EnglishPub > <pub: servesBeer rdf: resource =“ #flowers ”/> </pub: EnglishPub > ... <pub: EnglishPub > <pub: servesBeer rdf: resource =“ #hobgoblin ”/> ... <pub: EnglishPub > <pub: servesBeer rdf: resource =“ #flowers ”/> ...
  • 14. GraniteNights Profiling II Current implementation: Most frequently specified constraint. Possible improvements: Super/Sub-class inference in the ontology, i.e. Flowers and Hobgoblin are both sub-classes of Real Ale . Combination of constraints important, i.e.Pete likes Lager when eating Curry , but Ale for his occasional pub-visit. Requires more sophisticated techniques than counting.
  • 15. The Future – RDF Issues Must strike a balance: RDF purely for representation: Not very interoperable. Not very ground-breaking. Using RDF data-model: RDF expressive power is limited. Extend using Rule-ML? OWL? Lots of unsolved RDF problems for free: Anonymous nodes / graph isomorphism Software support is immature. Inference
  • 16. The Future – Problems Future work on GraniteNights: Real users for real user modelling: Substantial software engineering efforts required to make a system that attracts users. Simulating user interactions is an option. Adding features: Reviews / Recommendations (This pub has good guinness)
  • 17. The Future – Plans Near future plans: Further empirical investigations: Inductive logic programming contd. Case-based reasoning. User modelling in a broader scope: User roles, commitments etc.
  • 19. Rule-ML Example <rule: Imp ><rule: head > <rule: Atom > <rule: rel rdf: resource =“ ~ggrimnes/dev/exp#inClass ”/> <rule: atomArg > <rule: Var rdf: about =“ #A ”> <rule: varName > A </rule: varName > </rule: Var > </rule: atomArg > <rule: atomArg > ML </rule: atomArg > </rule: Atom > </rule: head > <rule: body > <rule: And > <rule: arg > <rule: Atom > <rule: rel rdf: resource =“ ~ggrimnes/dev/exp#booktitleword ”/> <rule: atomArg > <rule: Var rdf: about =“ #A ”> <rule: varName > A </rule: varName > </rule: Var > </rule: atomArg > <rule: atomArg > learning </rule: atomArg > </rule: Atom > </rule: arg > <rule: arg > <rule: Atom > <rule: rel rdf: resource =“ ~ggrimnes/dev/exp#publisher# ”/> <rule: atomArg > <rule: Var rdf:about=“ #A ”> <rule: varName > A </rule: varName > </rule: Var > </rule: atomArg > <rule: atomArg > Morgan Kaufman </rule: atomArg > </rule: Atom > </rule: arg > </rule: And > </rule: body > </rule: Imp >
  • 20. Agentcities & the Evening Scenario EU funded – 5 th F.W. In Aberdeen since January’02. WeatherAgent online since February’02. Evening Scenario City Nodes Tourist Information Recommendations