SlideShare a Scribd company logo
Learning Knowledge Rich
  User Models from the
     Semantic Web
  Gunnar Aastrand Grimnes


         First Year Talk

         14th May, 2003
Presentation Overview
 Motivation
 Preliminary Experiments
 Agentcities & GraniteNights
 The Future
Motivation
The Semantic Web should:
    Facilitate learning from the Web.
    Facilitate reuse of learning outcomes.
Hypothesis :
    Learning from data annotated with semantic
     mark-up should outperform learning from
     traditional (HTML) Web.
Goals:
    The learned model should be expressed in a
     Semantic Web Language.
    Such a learned model should be re-usable across
     domains and applications.
Preliminary Experiments
   Compare performance of learning from plain
   text and from semantic meta-data.
   Using traditional ML algorithms as baseline
   approach:
       Naïve Bayes
       K-Nearest Neighbour
   Explore application of more knowledge
   intensive approaches, such as ILP (Progol).
An Empirical Investigation of Learning From the Semantic Web, Pete Edwards,
Gunnar AA. Grimnes and Alun Preece – Presented at Semantic Web Mining
Workshop at ECML/PKDD, Helsinki, 2002
Issues
 Datasets in a Semantic Web language were
 very hard to come by.
 We used two datasets:
    ITTalks (Seminars described using HTML vs.
   DAML+OIL).
    Citeseer (Full text of Academic Papers vs. BibTex
   converted to RDF).
 How does RDF map to an instance
 representation suitable for learning?
Results
 Largely negative.
    K Nearest Neighbour on plain-text had best accuracy.
    … but: 10 lines of RDF vs. 6000 words of full-text paper.
 Reasons for failure:
    Shallow and artificial RDF.
    Statistical methods used.
 Progol results were the most interesting:
   % Classifying Machine Learning papers:
   inClass(A) :- publisher(A,'Morgan Kaufmann'),
                 booktitleword(A,learning).
Agentcities & the Evening Scenario
                         EU funded – 5th F.W.
                         In Aberdeen since
                         January’02.
                         WeatherAgent
                         online since
                         February’02.

                       Evening Scenario
                         City Nodes
                         Tourist Information
                         Recommendations

                       The fun has just
                         started:
                         OpenNET
GraniteNights
  Raison d’être:
       Agentcities Agent Technology Competition.
       Need a Semantic Web framework for
      learning user profiles.
       Bring together different people/research
      areas in the department: agents, learning,
      scheduling, constraints, etc.
       Proof that RDF is usable!
GraniteNights - A Multi-Agent Visit Scheduler Utilising Semantic Web Technology,
Gunnar AA. Grimnes, Stuart Chalmers, Pete Edwards and Alun Preece
Submitted to CIA2003
GraniteNights - Example
GraniteNights - Architecture
Query By Example
  RDQL too complicated to write by hand.
  Query by example is very intuitive.
  Internal conversion to RDQL.
  Could be “smarter” than RDQL.
<q:Query>               SELECT ?x WHERE (?x, ?y, ?z),
 <q:template>
  <akt:Academic>         ( ?x, <rdf # type>, <akt # Academic> ),
   <akt:family-name>     ( ?x, <akt # family-name>, "Brown" )
           Brown
   </akt:family-name>
   </akt:Academic>
  </q:template>
 </q:Query>
QbEx with constraints
<q:Query>
  <q:template>
    <r:Restaurant>
      <r:type rdf:resource=“r#Tandoori" />
        <r:open-time>
           <cif:Variable rdf:ID="x">
             <cif:varname>x</cif:varname>
           </cif:Variable>
        </r:open-time>
     </r:Restaurant>
   </q:template>
   <q:constraints>
     <cif:Comparison>
       <cif:comparisonOperator>&gt;</cif:comparisonOperator>
       <cif:comparisonOp1>
          <cif:Variable rdf:about="#x"/>
       </cif:comparisonOp1>
       <cif:comparisonOp2>
          <cif:Integerconst>
            <cif:constantValue>1900</cif:constantValue> .. . .
GraniteNights Profiling
<ep:User rdf:about=“profileagent#gunnar”
          ep:name=“gunnar” ep:pword=“****”>
  <ep:preference>
    <q:Query>
      <q:template>
        <pub:EnglishPub>
           <pub:servesBeer rdf:resource=“#flowers”/>
        </pub:EnglishPub>
...
  <ep:interactions>
    <rdf:Seq><rdf:li>
        <ep:Interaction ep:timestamp=“20030508T135013”>
          <ep:pref>
            <q:Query>
              <q:template>
                <pub:EnglishPub>
                  <pub:servesBeer rdf:resource=“#flowers”/>
                </pub:EnglishPub>
...
                <pub:EnglishPub>
                  <pub:servesBeer rdf:resource=“#hobgoblin”/>
...
                <pub:EnglishPub>
                  <pub:servesBeer rdf:resource=“#flowers”/>
...
GraniteNights Profiling II
 Current implementation:
   Most frequently specified constraint.
 Possible improvements:
    Super/Sub-class inference in the ontology,
   i.e. Flowers and Hobgoblin are both sub-
   classes of Real Ale.
    Combination of constraints important,
   i.e.Pete likes Lager when eating Curry, but
   Ale for his occasional pub-visit.
    Requires more sophisticated techniques
   than counting.
The Future
    User modelling in a broader scope:
         User roles, commitments etc.
    Learning from RDF:
         Generalisation.
         Case-based reasoning.
         RDF as model language.



Learning Knowledge Rich User Models from the Semantic Web, Gunnar AA. Grimnes
To appear in Doctoral Consortium, User Modeling 2003, Pittsburgh, July 2003.
Questions ?

More Related Content

PPT
UM03 - Learning Know..
PPT
Tourettes
PPTX
Tourettes
PPT
Tourettes
PPT
Caterpillar
PDF
Tourettes Syndrome
DOC
1. MPEG I.B.P frame之不同
PDF
EL MODELO DE NEGOCIO DE YOUTUBE
UM03 - Learning Know..
Tourettes
Tourettes
Tourettes
Caterpillar
Tourettes Syndrome
1. MPEG I.B.P frame之不同
EL MODELO DE NEGOCIO DE YOUTUBE

Similar to Learning Knowledge Rich User Models from the Semantic Web (20)

KEY
History and Background of the USEWOD Data Challenge
PPT
Exploring the Semantic Web
PPTX
Knowledge Representation, Semantic Web
PPT
Semantic Search overview at SSSW 2012
PDF
Citizen-centric Linked Data Services for Smarter Cities
PPTX
20100614 ISWSA Keynote
PDF
Semantic web, python, construction industry
PPTX
Knowledge mangement
PPT
A review of the state of the art in Machine Learning on the Semantic Web
PDF
Enabling Citizen-empowered Apps over Linked Data
PPT
Piloting Linked Data to Connect Library and Archive Resources to the New Worl...
PPT
Finding knowledge, data and answers on the Semantic Web
PDF
Some news about the SW
PDF
Building product suggestions for a BIM model based on rule sets and a semant...
PPTX
Introduction to the Semantic Web
PDF
Question answer template
PDF
SMERST 2013
PPTX
Semantic Web Application Development
PPTX
Linked Data: opportunities and challenges
PDF
Semantic web-primer
History and Background of the USEWOD Data Challenge
Exploring the Semantic Web
Knowledge Representation, Semantic Web
Semantic Search overview at SSSW 2012
Citizen-centric Linked Data Services for Smarter Cities
20100614 ISWSA Keynote
Semantic web, python, construction industry
Knowledge mangement
A review of the state of the art in Machine Learning on the Semantic Web
Enabling Citizen-empowered Apps over Linked Data
Piloting Linked Data to Connect Library and Archive Resources to the New Worl...
Finding knowledge, data and answers on the Semantic Web
Some news about the SW
Building product suggestions for a BIM model based on rule sets and a semant...
Introduction to the Semantic Web
Question answer template
SMERST 2013
Semantic Web Application Development
Linked Data: opportunities and challenges
Semantic web-primer
Ad

More from butest (20)

PDF
LESSONS FROM THE MICHAEL JACKSON TRIAL
PPT
Timeline: The Life of Michael Jackson
DOCX
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
PDF
LESSONS FROM THE MICHAEL JACKSON TRIAL
PPTX
Com 380, Summer II
PPT
PPT
DOCX
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
DOC
MICHAEL JACKSON.doc
PPTX
Social Networks: Twitter Facebook SL - Slide 1
PPT
Facebook
DOCX
Executive Summary Hare Chevrolet is a General Motors dealership ...
DOC
Welcome to the Dougherty County Public Library's Facebook and ...
DOC
NEWS ANNOUNCEMENT
DOC
C-2100 Ultra Zoom.doc
DOC
MAC Printing on ITS Printers.doc.doc
DOC
Mac OS X Guide.doc
DOC
hier
DOC
WEB DESIGN!
DOC
Download
DOC
resume.doc
LESSONS FROM THE MICHAEL JACKSON TRIAL
Timeline: The Life of Michael Jackson
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
LESSONS FROM THE MICHAEL JACKSON TRIAL
Com 380, Summer II
PPT
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
MICHAEL JACKSON.doc
Social Networks: Twitter Facebook SL - Slide 1
Facebook
Executive Summary Hare Chevrolet is a General Motors dealership ...
Welcome to the Dougherty County Public Library's Facebook and ...
NEWS ANNOUNCEMENT
C-2100 Ultra Zoom.doc
MAC Printing on ITS Printers.doc.doc
Mac OS X Guide.doc
hier
WEB DESIGN!
Download
resume.doc
Ad

Learning Knowledge Rich User Models from the Semantic Web

  • 1. Learning Knowledge Rich User Models from the Semantic Web Gunnar Aastrand Grimnes First Year Talk 14th May, 2003
  • 2. Presentation Overview Motivation Preliminary Experiments Agentcities & GraniteNights The Future
  • 3. Motivation The Semantic Web should: Facilitate learning from the Web. Facilitate reuse of learning outcomes. Hypothesis : Learning from data annotated with semantic mark-up should outperform learning from traditional (HTML) Web. Goals: The learned model should be expressed in a Semantic Web Language. Such a learned model should be re-usable across domains and applications.
  • 4. Preliminary Experiments Compare performance of learning from plain text and from semantic meta-data. Using traditional ML algorithms as baseline approach: Naïve Bayes K-Nearest Neighbour Explore application of more knowledge intensive approaches, such as ILP (Progol). An Empirical Investigation of Learning From the Semantic Web, Pete Edwards, Gunnar AA. Grimnes and Alun Preece – Presented at Semantic Web Mining Workshop at ECML/PKDD, Helsinki, 2002
  • 5. Issues Datasets in a Semantic Web language were very hard to come by. We used two datasets: ITTalks (Seminars described using HTML vs. DAML+OIL). Citeseer (Full text of Academic Papers vs. BibTex converted to RDF). How does RDF map to an instance representation suitable for learning?
  • 6. Results Largely negative. K Nearest Neighbour on plain-text had best accuracy. … but: 10 lines of RDF vs. 6000 words of full-text paper. Reasons for failure: Shallow and artificial RDF. Statistical methods used. Progol results were the most interesting: % Classifying Machine Learning papers: inClass(A) :- publisher(A,'Morgan Kaufmann'), booktitleword(A,learning).
  • 7. Agentcities & the Evening Scenario EU funded – 5th F.W. In Aberdeen since January’02. WeatherAgent online since February’02. Evening Scenario City Nodes Tourist Information Recommendations The fun has just started: OpenNET
  • 8. GraniteNights Raison d’être: Agentcities Agent Technology Competition. Need a Semantic Web framework for learning user profiles. Bring together different people/research areas in the department: agents, learning, scheduling, constraints, etc. Proof that RDF is usable! GraniteNights - A Multi-Agent Visit Scheduler Utilising Semantic Web Technology, Gunnar AA. Grimnes, Stuart Chalmers, Pete Edwards and Alun Preece Submitted to CIA2003
  • 11. Query By Example RDQL too complicated to write by hand. Query by example is very intuitive. Internal conversion to RDQL. Could be “smarter” than RDQL. <q:Query> SELECT ?x WHERE (?x, ?y, ?z), <q:template> <akt:Academic> ( ?x, <rdf # type>, <akt # Academic> ), <akt:family-name> ( ?x, <akt # family-name>, "Brown" ) Brown </akt:family-name> </akt:Academic> </q:template> </q:Query>
  • 12. QbEx with constraints <q:Query> <q:template> <r:Restaurant> <r:type rdf:resource=“r#Tandoori" /> <r:open-time> <cif:Variable rdf:ID="x"> <cif:varname>x</cif:varname> </cif:Variable> </r:open-time> </r:Restaurant> </q:template> <q:constraints> <cif:Comparison> <cif:comparisonOperator>&gt;</cif:comparisonOperator> <cif:comparisonOp1> <cif:Variable rdf:about="#x"/> </cif:comparisonOp1> <cif:comparisonOp2> <cif:Integerconst> <cif:constantValue>1900</cif:constantValue> .. . .
  • 13. GraniteNights Profiling <ep:User rdf:about=“profileagent#gunnar” ep:name=“gunnar” ep:pword=“****”> <ep:preference> <q:Query> <q:template> <pub:EnglishPub> <pub:servesBeer rdf:resource=“#flowers”/> </pub:EnglishPub> ... <ep:interactions> <rdf:Seq><rdf:li> <ep:Interaction ep:timestamp=“20030508T135013”> <ep:pref> <q:Query> <q:template> <pub:EnglishPub> <pub:servesBeer rdf:resource=“#flowers”/> </pub:EnglishPub> ... <pub:EnglishPub> <pub:servesBeer rdf:resource=“#hobgoblin”/> ... <pub:EnglishPub> <pub:servesBeer rdf:resource=“#flowers”/> ...
  • 14. GraniteNights Profiling II Current implementation: Most frequently specified constraint. Possible improvements: Super/Sub-class inference in the ontology, i.e. Flowers and Hobgoblin are both sub- classes of Real Ale. Combination of constraints important, i.e.Pete likes Lager when eating Curry, but Ale for his occasional pub-visit. Requires more sophisticated techniques than counting.
  • 15. The Future User modelling in a broader scope: User roles, commitments etc. Learning from RDF: Generalisation. Case-based reasoning. RDF as model language. Learning Knowledge Rich User Models from the Semantic Web, Gunnar AA. Grimnes To appear in Doctoral Consortium, User Modeling 2003, Pittsburgh, July 2003.