SlideShare a Scribd company logo
Pascal Christoph



Catalog enrichment à la
Linked Open Data


  SWIB12, Cologne, 2012-12-26
  Workshop: Introduction to Linked Open Data
License
2




    This presentation – inclusive the graphics made by the author, are licensed CC0:
    https://creativecommons.org/about/cc0

    Pictures from http://www.istockphoto.com/ at slides 5, 7, 8 and 41 are licensed CC-BY-ND:
    http://creativecommons.org/licenses/by-nd/3.0/de/

    Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-
    cloud.net/




     Christoph - Catalog enrichment à la Linked Open Data                            2012-12-26
Overview
3




       Catalog enrichment
          Definition

          Technique

          Matching

          Linking

       Implementation demo
       Conclusion

    Christoph - Catalog enrichment à la Linked Open Data     2012-12-26
Overview
4




       Catalog enrichment
          Definition

          Technique

          Matching

          Linking

       Implementation demo
       Conclusion

    Christoph - Catalog enrichment à la Linked Open Data     2012-12-26
Catalog enrichment ?
Catalog enrichment: definition
6



       Any addendum to the records:
          linksto fulltexts/webpages/...
          subjects, tags, recensions

          covers

          ...

       The source of the addendum does not matter
        (users, libraries, companies...)
       New features: only indirect

    Christoph - Catalog enrichment à à la Linked Open Data
                Kataloganreicherung la Linked Open Data      24.05.2012
                                                              2012-12-26
                                                              2012-09-27
„INSTANT GRATIFICATION“
Swib12 workshop lod_beginners
Overview
9




       Catalog enrichment
          Definition

          Technique

          Matching

          Linking

       Implementation demo
       Conclusion

    Christoph - Catalog enrichment à la Linked Open Data     2012-12-26
Catalog enrichment: methods
10




                                                               Sourtce of the pictures :http://findicons.com/about


       database vs.                                           mashup
     Christoph - Catalog enrichment à à la Linked Open Data
                 Kataloganreicherung la Linked Open Data                                 24.05.2012
                                                                                          2012-12-26
                                                                                          2012-09-27
methods
11




     locale DB:                                               dynamic mashup:
     + elaborated combination of the                          + data always up-to-date
     data
                                                              + relatively easy to integrate the data
     + data can be used to search and
     browse and other features                                - needs (performant) API
     - continously high effort to                             - no search etc.
     integrate the data




     Christoph - Catalog enrichment à à la Linked Open Data
                 Kataloganreicherung la Linked Open Data                           24.05.2012
                                                                                    2012-12-26
                                                                                    2012-09-27
infrastructure
12




     RDF based storing with SPARQL endpoint:
        Easy to add data
        Open to be used by customer
        Self-describing data
        SPARQL is a (too?) powerful API



     Christoph - Catalog enrichment à à la Linked Open Data
                 Kataloganreicherung la Linked Open Data             24.05.2012
                                                                      2012-12-26
Overview
13




        Catalog enrichment
           Definition

           Technique

           Matching

           Linking

        Implementation demo
        Conclusion

     Christoph - Catalog enrichment à la Linked Open Data     2012-12-26
14




     Source of the picture: http://www.flickr.com/photos/jhsum-commons/4419490136/
lobid.org
15


        triple store with SPARQL Endpoint: 4store
        open data from the hbz union catalog
        16 M records <=> 1 B Triple
        links to:
• 5.500 Projekt Gutenberg                                     • 1.250.000 Open Library
• 12.000 DBpedia                                              • 700.000 ZDB
• 70.000 b3kat                                                • 800.000 LOC Iso-639-2
• 200.000 Dewey Decimal Class.                                • 22.000.000 gnd authority file
• 270.000 DNB Nationalbiografie                               • 32.000.000 lobid-organisations
• 420.000 OCLC


     Christoph - Catalog enrichment à à la Linked Open Data
                 Kataloganreicherung la Linked Open Data                       24.05.2012
                                                                                2012-12-26
                                                                                2012-09-27
Software
16



        Silk
        Culturegraph
        Google-refine
        Hadoop
        ...




     Christoph - Catalog-enrichment à à la Linkedmit LOD
     Jansen / Christoph KataloganreicherungOpen Data
                 Kataloganreicherung la Linked Open Data     24.05.2012
                                                              2012-12-26
                                                              2012-09-27
Matching algorithms
17



        depending on the data
           Interestingdata reside „elsewhere“
           => other cataloging rules

          DBpedia example:
           Creator, ISBN etc. are often missing => only title
           constraints:
               german  DBpedia
               category:Literarisches_Werk ,
                category:Lexikon,_Enzyklopädie

     Christoph - Catalog enrichment à à la Linked Open Data
                 Kataloganreicherung la Linked Open Data      24.05.2012
                                                               2012-12-26
                                                               2012-09-27
Problem: disambiguation
18



        matching is to blurry
        Post processing:
          Allow only bundle with same creator




     Christoph - Catalog-enrichment à à la Linkedmit LOD
     Jansen / Christoph KataloganreicherungOpen Data
                 Kataloganreicherung la Linked Open Data   24.05.2012
                                                            2012-12-26
                                                            2012-09-27
Bundle having the same creator
19




     Christoph - Catalog-enrichment à à la Linkedmit LOD
     Jansen / Christoph KataloganreicherungOpen Data
                 Kataloganreicherung la Linked Open Data   24.05.2012
                                                            2012-12-26
                                                            2012-09-27
Bundle having different creators
20




     Christoph - Catalog-enrichment à à la Linkedmit LOD
     Jansen / Christoph KataloganreicherungOpen Data
                 Kataloganreicherung la Linked Open Data   24.05.2012
                                                            2012-12-26
                                                            2012-09-27
LOW-HANGING
     FRUIT
Kai Schreiber, „Reiche Ernte” 7. August 2005 via Flickr CC BY-SA 2.0
Overview
22




        Catalog enrichment
           Definition

           Technique

           Matching

           Linking

        Implementation demo
        Conclusion

     Christoph - Catalog enrichment à la Linked Open Data     2012-12-26
triplification
23



        Find predicates or mint them yourself
           rdrel:workManifested

           =>      Triple:
             <lobid-resource> <rdrel:workManifested> <dbpedia-resource>




     Christoph - Catalog-enrichment à à la Linkedmit LOD
     Jansen / Christoph KataloganreicherungOpen Data
                 Kataloganreicherung la Linked Open Data         24.05.2012
                                                                  2012-12-26
                                                                  2012-09-27
indexing
24



        What is the license ?
        Import triples into the SPARQL-Endpoint
          own „named graph“ has advantages:
               Easilyremovable/changeable
               Provenience is stored
               Query specific named graphs




     Christoph - Catalog-enrichment à à la Linkedmit LOD
     Jansen / Christoph KataloganreicherungOpen Data
                 Kataloganreicherung la Linked Open Data     24.05.2012
                                                              2012-12-26
                                                              2012-09-27
Named Graphs
25




     Christoph - Catalog-enrichment à à la Linkedmit LOD
     Jansen / Christoph KataloganreicherungOpen Data
                 Kataloganreicherung la Linked Open Data          24.05.2012
                                                                   2012-12-26
                                                                   2012-09-27
What we achieved
26



        12.000 „sure“ links to 4.000 DBpedia
         resources => 4.000 new „Work“-levels (21.000
         discared links)
          average size of a bundle: 3

        links to freebase: 3.000
        0.1 % enrichment




     Christoph - Kataloganreicherung à la Linkedmit LOD
     Jansen / Christoph -enrichment à la Linked Open Data
                 Catalog Kataloganreicherung Open Data           24.05.2012
                                                                  2012-09-27
                                                                  2012-12-26
What we achieved
27



        5.500 links zu 400 Project Gutenberg
         ressources (fulltexts in differnet formats)
          => 0.05% enrichment



        1.200.000 links to the work level of the Open
         Library
          => 12.5% enrichment




     Christoph - Kataloganreicherung à la Linkedmit LOD
     Jansen / Christoph -enrichment à la Linked Open Data
                 Catalog Kataloganreicherung Open Data           24.05.2012
                                                                  2012-09-27
                                                                  2012-12-26
What we achieved
28




     Sir Tim Berners Lee:




                                                   Source of picture: http://www.w3.org/DesignIssues/LinkedData.html




      Christoph - Catalog enrichment à à la Linked Open Data
                  Kataloganreicherung la Linked Open Data                                       2012-12-26
                                                                                                2012-09-27
LOW-HANGING
    FRUIT
Kai Schreiber, „Reiche Ernte” 7. August 2005 via Flickr CC BY-SA 2.0
What we achieved
30




                                    DBpedia example:

                  „Die Heilige Johanna der Schlachthöfe“




     Christoph - Kataloganreicherung à la Linkedmit LOD
     Jansen / Christoph -enrichment à la Linked Open Data
                 Catalog Kataloganreicherung Open Data           24.05.2012
                                                                  2012-09-27
                                                                  2012-12-26
Swib12 workshop lod_beginners
Swib12 workshop lod_beginners
Swib12 workshop lod_beginners
What we achieved
34




                              Open Library example:

                             „With reference to reference“




     Christoph - Kataloganreicherung à la Linkedmit LOD
     Jansen / Christoph -enrichment à la Linked Open Data
                 Catalog Kataloganreicherung Open Data           24.05.2012
                                                                  2012-09-27
                                                                  2012-12-26
Swib12 workshop lod_beginners
Linking Example: LODUM
36




     Christoph - Catalog enrichment à à la Linked Open Data
                 Kataloganreicherung la Linked Open Data      24.05.2012
                                                               2012-12-26
                                                               2012-09-27
Integration into the catalog
37



        What is allowed ?
        What should be integrated, what not?
        Human readable presentation of the
         links/URIs
        (some) data should be indexed locally (e. g. to
         be able to search)
        ...


     Christoph - Kataloganreicherung à la Linkedmit LOD
     Jansen / Christoph -enrichment à la Linked Open Data
                 Catalog Kataloganreicherung Open Data      24.05.2012
                                                             2012-09-27
                                                             2012-12-26
Overview
38




        Catalog enrichment
           Definition

           Technique

           Matching

           Linking

        Implementation demo
        Conclusion

     Christoph - Catalog enrichment à la Linked Open Data     2012-12-26
Implementation demo
39




     Christoph - Kataloganreicherung à la Linkedmit LOD
     Jansen / Christoph -enrichment à la Linked Open Data
                 Catalog Kataloganreicherung Open Data      24.05.2012
                                                             2012-09-27
                                                             2012-12-26
Implementation demo
40




     Christoph - Kataloganreicherung à la Linkedmit LOD
     Jansen / Christoph -enrichment à la Linked Open Data
                 Catalog Kataloganreicherung Open Data      24.05.2012
                                                             2012-09-27
                                                             2012-12-26
Overview
41




        Catalog enrichment
           Definition

           Technique

           Matching

           Linking

        Implementation demo
        Conclusion

     Christoph - Catalog enrichment à la Linked Open Data     2012-12-26
Swib12 workshop lod_beginners
43




     Bildquelle: http://www.flickr.com/photos/library_of_congress/4037490394/
conclusion
44




     Everything that's possible with LOD could also
     be achieved without LOD.


     It's just easier with LOD.




     Christoph - Kataloganreicherung à la Linkedmit LOD
     Jansen / Christoph -enrichment à la Linked Open Data
                 Catalog Kataloganreicherung Open Data          24.05.2012
                                                                 2012-09-27
                                                                 2012-12-26
LOD - Definition „linked“
45                           Ad astra ?
                             Addata ! ?
                             Ad astra
                             Ad data !
To boldly go where no data has gone before.

           To boldly go where no data has gone before .

           Source of the picture:http://hubblesite.org/gallery/album/star/pr2006050d
     Christoph - Kataloganreicherung à la Linked Open Data                             2012-09-27
Open source
46




                                               http://sourceforge.net/projects/culturegraph/


                           http://4store.org/



                       https://github.com/lobid/



     Silk            https://www.assembla.com/spaces/silk


     Christoph - Catalog enrichment à la Linked Open Data
47   Thank you !


          Pascal Christoph
          christoph@hbz-nrw.de

          semweb@hbz-nrw.de
48              list of references
- KiM: Empfehlungen zur Öffnung bibliothekarischer Daten
https://wiki.d-nb.de/pages/viewpage.action?pageId=45419980
- Till Kreutzer (2010): Open Data – Freigabe von Daten aus Bibliothekskatalogen
http://www.hbz-nrw.de/dokumentencenter/veroeffentlichungen/open-data-leitfaden.pdf
- Adrian Pohl (2010): Open Data im hbz-Verbund. Erschienen in: ProLibris. 3. Preprint:
http://www.hbz-nrw.de/dokumentencenter/produkte/lod/aktuell/pohl_2010_open-data.pdf
- Tim Berners Lee's talk of Open Data (2010): http://www.youtube.com/watch?v=3YcZ3Zqk0a8
- Jansen / Christoph: Dynamische Kataloganreicherung auf Basis von Linked Open Data
http://de.slideshare.net/h_jansen/dynamische-kataloganreicherung-auf-basis-von-linked-open-data
- Blog post: First results using SILK to link to DBpedia
https://wiki1.hbz-nrw.de/display/SEM/2012/05/03/First+results+using+SILK+to+link+to+DBpedia
- Blog post: 1.2 M links to Open Library
https://wiki1.hbz-nrw.de/display/SEM/2012/05/23/1.2+M+links+to+Open+Library
- Oliver Flimm (2010): LOD und die Open Library http://de.slideshare.net/flimm/lod-openlibrary20100512
- Directory of data „thedatahub“ aka CKAN: http://www.thedatahub.org/
- 49 bibliographic data sources as LODhttp://thedatahub.org/group/bibliographic?tags=lod

More Related Content

PPT
PPTX
LOD2 Webinar Series: 3rd relase of the Stack
PPTX
The Semantic Data Web, Sören Auer, University of Leipzig
PDF
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
PDF
Interactive exploration of complex relational data sets in a web - SemWeb.Pro...
ODP
Lod2 review meeting
LOD2 Webinar Series: 3rd relase of the Stack
The Semantic Data Web, Sören Auer, University of Leipzig
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
Interactive exploration of complex relational data sets in a web - SemWeb.Pro...
Lod2 review meeting

What's hot (20)

PDF
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
PDF
Wed roman tut_open_datapub
PPTX
OpenAIRE and the Case of Irish Repositories
PDF
Linguistic Linked Open Data, Challenges, Approaches, Future Work
PPT
LOD2 Webinar Series: D2R and Sparqlify
PDF
DBpedia Tutorial - Feb 2015, Dublin
PDF
ROI in Linking Content to CRM by Applying the Linked Data Stack
PPTX
Linked data life cycles
ODP
Data Integration And Visualization
PPT
Migrating from HDF5 1.6 to 1.8
PDF
Data 2 Documents: Modular and Distributive Content Management in RDF
PDF
Ivan Herman - Semantic Web Activities @ W3C
PPTX
Marklogic and the Linked Data Connection
PDF
Nicoletta Fornara and Fabio Marfia | Modeling and Enforcing Access Control Ob...
PDF
Scalability 09262012
ODT
Catalog enrichment: importing Dewey Decimal Classification from external sour...
PDF
LDP-DL: A language to define the design of Linked Data Platforms
PPTX
Scaling up Linked Data
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
Wed roman tut_open_datapub
OpenAIRE and the Case of Irish Repositories
Linguistic Linked Open Data, Challenges, Approaches, Future Work
LOD2 Webinar Series: D2R and Sparqlify
DBpedia Tutorial - Feb 2015, Dublin
ROI in Linking Content to CRM by Applying the Linked Data Stack
Linked data life cycles
Data Integration And Visualization
Migrating from HDF5 1.6 to 1.8
Data 2 Documents: Modular and Distributive Content Management in RDF
Ivan Herman - Semantic Web Activities @ W3C
Marklogic and the Linked Data Connection
Nicoletta Fornara and Fabio Marfia | Modeling and Enforcing Access Control Ob...
Scalability 09262012
Catalog enrichment: importing Dewey Decimal Classification from external sour...
LDP-DL: A language to define the design of Linked Data Platforms
Scaling up Linked Data
Ad

Similar to Swib12 workshop lod_beginners (20)

PPT
OCLC Linked Data Roundtable event IFLA 2012
PDF
Linked Open Library Data @hbz
PPT
Of Cataloging & Context
PPTX
Comet project
PDF
Linked data radical change
PPT
Developments in catalogues and data sharing
PPT
Slash n: Tech Talk Track 1 – Art and Science of Cataloguing - Utkarsh
PPT
Tutorial
PDF
Linked Data Basics
PDF
20110728 datalift-rpi-troy
PDF
Aaai2012
PPTX
Semantic Web and Related Work at W3C
PDF
Linked Data and OCLC
PDF
Linked data - A radical change?
PDF
Planetdata simpda
PDF
PlanetData: Consuming Structured Data at Web Scale
PDF
PDF
Finding Data Sets
PDF
Sharing data on the web (2013)
PPTX
NCompass Live: Linked Data and Libraries: What? Why? How?
OCLC Linked Data Roundtable event IFLA 2012
Linked Open Library Data @hbz
Of Cataloging & Context
Comet project
Linked data radical change
Developments in catalogues and data sharing
Slash n: Tech Talk Track 1 – Art and Science of Cataloguing - Utkarsh
Tutorial
Linked Data Basics
20110728 datalift-rpi-troy
Aaai2012
Semantic Web and Related Work at W3C
Linked Data and OCLC
Linked data - A radical change?
Planetdata simpda
PlanetData: Consuming Structured Data at Web Scale
Finding Data Sets
Sharing data on the web (2013)
NCompass Live: Linked Data and Libraries: What? Why? How?
Ad

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPTX
A Presentation on Artificial Intelligence
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPTX
Machine Learning_overview_presentation.pptx
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPT
Teaching material agriculture food technology
PDF
Approach and Philosophy of On baking technology
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Mushroom cultivation and it's methods.pdf
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Machine learning based COVID-19 study performance prediction
NewMind AI Weekly Chronicles - August'25-Week II
A Presentation on Artificial Intelligence
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
MIND Revenue Release Quarter 2 2025 Press Release
Group 1 Presentation -Planning and Decision Making .pptx
Machine Learning_overview_presentation.pptx
Univ-Connecticut-ChatGPT-Presentaion.pdf
Assigned Numbers - 2025 - Bluetooth® Document
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Teaching material agriculture food technology
Approach and Philosophy of On baking technology
cloud_computing_Infrastucture_as_cloud_p
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Mushroom cultivation and it's methods.pdf
TLE Review Electricity (Electricity).pptx
Spectral efficient network and resource selection model in 5G networks
gpt5_lecture_notes_comprehensive_20250812015547.pdf

Swib12 workshop lod_beginners

  • 1. Pascal Christoph Catalog enrichment à la Linked Open Data SWIB12, Cologne, 2012-12-26 Workshop: Introduction to Linked Open Data
  • 2. License 2 This presentation – inclusive the graphics made by the author, are licensed CC0: https://creativecommons.org/about/cc0 Pictures from http://www.istockphoto.com/ at slides 5, 7, 8 and 41 are licensed CC-BY-ND: http://creativecommons.org/licenses/by-nd/3.0/de/ Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod- cloud.net/ Christoph - Catalog enrichment à la Linked Open Data 2012-12-26
  • 3. Overview 3  Catalog enrichment  Definition  Technique  Matching  Linking  Implementation demo  Conclusion Christoph - Catalog enrichment à la Linked Open Data 2012-12-26
  • 4. Overview 4  Catalog enrichment  Definition  Technique  Matching  Linking  Implementation demo  Conclusion Christoph - Catalog enrichment à la Linked Open Data 2012-12-26
  • 6. Catalog enrichment: definition 6  Any addendum to the records:  linksto fulltexts/webpages/...  subjects, tags, recensions  covers  ...  The source of the addendum does not matter (users, libraries, companies...)  New features: only indirect Christoph - Catalog enrichment à à la Linked Open Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 9. Overview 9  Catalog enrichment  Definition  Technique  Matching  Linking  Implementation demo  Conclusion Christoph - Catalog enrichment à la Linked Open Data 2012-12-26
  • 10. Catalog enrichment: methods 10 Sourtce of the pictures :http://findicons.com/about database vs. mashup Christoph - Catalog enrichment à à la Linked Open Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 11. methods 11 locale DB: dynamic mashup: + elaborated combination of the + data always up-to-date data + relatively easy to integrate the data + data can be used to search and browse and other features - needs (performant) API - continously high effort to - no search etc. integrate the data Christoph - Catalog enrichment à à la Linked Open Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 12. infrastructure 12 RDF based storing with SPARQL endpoint:  Easy to add data  Open to be used by customer  Self-describing data  SPARQL is a (too?) powerful API Christoph - Catalog enrichment à à la Linked Open Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26
  • 13. Overview 13  Catalog enrichment  Definition  Technique  Matching  Linking  Implementation demo  Conclusion Christoph - Catalog enrichment à la Linked Open Data 2012-12-26
  • 14. 14 Source of the picture: http://www.flickr.com/photos/jhsum-commons/4419490136/
  • 15. lobid.org 15  triple store with SPARQL Endpoint: 4store  open data from the hbz union catalog  16 M records <=> 1 B Triple  links to: • 5.500 Projekt Gutenberg • 1.250.000 Open Library • 12.000 DBpedia • 700.000 ZDB • 70.000 b3kat • 800.000 LOC Iso-639-2 • 200.000 Dewey Decimal Class. • 22.000.000 gnd authority file • 270.000 DNB Nationalbiografie • 32.000.000 lobid-organisations • 420.000 OCLC Christoph - Catalog enrichment à à la Linked Open Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 16. Software 16  Silk  Culturegraph  Google-refine  Hadoop  ... Christoph - Catalog-enrichment à à la Linkedmit LOD Jansen / Christoph KataloganreicherungOpen Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 17. Matching algorithms 17  depending on the data  Interestingdata reside „elsewhere“  => other cataloging rules  DBpedia example:  Creator, ISBN etc. are often missing => only title  constraints:  german DBpedia  category:Literarisches_Werk , category:Lexikon,_Enzyklopädie Christoph - Catalog enrichment à à la Linked Open Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 18. Problem: disambiguation 18  matching is to blurry  Post processing:  Allow only bundle with same creator Christoph - Catalog-enrichment à à la Linkedmit LOD Jansen / Christoph KataloganreicherungOpen Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 19. Bundle having the same creator 19 Christoph - Catalog-enrichment à à la Linkedmit LOD Jansen / Christoph KataloganreicherungOpen Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 20. Bundle having different creators 20 Christoph - Catalog-enrichment à à la Linkedmit LOD Jansen / Christoph KataloganreicherungOpen Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 21. LOW-HANGING FRUIT Kai Schreiber, „Reiche Ernte” 7. August 2005 via Flickr CC BY-SA 2.0
  • 22. Overview 22  Catalog enrichment  Definition  Technique  Matching  Linking  Implementation demo  Conclusion Christoph - Catalog enrichment à la Linked Open Data 2012-12-26
  • 23. triplification 23  Find predicates or mint them yourself  rdrel:workManifested  => Triple: <lobid-resource> <rdrel:workManifested> <dbpedia-resource> Christoph - Catalog-enrichment à à la Linkedmit LOD Jansen / Christoph KataloganreicherungOpen Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 24. indexing 24  What is the license ?  Import triples into the SPARQL-Endpoint  own „named graph“ has advantages:  Easilyremovable/changeable  Provenience is stored  Query specific named graphs Christoph - Catalog-enrichment à à la Linkedmit LOD Jansen / Christoph KataloganreicherungOpen Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 25. Named Graphs 25 Christoph - Catalog-enrichment à à la Linkedmit LOD Jansen / Christoph KataloganreicherungOpen Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 26. What we achieved 26  12.000 „sure“ links to 4.000 DBpedia resources => 4.000 new „Work“-levels (21.000 discared links)  average size of a bundle: 3  links to freebase: 3.000  0.1 % enrichment Christoph - Kataloganreicherung à la Linkedmit LOD Jansen / Christoph -enrichment à la Linked Open Data Catalog Kataloganreicherung Open Data 24.05.2012 2012-09-27 2012-12-26
  • 27. What we achieved 27  5.500 links zu 400 Project Gutenberg ressources (fulltexts in differnet formats)  => 0.05% enrichment  1.200.000 links to the work level of the Open Library  => 12.5% enrichment Christoph - Kataloganreicherung à la Linkedmit LOD Jansen / Christoph -enrichment à la Linked Open Data Catalog Kataloganreicherung Open Data 24.05.2012 2012-09-27 2012-12-26
  • 28. What we achieved 28 Sir Tim Berners Lee: Source of picture: http://www.w3.org/DesignIssues/LinkedData.html Christoph - Catalog enrichment à à la Linked Open Data Kataloganreicherung la Linked Open Data 2012-12-26 2012-09-27
  • 29. LOW-HANGING FRUIT Kai Schreiber, „Reiche Ernte” 7. August 2005 via Flickr CC BY-SA 2.0
  • 30. What we achieved 30 DBpedia example: „Die Heilige Johanna der Schlachthöfe“ Christoph - Kataloganreicherung à la Linkedmit LOD Jansen / Christoph -enrichment à la Linked Open Data Catalog Kataloganreicherung Open Data 24.05.2012 2012-09-27 2012-12-26
  • 34. What we achieved 34 Open Library example: „With reference to reference“ Christoph - Kataloganreicherung à la Linkedmit LOD Jansen / Christoph -enrichment à la Linked Open Data Catalog Kataloganreicherung Open Data 24.05.2012 2012-09-27 2012-12-26
  • 36. Linking Example: LODUM 36 Christoph - Catalog enrichment à à la Linked Open Data Kataloganreicherung la Linked Open Data 24.05.2012 2012-12-26 2012-09-27
  • 37. Integration into the catalog 37  What is allowed ?  What should be integrated, what not?  Human readable presentation of the links/URIs  (some) data should be indexed locally (e. g. to be able to search)  ... Christoph - Kataloganreicherung à la Linkedmit LOD Jansen / Christoph -enrichment à la Linked Open Data Catalog Kataloganreicherung Open Data 24.05.2012 2012-09-27 2012-12-26
  • 38. Overview 38  Catalog enrichment  Definition  Technique  Matching  Linking  Implementation demo  Conclusion Christoph - Catalog enrichment à la Linked Open Data 2012-12-26
  • 39. Implementation demo 39 Christoph - Kataloganreicherung à la Linkedmit LOD Jansen / Christoph -enrichment à la Linked Open Data Catalog Kataloganreicherung Open Data 24.05.2012 2012-09-27 2012-12-26
  • 40. Implementation demo 40 Christoph - Kataloganreicherung à la Linkedmit LOD Jansen / Christoph -enrichment à la Linked Open Data Catalog Kataloganreicherung Open Data 24.05.2012 2012-09-27 2012-12-26
  • 41. Overview 41  Catalog enrichment  Definition  Technique  Matching  Linking  Implementation demo  Conclusion Christoph - Catalog enrichment à la Linked Open Data 2012-12-26
  • 43. 43 Bildquelle: http://www.flickr.com/photos/library_of_congress/4037490394/
  • 44. conclusion 44 Everything that's possible with LOD could also be achieved without LOD. It's just easier with LOD. Christoph - Kataloganreicherung à la Linkedmit LOD Jansen / Christoph -enrichment à la Linked Open Data Catalog Kataloganreicherung Open Data 24.05.2012 2012-09-27 2012-12-26
  • 45. LOD - Definition „linked“ 45 Ad astra ? Addata ! ? Ad astra Ad data ! To boldly go where no data has gone before. To boldly go where no data has gone before . Source of the picture:http://hubblesite.org/gallery/album/star/pr2006050d Christoph - Kataloganreicherung à la Linked Open Data 2012-09-27
  • 46. Open source 46 http://sourceforge.net/projects/culturegraph/ http://4store.org/ https://github.com/lobid/ Silk https://www.assembla.com/spaces/silk Christoph - Catalog enrichment à la Linked Open Data
  • 47. 47 Thank you ! Pascal Christoph christoph@hbz-nrw.de semweb@hbz-nrw.de
  • 48. 48 list of references - KiM: Empfehlungen zur Öffnung bibliothekarischer Daten https://wiki.d-nb.de/pages/viewpage.action?pageId=45419980 - Till Kreutzer (2010): Open Data – Freigabe von Daten aus Bibliothekskatalogen http://www.hbz-nrw.de/dokumentencenter/veroeffentlichungen/open-data-leitfaden.pdf - Adrian Pohl (2010): Open Data im hbz-Verbund. Erschienen in: ProLibris. 3. Preprint: http://www.hbz-nrw.de/dokumentencenter/produkte/lod/aktuell/pohl_2010_open-data.pdf - Tim Berners Lee's talk of Open Data (2010): http://www.youtube.com/watch?v=3YcZ3Zqk0a8 - Jansen / Christoph: Dynamische Kataloganreicherung auf Basis von Linked Open Data http://de.slideshare.net/h_jansen/dynamische-kataloganreicherung-auf-basis-von-linked-open-data - Blog post: First results using SILK to link to DBpedia https://wiki1.hbz-nrw.de/display/SEM/2012/05/03/First+results+using+SILK+to+link+to+DBpedia - Blog post: 1.2 M links to Open Library https://wiki1.hbz-nrw.de/display/SEM/2012/05/23/1.2+M+links+to+Open+Library - Oliver Flimm (2010): LOD und die Open Library http://de.slideshare.net/flimm/lod-openlibrary20100512 - Directory of data „thedatahub“ aka CKAN: http://www.thedatahub.org/ - 49 bibliographic data sources as LODhttp://thedatahub.org/group/bibliographic?tags=lod