CELLAR: The Publications Office's
Semantic Repository
Marc Wilhelm Küster
Publications Office of the EU
European Policy Perspectives on Data-intensive
Agriculture & Food
Brussels, 30 September 2016
What goes into the CELLAR?
Contractors
…
Reception Validation Conversion
IMMC
CELLARMETS
ELI: European Legislation Identifier
IMMC: Standardized XML transmission envelope
METS: Metadata Encoding Standard
How are things structured in the CELLAR?
Ontologies /
Common Data Model
InstanceDataControlData
Thesauri / authority
tables
…
WORK
<Directive>
e.g. 32006L0121
Expression
FR: Directive 2006/121/CE du
Parlement européen et du Conseil
du 18 décembre 2006[…]
Expression
EN: Directive 2006/121/EC of the
European Parliament and of the
Council of 18 December 2006
amending Council Directive 67/
548/EEC[…]
Expression
EL: Οδηγία 2006/121/ΕΚ του
Ευρωπαϊκού Κοινοβουλίου και του
Συμβουλίου, της 18ης Δεκεμβρίου
2006 , για την τροποποίηση της
οδηγίας 67/548/ΕΟΚ […]
Manifestation
PDF
Manifestation
xhtml
Manifestation
PDF
Manifestation
xhtml
Manifestation
PDF
Manifestation
xhtml
SUBJECT
002897: rapprochement des
législations
AGENT
PE: European Parliament
CONSIL: Council
How can you retrieve data from CELLAR?
SPARQLDirect access /
RESTful WS
Notification
/ RSS
EUR-Lex
OP Portal
Internet
http://publications.europa.eu/webapi/rdf/sparql
http://publications.europa.eu/resource/...
 Dublin Core (core metadata)
 Linked Open Data (LOD)
 Web-friendly ("RESTful") Interface
 Resource Description Framework (RDF)
 Standard Query Language (SPARQL)
 FRBR model
 URIs:
http://publications.europa.eu/resource/
{ps-id}/{obj-id}
Eurovoc
Eurovoc
Result:: 18630 acts (as of 2016-09-29)
Eurovoc
Result: 18630 acts (as of 2016-09-29) Grape
•8 mio requests per day served on
average, peaks >20 mio
•>100k SPARQL queries / day
•> 1 mio different resources in > 10
million linguistic versions and > 28 mio
items
•> 230 million persistent identifiers
•> 1500 million triples in Oracle RDF
store
•Ca. 5000 resources treated each day
(most in 23 languages)
• Sizes:
•4 TB Oracle DB (compressed)
•Content (in Fedora repository) > 17.5
TB
•120 million files in Fedora
State: 2016-09
How much is CELLAR used?
Requests from internet / country (2016-09)
Daily requests / day (2016-09)
SPARQL requests / day (2016-09)
Attributions for reused images:
Wine CELLAR: https://flic.kr/p/pkG1QS
Photo of OWL: https://flic.kr/p/6AMV1C
http://gephi.github.io/features/
Network: https://en.wikipedia.org/wiki/Network_theory#/media/File:Internet_map_1024.jpg
https://openclipart.org/detail/169750/fileiconpdf
https://openclipart.org/detail/169753/fileiconxml
https://openclipart.org/detail/169751/fileiconhtml

More Related Content

DOCX
Letter of Recommendation
PDF
Luigi Selmi - The Big Data Integrator Platform
PDF
Josep Maria Salanova - Introduction to BDE+SC4
PDF
Rajendra Akerkar - LeMO Project
PDF
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
PDF
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
PDF
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
PDF
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Letter of Recommendation
Luigi Selmi - The Big Data Integrator Platform
Josep Maria Salanova - Introduction to BDE+SC4
Rajendra Akerkar - LeMO Project
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...

More from BigData_Europe (20)

PDF
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
PDF
BDE SC3.3 Workshop - BDE review: Scope and Opportunities
PDF
BDE SC3.3 Workshop - Agenda
PDF
BDE SC3.3 Workshop - BDE Pilot case for Wind Turbine condition monitoring re...
PDF
BDE SC3.3 Workshop - Data management in WT testing and monitoring
PDF
BDE SC3.3 Workshop - Big Data in Wind Turbine Condition Monitoring
PDF
BDE SC3.3 Workshop - BDE Platform: Technical overview
PDF
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
PDF
BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
PDF
Big Data Europe: Workshop 3 SC6 Social Science: THE IMPORTANCE OF METADATA & ...
PDF
BDE SC1 Workshop 3 - BigMedilytics Overview (Supriyo Chatterjea)
PPTX
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
PPTX
BDE SC1 Workshop 3 - MIDAS (Michaela Black)
PPTX
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
PPTX
BDE SC1 Workshop 3 - Big Data Europe (Simon Scerri)
PPTX
SC1 Hangout: Updating public databases: Automation and other challenges for c...
PDF
SC7 Webinar 5 13/12/2017 SatCen Presentation "Secure societies activities: th...
PDF
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
PDF
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
PDF
SC7 Webinar 5 13/12/2017 SatCen Presentation "The Secure Societies Community ...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
BDE SC3.3 Workshop - BDE review: Scope and Opportunities
BDE SC3.3 Workshop - Agenda
BDE SC3.3 Workshop - BDE Pilot case for Wind Turbine condition monitoring re...
BDE SC3.3 Workshop - Data management in WT testing and monitoring
BDE SC3.3 Workshop - Big Data in Wind Turbine Condition Monitoring
BDE SC3.3 Workshop - BDE Platform: Technical overview
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
Big Data Europe: Workshop 3 SC6 Social Science: THE IMPORTANCE OF METADATA & ...
BDE SC1 Workshop 3 - BigMedilytics Overview (Supriyo Chatterjea)
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
BDE SC1 Workshop 3 - MIDAS (Michaela Black)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Big Data Europe (Simon Scerri)
SC1 Hangout: Updating public databases: Automation and other challenges for c...
SC7 Webinar 5 13/12/2017 SatCen Presentation "Secure societies activities: th...
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
SC7 Webinar 5 13/12/2017 SatCen Presentation "The Secure Societies Community ...
Ad

Recently uploaded (20)

PDF
Session 11 - Data Visualization Storytelling (2).pdf
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
eGramSWARAJ-PPT Training Module for beginners
PPTX
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PPTX
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
DOCX
Factor Analysis Word Document Presentation
PPT
PROJECT CYCLE MANAGEMENT FRAMEWORK (PCM).ppt
PPTX
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
PPTX
1 hour to get there before the game is done so you don’t need a car seat for ...
PPTX
Crypto_Trading_Beginners.pptxxxxxxxxxxxxxx
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
IMPACT OF LANDSLIDE.....................
PDF
Global Data and Analytics Market Outlook Report
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PPTX
CYBER SECURITY the Next Warefare Tactics
PPT
DU, AIS, Big Data and Data Analytics.ppt
Session 11 - Data Visualization Storytelling (2).pdf
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
eGramSWARAJ-PPT Training Module for beginners
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
Factor Analysis Word Document Presentation
PROJECT CYCLE MANAGEMENT FRAMEWORK (PCM).ppt
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
1 hour to get there before the game is done so you don’t need a car seat for ...
Crypto_Trading_Beginners.pptxxxxxxxxxxxxxx
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
IMPACT OF LANDSLIDE.....................
Global Data and Analytics Market Outlook Report
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
CYBER SECURITY the Next Warefare Tactics
DU, AIS, Big Data and Data Analytics.ppt
Ad

SC2 Workshop 2: CELLAR: The Publications Office's Semantic Repository

  • 1. CELLAR: The Publications Office's Semantic Repository Marc Wilhelm Küster Publications Office of the EU European Policy Perspectives on Data-intensive Agriculture & Food Brussels, 30 September 2016
  • 2. What goes into the CELLAR? Contractors … Reception Validation Conversion IMMC CELLARMETS ELI: European Legislation Identifier IMMC: Standardized XML transmission envelope METS: Metadata Encoding Standard
  • 3. How are things structured in the CELLAR? Ontologies / Common Data Model InstanceDataControlData Thesauri / authority tables … WORK <Directive> e.g. 32006L0121 Expression FR: Directive 2006/121/CE du Parlement européen et du Conseil du 18 décembre 2006[…] Expression EN: Directive 2006/121/EC of the European Parliament and of the Council of 18 December 2006 amending Council Directive 67/ 548/EEC[…] Expression EL: Οδηγία 2006/121/ΕΚ του Ευρωπαϊκού Κοινοβουλίου και του Συμβουλίου, της 18ης Δεκεμβρίου 2006 , για την τροποποίηση της οδηγίας 67/548/ΕΟΚ […] Manifestation PDF Manifestation xhtml Manifestation PDF Manifestation xhtml Manifestation PDF Manifestation xhtml SUBJECT 002897: rapprochement des législations AGENT PE: European Parliament CONSIL: Council
  • 4. How can you retrieve data from CELLAR? SPARQLDirect access / RESTful WS Notification / RSS EUR-Lex OP Portal Internet http://publications.europa.eu/webapi/rdf/sparql http://publications.europa.eu/resource/...  Dublin Core (core metadata)  Linked Open Data (LOD)  Web-friendly ("RESTful") Interface  Resource Description Framework (RDF)  Standard Query Language (SPARQL)  FRBR model  URIs: http://publications.europa.eu/resource/ {ps-id}/{obj-id}
  • 6. Eurovoc Result:: 18630 acts (as of 2016-09-29)
  • 7. Eurovoc Result: 18630 acts (as of 2016-09-29) Grape
  • 8. •8 mio requests per day served on average, peaks >20 mio •>100k SPARQL queries / day •> 1 mio different resources in > 10 million linguistic versions and > 28 mio items •> 230 million persistent identifiers •> 1500 million triples in Oracle RDF store •Ca. 5000 resources treated each day (most in 23 languages) • Sizes: •4 TB Oracle DB (compressed) •Content (in Fedora repository) > 17.5 TB •120 million files in Fedora State: 2016-09 How much is CELLAR used? Requests from internet / country (2016-09) Daily requests / day (2016-09) SPARQL requests / day (2016-09)
  • 9. Attributions for reused images: Wine CELLAR: https://flic.kr/p/pkG1QS Photo of OWL: https://flic.kr/p/6AMV1C http://gephi.github.io/features/ Network: https://en.wikipedia.org/wiki/Network_theory#/media/File:Internet_map_1024.jpg https://openclipart.org/detail/169750/fileiconpdf https://openclipart.org/detail/169753/fileiconxml https://openclipart.org/detail/169751/fileiconhtml