The Mint Mapping tool
The MoRe aggregator
Vassilis Tzouvaras, Dimitris Gavrilis
National Technical University of Athens
Digital Curation Unit - IMIS, Athena Research Center
LoCloud is funded by the European Commission's ICT Policy Support Programme
Cultural Heritage Content
• Diversity of cultural heritage content
– Numerous metadata schemas to annotate content (LIDO, CIDOC-CRM, EAD, METS)
• Massive digitization and annotation activities are in progress
• Need for interoperability
MINT Mapping Tool
• Lets users map their own metadata schemas to reference domain models
• Follows a typical web-based architecture
• It was developed for ATHENA and is currently used for EUScreen, CARARE, Judaica, ECLAP, DCA and Linked Heritage
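As a rough illustration of what a schema mapping amounts to, the sketch below renames the fields of a provider record to target-model elements through a declarative table. The field names and the `apply_mapping` helper are hypothetical; they are not MINT's API or its mapping format.

```python
# Hypothetical mapping from a provider's own field names to target-schema
# element names (Dublin Core-style names are used purely for illustration).
MAPPING = {
    "titel": "dc:title",
    "maker": "dc:creator",
    "jaar": "dc:date",
}


def apply_mapping(record: dict, mapping: dict) -> dict:
    """Return a new record keyed by target elements; unmapped fields are dropped."""
    return {mapping[key]: value for key, value in record.items() if key in mapping}


source = {"titel": "Windmill", "maker": "Unknown", "inventory_no": "A-17"}
target = apply_mapping(source, MAPPING)
print(target)
```

Keeping the mapping as data rather than code is what lets a tool expose it for editing in a user interface.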
MINT 2 – What’s new?
• The backend was reconstructed for better performance
– The file size limit for imports was extended
• The frontend was updated
– New interface
– Workflow is integrated in the UI
– Easier browsing of input and target schemas
The Mint Mapping tool and the MoRe aggregator
MORe Overall Architecture
[Architecture diagram. Components shown: Registry; Apache Cassandra cluster; Fedora-commons; Temporary storage; Vocabulary services; Storage; JMS logging; Messaging; Core services; Enrichment service management; Entity matching / NLP; Geocoding / Historic place names; REST; External enrichment services; Publish service management; OAI-PMH; RDF Store; Elastic Search; Archive]
Cloud architecture
• De-centralized
• Scalable
• Four cloud environments
– Storage
– Monitoring & logging
– Core services deployment
– Enrichment services deployment
Distributed
• Enrichment services run in:
– Austria
– Spain
– Greece
– Lithuania
– Slovenia
– Norway
• Scalability can be facilitated through a virtualization infrastructure
Workflow
[Workflow diagram. Labels shown: OAI-PMH; LoCloud Collections; Wikimedia; MINT; Omeka; Harvest; Ingest; Transform; Enrich; Publish; Validate; Index; Delete; Reject; OAI-PMH; Archive; RDF Store; Solr]
Intermediate Schemas
Dublin Core
LIDO
CARARE
EAD
ESE
EDM
Dublin Core
LIDO
CARARE
EAD
ESE
EDM
OMEKA-XML
OGD
• Harvesting
• Validation
• Ingestion
• Transformation
• Enrichment
• Previewing
• Publishing
Core services: Harvesting
• Harvests content from metadata sources
– OAI-PMH repository
– MINT
– LoCloud Collections
– Wikimedia
• Multiple schemas are supported
– OAI_DC
– CARARE
– CARARE 2.0
– LIDO
– EAD
– EDM
– ESE
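For readers unfamiliar with OAI-PMH harvesting, the following sketch builds a `ListRecords` request URL and reads record identifiers and the resumption token from one response page. It is an illustration of the protocol, not MoRe's harvester; the endpoint `https://example.org/oai` and the sample response are made up, and no network call is made.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def list_records_url(base_url, metadata_prefix=None, resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    params = {"verb": "ListRecords"}
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument:
        # it replaces metadataPrefix on follow-up requests.
        params["resumptionToken"] = resumption_token
    else:
        params["metadataPrefix"] = metadata_prefix
    return f"{base_url}?{urlencode(params)}"


def parse_page(xml_text):
    """Return (record identifiers, resumption token or None) for one page."""
    root = ET.fromstring(xml_text)
    ids = [e.text for e in root.findall(".//oai:header/oai:identifier", OAI_NS)]
    token = root.findtext(".//oai:resumptionToken", default=None, namespaces=OAI_NS)
    return ids, (token or None)


# Made-up response page, trimmed to the elements the parser reads.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:1</identifier></header></record>
    <record><header><identifier>oai:example.org:2</identifier></header></record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

first_url = list_records_url("https://example.org/oai", metadata_prefix="oai_dc")
ids, token = parse_page(SAMPLE)
next_url = list_records_url("https://example.org/oai", resumption_token=token)
print(first_url, ids, next_url, sep="\n")
```

A harvester would repeat the request with each returned token until a page arrives without one.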
Core services: Validation
• Validates incoming information packages
• Executes validation schemes
• Validation micro-services
– Structure
– Schema
– Linking
– Schematron rules
• Flexible
• How it is used in MoRe:
– Pre-validation
– Post-validation
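One way to picture a validation scheme is as an ordered list of small validators whose errors are collected. The sketch below assumes that shape; the function names are hypothetical, and the required-element check is only a stand-in for real XML Schema or Schematron validation, which would need a dedicated XML library.

```python
import xml.etree.ElementTree as ET


def check_structure(xml_text):
    """Structure check: the package must be well-formed XML."""
    try:
        ET.fromstring(xml_text)
        return []
    except ET.ParseError as exc:
        return [f"structure: {exc}"]


def check_required(xml_text, required=("title", "identifier")):
    """Stand-in for a schema check: required child elements must be present."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return []  # already reported by the structure check
    present = {child.tag for child in root}
    return [f"schema: missing <{name}>" for name in required if name not in present]


def run_scheme(xml_text, scheme):
    """Run each validator in order and collect all reported errors."""
    errors = []
    for validator in scheme:
        errors.extend(validator(xml_text))
    return errors


SCHEME = [check_structure, check_required]
good = "<record><title>Windmill</title><identifier>A-17</identifier></record>"
bad = "<record><title>Windmill</title></record>"
print(run_scheme(good, SCHEME))
print(run_scheme(bad, SCHEME))
```

The same `run_scheme` call could serve both pre-validation (on the harvested package) and post-validation (on the transformed one), each with its own list of validators.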
Core services: Ingestion
• Ingests content into storage
• Uses the storage layer API
• Pluggable drivers for attaching different technologies / repositories
– Apache Cassandra
– Filesystem-based
– Fedora-commons
• Versioning support
• Complex digital object support
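The pluggable-driver idea can be sketched as a small storage interface with interchangeable backends. Everything here (`StorageDriver`, `MemoryDriver`, `FilesystemDriver`, `ingest`) is a hypothetical illustration, not MoRe's storage layer API; an in-memory backend stands in for a repository such as Cassandra or Fedora-commons.

```python
import tempfile
from abc import ABC, abstractmethod
from pathlib import Path


class StorageDriver(ABC):
    """Minimal storage interface that every backend implements."""

    @abstractmethod
    def put(self, object_id: str, data: bytes) -> None: ...

    @abstractmethod
    def get(self, object_id: str) -> bytes: ...


class MemoryDriver(StorageDriver):
    """In-memory backend, standing in for a remote repository."""

    def __init__(self):
        self._items = {}

    def put(self, object_id, data):
        self._items[object_id] = data

    def get(self, object_id):
        return self._items[object_id]


class FilesystemDriver(StorageDriver):
    """Filesystem backend: one file per object under a root directory."""

    def __init__(self, root):
        self._root = Path(root)
        self._root.mkdir(parents=True, exist_ok=True)

    def put(self, object_id, data):
        (self._root / object_id).write_bytes(data)

    def get(self, object_id):
        return (self._root / object_id).read_bytes()


def ingest(driver: StorageDriver, object_id: str, data: bytes) -> bytes:
    """Store a package through the interface and read it back."""
    driver.put(object_id, data)
    return driver.get(object_id)


print(ingest(MemoryDriver(), "pkg-1", b"<record/>"))
with tempfile.TemporaryDirectory() as tmp:
    print(ingest(FilesystemDriver(tmp), "pkg-1", b"<record/>"))
```

Because `ingest` depends only on the interface, switching repositories means passing a different driver, with no change to the ingest code.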
Core services
Content Model
• Digital objects comprise data streams
• Each data stream can hold any kind of information
– XML/RDF, Image, Video, Documents, etc.
• Each different representation of an information object is stored as a different data stream
• Each curation action generates a new version
– Transformation, Enrichment
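The content model can be sketched with two small classes. The class and method names are hypothetical, and the sketch assumes versions are tracked per data stream; the slide does not say whether versioning applies to the data stream or to the whole object.

```python
from dataclasses import dataclass, field


@dataclass
class DataStream:
    """One representation of an object (e.g. native XML, EDM, an image)."""

    name: str
    versions: list = field(default_factory=list)  # (action, content) pairs

    def add_version(self, content: str, action: str) -> int:
        """Record the result of a curation action; return the version number."""
        self.versions.append((action, content))
        return len(self.versions)

    @property
    def latest(self) -> str:
        return self.versions[-1][1]


@dataclass
class DigitalObject:
    """A digital object made up of named data streams."""

    object_id: str
    streams: dict = field(default_factory=dict)

    def stream(self, name: str) -> DataStream:
        """Return the named data stream, creating it on first use."""
        return self.streams.setdefault(name, DataStream(name))


obj = DigitalObject("pkg-1")
obj.stream("native").add_version("<lido/>", "harvesting")
obj.stream("EDM").add_version("<edm/>", "transformation")
obj.stream("EDM").add_version("<edm enriched='yes'/>", "enrichment")
print(sorted(obj.streams), len(obj.stream("EDM").versions), obj.stream("EDM").latest)
```

Earlier versions stay in the list, so the result of any previous curation action remains retrievable.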
Core services: Transformation
• Transforms entire information packages into the Europeana Data Model (EDM), or any other schema
• Multiple transformation routines
– Per schema
– Per project
– Per provider
• Users can attach a rights statement
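A possible way to organise routines per schema, project and provider is a registry lookup. The sketch below is hypothetical throughout: the routine names are invented, and the rule that a provider-level routine beats a project-level one, which beats a schema-level one, is an assumption not stated in the slides.

```python
# Hypothetical registry: keys are (schema, project, provider); None means "any".
ROUTINES = {
    ("LIDO", None, None): "lido-to-edm-default",
    ("LIDO", "LoCloud", None): "lido-to-edm-locloud",
    ("LIDO", "LoCloud", "museum-x"): "lido-to-edm-museum-x",
}


def select_routine(schema, project=None, provider=None):
    """Pick the most specific routine registered for a package.

    ASSUMPTION: provider-level beats project-level beats schema-level.
    """
    candidates = (
        (schema, project, provider),
        (schema, project, None),
        (schema, None, None),
    )
    for key in candidates:
        if key in ROUTINES:
            return ROUTINES[key]
    raise LookupError(f"no transformation routine for schema {schema!r}")


print(select_routine("LIDO", "LoCloud", "museum-x"))
print(select_routine("LIDO", "LoCloud", "another-provider"))
print(select_routine("LIDO"))
```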
Core services: Enrichment
• The generic enrichment service facilitates the execution of the enrichment micro-services
– Hides the complexity from the user by using enrichment plans
– Provides seamless integration with the UI of MoRe
• Virtual Enrichment driver
– Allows developers/creative industries to create their own enrichment services and declare/use them within MoRe
Core services: Previewing
• Preview the XML record information for all datastreams
• Preview the record in HTML (using the Europeana style sheet)
Core services: Publishing
• Publish transformed / enriched information
– Internal OAI-PMH provider
– XML export
– Publish directly to RDF repositories (Sesame, Virtuoso)
– Solr index server
Enrichment micro-services
• Thematic
– Thesauri collections
– Vocabulary matching
– Background links
• Spatial
– Geo-normalization
– Geo-coding
– Reverse geo-coding
– Historic place names
• Other
– Language identification
[Diagram labels: SKOS Thesauri; GeoNames; DBpedia; Wikipedia]
Enrichment Plan
• Enrichment micro-services are used within enrichment workflows, called enrichment plans
• Each enrichment plan applies to a specific schema
• Each enrichment plan executes enrichment micro-services in a specific order
[Diagram: an enrichment plan with the steps Language identification, Vocabulary matching, Geo-normalization, Geo-coding]
Enrichment Plan
• Each enrichment plan defines run-time parameters for specific services
– Content based
[Diagram: an enrichment plan with the steps Language identification, Vocabulary matching, Geo-normalization, Geo-coding, annotated with the rule "Add subject collection A only if term X or Y is matched"]
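An enrichment plan with an ordered list of steps and a content-based rule can be sketched as follows. The services are toy stand-ins and every name (`Step`, `run_plan`, the vocabulary terms) is hypothetical; this is not MoRe's plan format.

```python
from dataclasses import dataclass
from typing import Callable, Optional


def identify_language(record):
    # Toy stand-in: a real service would inspect the text.
    record.setdefault("language", "en")
    return record


def match_vocabulary(record, vocabulary=("windmill", "church")):
    # Record which vocabulary terms occur in the title.
    text = record.get("title", "").lower()
    record["matched_terms"] = [term for term in vocabulary if term in text]
    return record


def add_subject_collection(record, collection="A"):
    record.setdefault("collections", []).append(collection)
    return record


@dataclass
class Step:
    service: Callable[[dict], dict]
    condition: Optional[Callable[[dict], bool]] = None  # content-based rule


def run_plan(record, plan):
    """Run the steps in plan order, skipping steps whose condition fails."""
    for step in plan:
        if step.condition is None or step.condition(record):
            record = step.service(record)
    return record


PLAN = [
    Step(identify_language),
    Step(match_vocabulary),
    # "Add subject collection A only if term X or Y is matched"
    Step(add_subject_collection,
         condition=lambda r: bool({"windmill", "church"} & set(r["matched_terms"]))),
]

matched = run_plan({"title": "Old windmill near the river"}, PLAN)
unmatched = run_plan({"title": "Harbour view"}, PLAN)
print(matched)
print(unmatched)
```

Order matters here: the conditional step reads `matched_terms`, so vocabulary matching has to run before it.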
Dashboard
Packages organization
Package overview
Package lifecycle overview
Preview
Metadata completeness & statistics
Enrichment services overview
Direct access to 27 thesauri
Create & (re)use subject collections
Thank you
tzouvaras@image.ntua.gr
d.gavrilis@dcu.gr
