SlideShare a Scribd company logo
Concise Preservation by combining Managed
Forgetting and Contextualized Remembering
The Preserve-or-Forget Reference Model and Framework (WP8 ForgetIT 1st year review)
Francesco Gallo
EURIX
WP8 Presentation
The Preserve-or-Forget Reference
Model and Framework
ForgetIT 1st Review Meeting, April 29-30, 2014
Kaiserslautern, Germany
WP Objectives (from DoW)
●
integrate project components into a technologically coherent framework
●
adopt flexible and extensible solutions
●
define a PoF reference model supporting ForgetIT concepts
Focus of Year 1
• design of the PoF framework architecture
• technology assessment
• identification of project components and early integration
• started working on definition of PoF model
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Objectives of WP and Year 1 Focus
Design of the architecture for the Preserve-or-Forget Framework
Assessment of technologies for PoF middleware and AIS
Definition of components from all WPs, preliminary integration
Analysis of requirements for PoF reference model
Testbed setup and integration plan
PoF framework prototype, synergetic preservation workflow
• ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Achievements in Year 1
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Preserve-or-Forget Architecture (D8.1)
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
PoF Framework Components and APIs (D8.1)
middlewareapplications archive
cloud storage
WP3,WP4,WP5,WP6WP10
WP9 WP8
WP7
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
PoF Middleware Components
Shared components (general tasks)
●
ID Manager, Metadata Repository, Scheduler, Context-Aware
Preservation Manager
Other components (core ForgetIT principles)
●
Forgettor, Extractor, Condensator, Contextualizer, Navigator,
Collector, Archiver
For each component full description provided in D8.1
Many components available as prototypes, already integrated
Evaluated several candidates for implementing the Archive
●
Archivematica, FedoraCommons, DSpace, RODA, P4, iRODS, …
Assessment criteria:
●
open-source license, support for ForgetIT data types, TRL,
integration with PDS, language and technologies, documentation,
supporting community, …
DSpace selected as the candidate implementation of PoF Archive
●
widely adopted and actively maintained, supports all ForgetIT data
types, integrates with cloud storage solutions and other platforms
●
extensible with custom add-ons, periodic digital curation tasks,
validation upon ingest, user profiles, ...
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Assessment of OAIS platforms (D8.1)
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Archive: DSpace admin interface
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Archive: DSpace AIP preview
Enterprise Service Bus
●
communication layer for all integrated components
Enterprise Integration Patterns
●
distributed applications and services developed by all WPs
●
leverage best practices in enterprise application integration
Message Oriented Middleware
●
message-based communication layer: asynchronism, routing,
transformation, decoupling, reduced integration complexity
Candidate MOM implementation: Apache ServiceMix
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Technologies for PoF Middleware (D8.3)
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
PoF Middleware: Message Oriented Middleware
© F. Munz, Middleware and Cloud Computing, 2011
© D.A. Chappel, Enterprise Service Bus, 2004
PoF Middleware includes rule-based routing and mediation engine
implementing all Enterprise Integration Patterns (EIPs)
Seamless integration with messaging system
Pattern Examples: Reply/Forward, MessageRouter
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Workflows: rule-based message routing
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
PoF Middleware: messaging system
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
PoF Middleware: preserved items
Synergetic Preservation and Managed Forgetting Workflow
Resource restore from Applications (after processing in the storage)
Periodic Curation Tasks (fixity and format checks for bitstreams,
metadata checks for link consistency and completeness, …)
Format migration: leverage PDS computational storage (Storlets)
Metadata migration: DSpace provides tools to convert metadata from
one schema to another, intermediate mapping, extensible
Storage management: integration of DSpace Archive and PDS
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Preservation in PoF Framework
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Basic Synergetic Preservation Workflow (D8.1)
applications
middleware
archive
cloud storage
Archive: DSpace data model for SIP and AIP (items, collections,
communities), OAIS functional entities implemented (Ingest/Access,
Administration, Data Management, Preservation Planning and
Archival Storage shared with PDS)
PoF Middleware: SIP creation and ingest, DIP access, smooth bi-
directional transition from Applications to AIS, workflow management
PDS: Archival Storage, preservation actions close to data, internal
data model (tenant, docket, aggregation)
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
OAIS and the PoF Framework
Expected outcome at the end of the project: extend OAIS model to
support ForgetIT approach to preservation
OAIS specification provides: functional model, information model and
model for information package transformation
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
PoF Reference Model
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
PoF Reference Model -2
Work in progress: identify model requirements based on ForgetIT
principles and scenarios, analyze available internal models (e.g.
contextualization model) and compare with OAIS
Evaluate emerging digital preservation standards (e.g. MPEG MP-AF)
Challenging activity throughout the whole project lifetime, and an opportunity
Subversion repository and Trac for issue reporting system
ForgetIT Private Network (VPN)
KVM for virtualization (components and services deployed as VMs)
Shared data storage for test samples
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Testbed environment and collaborative
development
Collaboration with Presto4U Coordination Action (FP7)
●
assessment of digital preservation platforms and tools adopted by
different communities of practice (archivists, museums,
broadcasters, ...)
Lead of MPEG MP-AF Working Group
●
Multimedia Preservation Application Format
●
evaluation of standard metadata formats for digital preservation
●
new standard ISO/IEC 23000-15 for interoperable digital
preservation format
ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014
Dissemination activities
The Preserve-or-Forget Reference Model and Framework (WP8 ForgetIT 1st year review)
Thank you for your attention!

More Related Content

PPTX
Personal Preservation (WP9 ForgetIT 1st year review)
PDF
Organizational Preservation (WP10 ForgetIT 1st year review)
PPTX
Joint Information and Preservation Management (WP5 ForgetIT 1st year review)
PPTX
Contextualization / Decontextualization (WP6 ForgetIT 1st year review)
PPTX
Foundations of Forgetting and Remembering (WP2 - ForgetIT 1st year review)
PDF
Managed Forgetting (WP3 - ForgetIT 1st year review)
PPTX
Information Consolidation and Concentration (WP4 ForgetIT 1st year review)
PDF
Computational Storage Services (WP7 ForgetIT 1st year review)
Personal Preservation (WP9 ForgetIT 1st year review)
Organizational Preservation (WP10 ForgetIT 1st year review)
Joint Information and Preservation Management (WP5 ForgetIT 1st year review)
Contextualization / Decontextualization (WP6 ForgetIT 1st year review)
Foundations of Forgetting and Remembering (WP2 - ForgetIT 1st year review)
Managed Forgetting (WP3 - ForgetIT 1st year review)
Information Consolidation and Concentration (WP4 ForgetIT 1st year review)
Computational Storage Services (WP7 ForgetIT 1st year review)

What's hot (6)

PDF
Digital dark age - Are we doing enough to preserve our website heritage?
PDF
TYPO3 and CMIS
PPTX
PPTX
Filling the digital preservation gap
PDF
Science and Research - a new experimental platform in Brazil
PPTX
Data Preservation Service Area
Digital dark age - Are we doing enough to preserve our website heritage?
TYPO3 and CMIS
Filling the digital preservation gap
Science and Research - a new experimental platform in Brazil
Data Preservation Service Area
Ad

Similar to The Preserve-or-Forget Reference Model and Framework (WP8 ForgetIT 1st year review) (20)

PDF
5th Content Providers Community Call
PPTX
TechEvent Agile infrastructure projects
PPTX
The habitats approach to build the inspire infrastructure
PDF
Smarter Manufacturing Sustainable Futures 4 FLEXINET project IT Perspective
PPT
Rapid Software Development Process
PPTX
AGILE M18 – State of the “Nation”
PDF
DEEP general presentation
PDF
Producing documentation for Eclipse RCP applications using single source prin...
PPTX
Research Data Shared Services
PDF
SCAPE - Scalable Preservation Environments
PDF
Bhadale group of companies projects portfolio
PDF
Red Hat TUG Utrecht - Storage Update june 2015
PDF
Deploying and Managing Artificial Intelligence Services using the Open Data H...
PPT
Goobi
PDF
H2O World - Collaborative, Reproducible Research with H2O - Nick Elprin
PDF
Demo of integrated synergetic preservation workflow (WP8 ForgetIT 1st year r...
PPTX
ArchAIDE Project Introduction
PPTX
Deep Hybrid DataCloud
PDF
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
PPTX
The Future of Apache Hadoop an Enterprise Architecture View
5th Content Providers Community Call
TechEvent Agile infrastructure projects
The habitats approach to build the inspire infrastructure
Smarter Manufacturing Sustainable Futures 4 FLEXINET project IT Perspective
Rapid Software Development Process
AGILE M18 – State of the “Nation”
DEEP general presentation
Producing documentation for Eclipse RCP applications using single source prin...
Research Data Shared Services
SCAPE - Scalable Preservation Environments
Bhadale group of companies projects portfolio
Red Hat TUG Utrecht - Storage Update june 2015
Deploying and Managing Artificial Intelligence Services using the Open Data H...
Goobi
H2O World - Collaborative, Reproducible Research with H2O - Nick Elprin
Demo of integrated synergetic preservation workflow (WP8 ForgetIT 1st year r...
ArchAIDE Project Introduction
Deep Hybrid DataCloud
Enriching SMW based Virtual Research Environments with external data, Jan Nov...
The Future of Apache Hadoop an Enterprise Architecture View
Ad

Recently uploaded (20)

PDF
Electronic commerce courselecture one. Pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
KodekX | Application Modernization Development
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Cloud computing and distributed systems.
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
MYSQL Presentation for SQL database connectivity
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Machine learning based COVID-19 study performance prediction
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Encapsulation theory and applications.pdf
Electronic commerce courselecture one. Pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
KodekX | Application Modernization Development
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Cloud computing and distributed systems.
NewMind AI Weekly Chronicles - August'25 Week I
MYSQL Presentation for SQL database connectivity
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Unlocking AI with Model Context Protocol (MCP)
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Understanding_Digital_Forensics_Presentation.pptx
20250228 LYD VKU AI Blended-Learning.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Machine learning based COVID-19 study performance prediction
Building Integrated photovoltaic BIPV_UPV.pdf
Network Security Unit 5.pdf for BCA BBA.
Spectral efficient network and resource selection model in 5G networks
Encapsulation theory and applications.pdf

The Preserve-or-Forget Reference Model and Framework (WP8 ForgetIT 1st year review)

  • 1. Concise Preservation by combining Managed Forgetting and Contextualized Remembering
  • 3. Francesco Gallo EURIX WP8 Presentation The Preserve-or-Forget Reference Model and Framework ForgetIT 1st Review Meeting, April 29-30, 2014 Kaiserslautern, Germany
  • 4. WP Objectives (from DoW) ● integrate project components into a technologically coherent framework ● adopt flexible and extensible solutions ● define a PoF reference model supporting ForgetIT concepts Focus of Year 1 • design of the PoF framework architecture • technology assessment • identification of project components and early integration • started working on definition of PoF model ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Objectives of WP and Year 1 Focus
  • 5. Design of the architecture for the Preserve-or-Forget Framework Assessment of technologies for PoF middleware and AIS Definition of components from all WPs, preliminary integration Analysis of requirements for PoF reference model Testbed setup and integration plan PoF framework prototype, synergetic preservation workflow • ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Achievements in Year 1
  • 6. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Preserve-or-Forget Architecture (D8.1)
  • 7. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 PoF Framework Components and APIs (D8.1) middlewareapplications archive cloud storage WP3,WP4,WP5,WP6WP10 WP9 WP8 WP7
  • 8. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 PoF Middleware Components Shared components (general tasks) ● ID Manager, Metadata Repository, Scheduler, Context-Aware Preservation Manager Other components (core ForgetIT principles) ● Forgettor, Extractor, Condensator, Contextualizer, Navigator, Collector, Archiver For each component full description provided in D8.1 Many components available as prototypes, already integrated
  • 9. Evaluated several candidates for implementing the Archive ● Archivematica, FedoraCommons, DSpace, RODA, P4, iRODS, … Assessment criteria: ● open-source license, support for ForgetIT data types, TRL, integration with PDS, language and technologies, documentation, supporting community, … DSpace selected as the candidate implementation of PoF Archive ● widely adopted and actively maintained, supports all ForgetIT data types, integrates with cloud storage solutions and other platforms ● extensible with custom add-ons, periodic digital curation tasks, validation upon ingest, user profiles, ... ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Assessment of OAIS platforms (D8.1)
  • 10. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Archive: DSpace admin interface
  • 11. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Archive: DSpace AIP preview
  • 12. Enterprise Service Bus ● communication layer for all integrated components Enterprise Integration Patterns ● distributed applications and services developed by all WPs ● leverage best practices in enterprise application integration Message Oriented Middleware ● message-based communication layer: asynchronism, routing, transformation, decoupling, reduced integration complexity Candidate MOM implementation: Apache ServiceMix ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Technologies for PoF Middleware (D8.3)
  • 13. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 PoF Middleware: Message Oriented Middleware © F. Munz, Middleware and Cloud Computing, 2011 © D.A. Chappel, Enterprise Service Bus, 2004
  • 14. PoF Middleware includes rule-based routing and mediation engine implementing all Enterprise Integration Patterns (EIPs) Seamless integration with messaging system Pattern Examples: Reply/Forward, MessageRouter ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Workflows: rule-based message routing
  • 15. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 PoF Middleware: messaging system
  • 16. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 PoF Middleware: preserved items
  • 17. Synergetic Preservation and Managed Forgetting Workflow Resource restore from Applications (after processing in the storage) Periodic Curation Tasks (fixity and format checks for bitstreams, metadata checks for link consistency and completeness, …) Format migration: leverage PDS computational storage (Storlets) Metadata migration: DSpace provides tools to convert metadata from one schema to another, intermediate mapping, extensible Storage management: integration of DSpace Archive and PDS ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Preservation in PoF Framework
  • 18. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Basic Synergetic Preservation Workflow (D8.1) applications middleware archive cloud storage
  • 19. Archive: DSpace data model for SIP and AIP (items, collections, communities), OAIS functional entities implemented (Ingest/Access, Administration, Data Management, Preservation Planning and Archival Storage shared with PDS) PoF Middleware: SIP creation and ingest, DIP access, smooth bi- directional transition from Applications to AIS, workflow management PDS: Archival Storage, preservation actions close to data, internal data model (tenant, docket, aggregation) ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 OAIS and the PoF Framework
  • 20. Expected outcome at the end of the project: extend OAIS model to support ForgetIT approach to preservation OAIS specification provides: functional model, information model and model for information package transformation ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 PoF Reference Model
  • 21. ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 PoF Reference Model -2 Work in progress: identify model requirements based on ForgetIT principles and scenarios, analyze available internal models (e.g. contextualization model) and compare with OAIS Evaluate emerging digital preservation standards (e.g. MPEG MP-AF) Challenging activity throughout the whole project lifetime, and an opportunity
  • 22. Subversion repository and Trac for issue reporting system ForgetIT Private Network (VPN) KVM for virtualization (components and services deployed as VMs) Shared data storage for test samples ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Testbed environment and collaborative development
  • 23. Collaboration with Presto4U Coordination Action (FP7) ● assessment of digital preservation platforms and tools adopted by different communities of practice (archivists, museums, broadcasters, ...) Lead of MPEG MP-AF Working Group ● Multimedia Preservation Application Format ● evaluation of standard metadata formats for digital preservation ● new standard ISO/IEC 23000-15 for interoperable digital preservation format ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014 Dissemination activities
  • 25. Thank you for your attention!