SlideShare a Scribd company logo
Fedora-Commons Repository A brief overview Etienne Posthumus, DPO, TU Delft Library 08-06-09
What is it? Fedora  is a general-purpose, open-source digital object repository system.
What is it? Fedora  is a general-purpose, open-source digital object repository system.  "Flexible Extensible Digital Object Repository Architecture"  NOT the Linux distribution
What is it? Fedora  is a general-purpose, open-source digital object repository system.  …a persistence and service delivery platform
What is it? Fedora  is a general-purpose, open-source digital object repository system.  …a persistence and service delivery platform …implemented as a set of Java services (Tomcat)
What is it? Fedora  is a general-purpose, open-source digital object repository system.  A well-funded active community with a long history.
Conforms to the OAIS reference model Open Archival Information System  - an ISO model defined by Consultative Committee for Space Data Systems widely adopted in Archives community Built on sub-systems Open Source, Open Standards
Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset management Built on sub-systems Open Source, Open Standards
Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset management Scaleable triple-store Mulgara  Built on sub-systems Open Source, Open Standards
Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset management Scaleable triple-store Mulgara  Service delivery  XSLT via Saxon built-in others pluggable Built on sub-systems Open Source, Open Standards
Advantages? Scaling Integrated Asset management Extensive Versioning Audit trails Large(ish) community, esp. mindshare General ‘fit and polish’ Goal is support for > 10 million objects
Downsides? It’s a toolbox not a complete solution (front-end support) Java code base (for some this is an advantage ;-) Moving target in terms of innovation (also debatable) Complexity of some concepts, steep learning curve A lot of DIY is needed
Digital Object Model Compound objects, aggregate items per object
Digital Object Model Compound objects, aggregate items per object Some ‘reserved’ item IDs
Digital Object Model Compound objects, aggregate items per object Some ‘reserved’ item IDs Essentially an XML container
Digital Object Model <foxml:digitalObject PID=&quot;demo:31&quot; VERSION=&quot;1.1&quot; xmlns:foxml=&quot;info:fedora/fedora-system:def/foxml#&quot; xmlns:xsi=&quot;http://www.w3.org/2001/XMLSchema-instance&quot; xsi:schemaLocation=&quot;info:fedora/fedora-system:def/foxml# http://www.fedora.info/definitions/1/0/foxml1-1.xsd&quot;> <foxml:objectProperties> <foxml:property NAME=&quot;info:fedora/fedora-system:def/model#state&quot; VALUE=&quot;Active&quot;/> <foxml:property NAME=&quot;info:fedora/fedora-system:def/model#ownerId&quot; VALUE=&quot;fedoraAdmin&quot;/> <foxml:property NAME=&quot;info:fedora/fedora-system:def/model#createdDate&quot; VALUE=&quot;2008-07-02T05:09:42.015Z&quot;/> <foxml:property NAME=&quot;info:fedora/fedora-system:def/view#lastModifiedDate&quot; VALUE=&quot;2008-07-02T05:09:42.015Z&quot;/> </foxml:objectProperties> <foxml:datastream CONTROL_GROUP=&quot;X&quot; ID=&quot;DC&quot; STATE=&quot;A&quot; VERSIONABLE=&quot;true&quot;> <foxml:datastreamVersion CREATED=&quot;2008-07-02T05:09:43.328Z&quot; FORMAT_URI=&quot;http://www.openarchives.org/OAI/2.0/oai_dc/&quot; ID=&quot;DC1.0&quot; LABEL=&quot;Dublin Core Record for this object&quot; MIMETYPE=&quot;text/xml&quot;> <foxml:xmlContent> <oai_dc:dc xmlns:dc=&quot;http://purl.org/dc/elements/1.1/&quot; xmlns:oai_dc=&quot;http://www.openarchives.org/OAI/2.0/oai_dc/&quot;> <dc:title>Apache Batik 1.5 Demo Directory</dc:title> <dc:creator>Chris Wilper</dc:creator> <dc:creator>The Apache Group</dc:creator> <dc:subject>svg</dc:subject> <dc:publisher>The Apache Group</dc:publisher> <dc:format>image/svg+xml</dc:format> <dc:identifier>demo:31</dc:identifier> <dc:relation>http://xml.apache.org/batik/</dc:relation> <dc:rights>The Apache License</dc:rights> </oai_dc:dc> </foxml:xmlContent> </foxml:datastreamVersion> </foxml:datastream> <foxml:datastream CONTROL_GROUP=&quot;E&quot; ID=&quot;DS1&quot; STATE=&quot;A&quot; VERSIONABLE=&quot;true&quot;> <foxml:datastreamVersion CREATED=&quot;2008-07-02T05:09:43.328Z&quot; ID=&quot;DS1.0&quot; LABEL=&quot;Local copy of batikBatik.svg&quot; MIMETYPE=&quot;image/svg+xml&quot;> <foxml:contentLocation REF=&quot;http://www.fedora-commons.org/demo/misc/batikBatik.svg&quot; TYPE=&quot;URL&quot;/> </foxml:datastreamVersion> </foxml:datastream> An Example
Content Model Architecture “ an integrated structure for persisting and delivering the essential characteristics of digital objects in Fedora” Modelling behaviours and structure of objects. Complicated, fairly new, not widely implemented yet, in flux. … can also be safely ignored when starting out. (which we did not realise)
Fedora at TU Delft Library Used to create a repository Infrastructure
Fedora at TU Delft Library Used to create a Repository Infrastructure Multiple projects: Institutional Repository (academic output) Special Collections (Tresor, bouwkunde dias) Expert Search (finding People) 3TU Datacentrum
Implementation Front-end is Django application
Implementation Front-end is Django application We use the REST API for communication
Implementation Front-end is Django application We use the REST API for communication Middleware as Python code providing a web API
Implementation Front-end is Django application We use the REST API for communication Middleware as Python code providing a web API MOAI flexible OAI-PMH service
Implementation Front-end is Django application We use the REST API for communication Middleware as Python code providing a web API MOAI flexible OAI-PMH service Indexing via SOLR subsequently linked to DISCOVER
 
Conversion of existing repository Legacy Stellent CMS stored in Oracle Circa 12000 items Mostly PDF plus metadata Conversion scripts in Python Bulk ingest
Community Participation NL Fedora Usergroup Contact with key Fedora participants at conferences Presentations and workshop Developer days planned Possible collaboration on development Sponsorship of open-source components
Plans Convert existing repository with existing functionality Incrementally build in new features on ongoing basis Extend APIs and services Linked Data repository More innovative repository applications
Questions? [email_address] http://www.fedora-commons.org/

More Related Content

PPTX
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
PDF
What's with the 1s and 0s? Making sense of binary data at scale with Tika and...
PPT
Apache Tika
PPT
Apache Tika end-to-end
PPT
Apache Tika: 1 point Oh!
PPTX
Optimizing Application Architecture (.NET/Java topics)
PPT
Content Analysis with Apache Tika
PPT
Text and metadata extraction with Apache Tika
Data FAIRport Prototype & Demo - Presentation to Elsevier, Jul 10, 2015
What's with the 1s and 0s? Making sense of binary data at scale with Tika and...
Apache Tika
Apache Tika end-to-end
Apache Tika: 1 point Oh!
Optimizing Application Architecture (.NET/Java topics)
Content Analysis with Apache Tika
Text and metadata extraction with Apache Tika

Viewers also liked (20)

PPT
Using Fedora Commons To Create A Persistent Archive
PDF
Greenstone Digital Library
DOCX
Digital library softaware greenstone & dsapce
PDF
Librarians and Open Access: the case of E-LIS
PPTX
EPrints and the Cloud
PPT
EPrints for Data
PDF
Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...
PPT
E-LIS: an Eprints LIS Repository
PDF
Digital preservation and institutional repositories
PPT
Biblio to Fedora Commons REST API
PPTX
11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”
PPTX
eprints digital library software
PPT
Repositories and digital preservation
PDF
Introduction to fedora 20cat
PDF
2.28.17 Introducing DSpace 7 Webinar Slides
PDF
DSpace Training Presentation
PDF
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
PPT
DSpace Tutorial : Open Source Digital Library
PPS
What is Greenstone Digital Library and Tips for Development
Using Fedora Commons To Create A Persistent Archive
Greenstone Digital Library
Digital library softaware greenstone & dsapce
Librarians and Open Access: the case of E-LIS
EPrints and the Cloud
EPrints for Data
Ψηφιακές βιβλιοθήκες, ψηφιακά αποθετήρια, υποδομές δεδομένων: θεμέλια της νέα...
E-LIS: an Eprints LIS Repository
Digital preservation and institutional repositories
Biblio to Fedora Commons REST API
11.5.14 Presentation Slides, “Fedora 4.0 in Action at Penn State and Stanford”
eprints digital library software
Repositories and digital preservation
Introduction to fedora 20cat
2.28.17 Introducing DSpace 7 Webinar Slides
DSpace Training Presentation
3.7.17 DSpace for Data: issues, solutions and challenges Webinar Slides
DSpace Tutorial : Open Source Digital Library
What is Greenstone Digital Library and Tips for Development
Ad

Similar to Fedora Overview (20)

PDF
Wilcox - Open Source Repositories and the Future of Fedora
DOCX
tutorial2.docx - Fedora Tutorial
PPTX
3TU Datacentrum Tech Overview
PDF
A Durable Space from April 23 NISO Virtual Conference: Dealing with the Data ...
PPTX
Fedora Commons in the CLARIN Infrastructure
PDF
Fedora 4 Deep Dive
PDF
2.28.18 Getting Started with Fedora presentation slides
PDF
Fedora Futures - CNI 2012
PDF
Drupal for Higher Education and Virtual Learning
PDF
10.15.14 Presentation Slides, “Fedora 4.0 in Action at The Art Institute of C...
PDF
Getting Involved in an Free/Open Source Operating System
PPT
dotte.ppt
PDF
9-10-13 Stewarding Research Data with Fedora and Islandora Presentation Slides
PPT
Fedora Futures for OR13
PPTX
A Little Sweat Goes A Long Way - Museums and The Web 2016
PDF
Fedora App Slide 2009 Hastac
PPTX
Fedora 4 :Introduction and Overview
PDF
Fedora Modularity
PPTX
Fedora os presentation
Wilcox - Open Source Repositories and the Future of Fedora
tutorial2.docx - Fedora Tutorial
3TU Datacentrum Tech Overview
A Durable Space from April 23 NISO Virtual Conference: Dealing with the Data ...
Fedora Commons in the CLARIN Infrastructure
Fedora 4 Deep Dive
2.28.18 Getting Started with Fedora presentation slides
Fedora Futures - CNI 2012
Drupal for Higher Education and Virtual Learning
10.15.14 Presentation Slides, “Fedora 4.0 in Action at The Art Institute of C...
Getting Involved in an Free/Open Source Operating System
dotte.ppt
9-10-13 Stewarding Research Data with Fedora and Islandora Presentation Slides
Fedora Futures for OR13
A Little Sweat Goes A Long Way - Museums and The Web 2016
Fedora App Slide 2009 Hastac
Fedora 4 :Introduction and Overview
Fedora Modularity
Fedora os presentation
Ad

Recently uploaded (20)

PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PPTX
Cell Structure & Organelles in detailed.
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
01-Introduction-to-Information-Management.pdf
PDF
Business Ethics Teaching Materials for college
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
Cell Types and Its function , kingdom of life
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
STATICS OF THE RIGID BODIES Hibbelers.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Week 4 Term 3 Study Techniques revisited.pptx
Cell Structure & Organelles in detailed.
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
01-Introduction-to-Information-Management.pdf
Business Ethics Teaching Materials for college
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
2.FourierTransform-ShortQuestionswithAnswers.pdf
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Microbial diseases, their pathogenesis and prophylaxis
Cell Types and Its function , kingdom of life
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
human mycosis Human fungal infections are called human mycosis..pptx

Fedora Overview

  • 1. Fedora-Commons Repository A brief overview Etienne Posthumus, DPO, TU Delft Library 08-06-09
  • 2. What is it? Fedora is a general-purpose, open-source digital object repository system.
  • 3. What is it? Fedora is a general-purpose, open-source digital object repository system. &quot;Flexible Extensible Digital Object Repository Architecture&quot; NOT the Linux distribution
  • 4. What is it? Fedora is a general-purpose, open-source digital object repository system. …a persistence and service delivery platform
  • 5. What is it? Fedora is a general-purpose, open-source digital object repository system. …a persistence and service delivery platform …implemented as a set of Java services (Tomcat)
  • 6. What is it? Fedora is a general-purpose, open-source digital object repository system. A well-funded active community with a long history.
  • 7. Conforms to the OAIS reference model Open Archival Information System - an ISO model defined by Consultative Committee for Space Data Systems widely adopted in Archives community Built on sub-systems Open Source, Open Standards
  • 8. Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset management Built on sub-systems Open Source, Open Standards
  • 9. Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset management Scaleable triple-store Mulgara Built on sub-systems Open Source, Open Standards
  • 10. Persistent digital archive ‘files on disk’ bootstrappeable in the face of disaster XML containers asset management Scaleable triple-store Mulgara Service delivery XSLT via Saxon built-in others pluggable Built on sub-systems Open Source, Open Standards
  • 11. Advantages? Scaling Integrated Asset management Extensive Versioning Audit trails Large(ish) community, esp. mindshare General ‘fit and polish’ Goal is support for > 10 million objects
  • 12. Downsides? It’s a toolbox not a complete solution (front-end support) Java code base (for some this is an advantage ;-) Moving target in terms of innovation (also debatable) Complexity of some concepts, steep learning curve A lot of DIY is needed
  • 13. Digital Object Model Compound objects, aggregate items per object
  • 14. Digital Object Model Compound objects, aggregate items per object Some ‘reserved’ item IDs
  • 15. Digital Object Model Compound objects, aggregate items per object Some ‘reserved’ item IDs Essentially an XML container
  • 16. Digital Object Model <foxml:digitalObject PID=&quot;demo:31&quot; VERSION=&quot;1.1&quot; xmlns:foxml=&quot;info:fedora/fedora-system:def/foxml#&quot; xmlns:xsi=&quot;http://www.w3.org/2001/XMLSchema-instance&quot; xsi:schemaLocation=&quot;info:fedora/fedora-system:def/foxml# http://www.fedora.info/definitions/1/0/foxml1-1.xsd&quot;> <foxml:objectProperties> <foxml:property NAME=&quot;info:fedora/fedora-system:def/model#state&quot; VALUE=&quot;Active&quot;/> <foxml:property NAME=&quot;info:fedora/fedora-system:def/model#ownerId&quot; VALUE=&quot;fedoraAdmin&quot;/> <foxml:property NAME=&quot;info:fedora/fedora-system:def/model#createdDate&quot; VALUE=&quot;2008-07-02T05:09:42.015Z&quot;/> <foxml:property NAME=&quot;info:fedora/fedora-system:def/view#lastModifiedDate&quot; VALUE=&quot;2008-07-02T05:09:42.015Z&quot;/> </foxml:objectProperties> <foxml:datastream CONTROL_GROUP=&quot;X&quot; ID=&quot;DC&quot; STATE=&quot;A&quot; VERSIONABLE=&quot;true&quot;> <foxml:datastreamVersion CREATED=&quot;2008-07-02T05:09:43.328Z&quot; FORMAT_URI=&quot;http://www.openarchives.org/OAI/2.0/oai_dc/&quot; ID=&quot;DC1.0&quot; LABEL=&quot;Dublin Core Record for this object&quot; MIMETYPE=&quot;text/xml&quot;> <foxml:xmlContent> <oai_dc:dc xmlns:dc=&quot;http://purl.org/dc/elements/1.1/&quot; xmlns:oai_dc=&quot;http://www.openarchives.org/OAI/2.0/oai_dc/&quot;> <dc:title>Apache Batik 1.5 Demo Directory</dc:title> <dc:creator>Chris Wilper</dc:creator> <dc:creator>The Apache Group</dc:creator> <dc:subject>svg</dc:subject> <dc:publisher>The Apache Group</dc:publisher> <dc:format>image/svg+xml</dc:format> <dc:identifier>demo:31</dc:identifier> <dc:relation>http://xml.apache.org/batik/</dc:relation> <dc:rights>The Apache License</dc:rights> </oai_dc:dc> </foxml:xmlContent> </foxml:datastreamVersion> </foxml:datastream> <foxml:datastream CONTROL_GROUP=&quot;E&quot; ID=&quot;DS1&quot; STATE=&quot;A&quot; VERSIONABLE=&quot;true&quot;> <foxml:datastreamVersion CREATED=&quot;2008-07-02T05:09:43.328Z&quot; ID=&quot;DS1.0&quot; LABEL=&quot;Local copy of batikBatik.svg&quot; MIMETYPE=&quot;image/svg+xml&quot;> <foxml:contentLocation REF=&quot;http://www.fedora-commons.org/demo/misc/batikBatik.svg&quot; TYPE=&quot;URL&quot;/> </foxml:datastreamVersion> </foxml:datastream> An Example
  • 17. Content Model Architecture “ an integrated structure for persisting and delivering the essential characteristics of digital objects in Fedora” Modelling behaviours and structure of objects. Complicated, fairly new, not widely implemented yet, in flux. … can also be safely ignored when starting out. (which we did not realise)
  • 18. Fedora at TU Delft Library Used to create a repository Infrastructure
  • 19. Fedora at TU Delft Library Used to create a Repository Infrastructure Multiple projects: Institutional Repository (academic output) Special Collections (Tresor, bouwkunde dias) Expert Search (finding People) 3TU Datacentrum
  • 20. Implementation Front-end is Django application
  • 21. Implementation Front-end is Django application We use the REST API for communication
  • 22. Implementation Front-end is Django application We use the REST API for communication Middleware as Python code providing a web API
  • 23. Implementation Front-end is Django application We use the REST API for communication Middleware as Python code providing a web API MOAI flexible OAI-PMH service
  • 24. Implementation Front-end is Django application We use the REST API for communication Middleware as Python code providing a web API MOAI flexible OAI-PMH service Indexing via SOLR subsequently linked to DISCOVER
  • 25.  
  • 26. Conversion of existing repository Legacy Stellent CMS stored in Oracle Circa 12000 items Mostly PDF plus metadata Conversion scripts in Python Bulk ingest
  • 27. Community Participation NL Fedora Usergroup Contact with key Fedora participants at conferences Presentations and workshop Developer days planned Possible collaboration on development Sponsorship of open-source components
  • 28. Plans Convert existing repository with existing functionality Incrementally build in new features on ongoing basis Extend APIs and services Linked Data repository More innovative repository applications