EPrints Cloud Visions
What is EPrints For?
  • EPrints offers a safe, open and useful place to store, share and manage material in the pursuit of research and educational agendas.
  • Administrative reporting, collaboration, data sharing, digital profile enhancement, e-learning, e-publishing, e-research, marketing, open access, preservation, publicity, research assessment, research management, scholarly collections
Research Curation, Researcher Support
  • Researchers’ environment supported by repository
  • Research data managed by repository
  • Research community assisted by repository
What is a Repository
  • Safe, secure, persistent, managed storage for files
  • Safe, secure, persistent management of shareable FRBR works
  • Safe, secure, persistent management of scholarly & scientific working
  • Leading to…
  • Science 2.0 / The Fourth Paradigm / Data Intensive Science
  • The challenge is not cloud computing but cloud thinking
Bio-Diversity
Current EPrints Cloud Capabilities
  • Amazon Elastic Compute Cloud (EC2) Machine Images (AMIs)
    • Small (Single Core / 1.7 GB)
    • Large (64-bit / Quad Core / 7.5 GB)
    • Extra Large (64-bit / 8 Core / 15 GB)
  • EPrints 3.2 is 64-bit enabled
  • Persistent Database & Storage
  • Really Excited - Super Fast / Cheap / Easy!
Cloud to Desktop Storage
  • Data can be stored on multiple storage services
    • Local disk, SAN, NAS, Honeycomb, Cloud
  • Researchers can mount repository objects as a networked filesystem
  • Service usage and preservation risks can be monitored and analysed.
Hybrid Storage In EPrints
  • A single storage solution has drawbacks.
  • Cost vs. Speed vs. Reliability
  • Repositories need to be agile: to utilize and be able to migrate to new platforms
  • Leverage the benefits of each solution without losing control of your digital objects.
Local Disk Storage
  • No local bandwidth costs
  • Hard to expand
  • Locally Managed
  • High overhead costs
  • Requires space and cooling
  • Tied closely to the software
  • Storage ecosystem
Local Archival Storage
  • Specialist
  • Expensive to purchase
  • Locally Managed
  • Space and running costs
  • Expandable
  • Storage ecosystem
Cloud Storage
  • Scalable
  • Externally controlled
  • Known Costings
  • Unclear retention policy
  • Re-Useable (using simple APIs)
  • Global Scale
  • Storage ecosystem
But Clouds Blow Away
  • Recently:
    • Yahoo Briefcase
    • XDrive
    • AOL Pictures
    • HP Upline
    • Sony Image Station
  • Source: Tom Spring - PCWorld
Why use Hybrid Storage
  • Use the best features of each storage type
  • Performance
  • Scaling-up bandwidth
  • Optimisation
  • Large-file handling
  • Multimedia streaming
  • Localised Delivery
  • Local delivery from the cloud
EPrints Storage Controller
  • The storage controller decides where to put a file.
  • Rule-based policy defined by XML configuration file
  • Large binary files of scientific data (raw machine result data) can be stored in a large disk (slower access) system and sent to a tape company for long term storage.
  • Processed results can be stored locally and in the cloud ready for rapid delivery to end points.
Architecture Diagram
Controller Ruleset
    <choose>
        <when test="datasetid = 'document'">
            <choose>
                <when test="$parent{relation_type} = 'isVolatileVersionOf'">
                    <plugin name="Local"/>
                </when>
                <otherwise>
                    <plugin name="AmazonS3"/>
                </otherwise>
            </choose>
        </when>
        <otherwise>
            <plugin name="Local"/>
        </otherwise>
    </choose>
EPrints Storage Manager
Amazon S3 Localisation (1)
Amazon S3 Localisation (2)
Preservation Services
  • Object Classification
  • Risk Analysis
  • Mitigation and Migration
EPrints Forthcoming Development
EPrints Cloud Services
  • Web based repository setup
    • Much like getting started with a blog.
    • Fill in a form and obtain a repository.
    • Coming to EPrints core in next major release.
  • Enterprise Support for Cloud Solutions
    • Full Setup & Configuration
    • Global Distribution
    • Auto Upgrade & Patching
    • Trusted Backup
EPrints 3.2
  • Plug-ins / Modules
  • Everything builds on the core layer
  • Major part of v3.2 is strengthening the core and adding more abstraction layers
  • Improved data model
  • Enhanced data facilities
  • Enhanced metadata facilities
  • Improved programming & API
EPrints 3.2 Structure
Community Driven Development
  • There are many abstraction layers.
    • Display Manipulation
    • Upload Handlers
    • Custom Datasets
    • Import / Export Plug-ins
    • Transcoding Plug-ins
    • Database Plug-ins
    • Storage Plug-ins
  • One API
Storage Plug-ins
  • Local
  • NFS
  • Amazon S3
  • Sun Cloud Storage Service
  • Microsoft Azure
  • Any others based on the S3 API…. (the last 3 all are)
  • 5 Call API (about 30 minutes to write a plug-in)
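
The slide does not name the five calls, so the following Python sketch only illustrates why a small storage interface makes a new backend quick to write. Every name in it (`StoragePlugin`, `store`, `retrieve`, `delete`, `exists`, `size`, `InMemoryStorage`, `store_on_all`) is hypothetical and should not be read as the EPrints plug-in API.

```python
from abc import ABC, abstractmethod


class StoragePlugin(ABC):
    """Hypothetical five-call storage interface (illustrative names only)."""

    @abstractmethod
    def store(self, key: str, data: bytes) -> None:
        """Write an object under the given key."""

    @abstractmethod
    def retrieve(self, key: str) -> bytes:
        """Return the bytes stored under the key."""

    @abstractmethod
    def delete(self, key: str) -> None:
        """Remove the object stored under the key."""

    @abstractmethod
    def exists(self, key: str) -> bool:
        """Report whether an object is stored under the key."""

    @abstractmethod
    def size(self, key: str) -> int:
        """Return the stored object's length in bytes."""


class InMemoryStorage(StoragePlugin):
    """Toy backend standing in for local disk, NFS or a cloud service."""

    def __init__(self) -> None:
        self._objects = {}

    def store(self, key: str, data: bytes) -> None:
        self._objects[key] = data

    def retrieve(self, key: str) -> bytes:
        return self._objects[key]

    def delete(self, key: str) -> None:
        self._objects.pop(key, None)

    def exists(self, key: str) -> bool:
        return key in self._objects

    def size(self, key: str) -> int:
        return len(self._objects[key])


def store_on_all(backends, key, data):
    """Hybrid storage: write the same object to every selected backend."""
    for backend in backends:
        backend.store(key, data)


local, cloud = InMemoryStorage(), InMemoryStorage()
store_on_all([local, cloud], "doc/1", b"results")
local.delete("doc/1")  # the cloud copy is unaffected
```

With an interface this small, a new backend only has to supply the five methods; the code that decides where objects go never needs to know which service sits behind each plug-in.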
Our Development Vision
  • Empower the Community with a simple API
    • API in 3.2
  • Give the community a platform to test their code
    • Use the Cloud!
  • Give the community a distribution mechanism
    • The EPrints Bazaar (beta)

EPrints and the Cloud

EPrints Bazaar
  • Similar in concept to Apple’s App Store
  • Every install of EPrints will have access to the Bazaar
  • Single click install/uninstall of plug-ins
  • EPrints Services Approved Plug-ins
  • Enterprise support for limited 3rd party plug-ins
Summary
  • EPrints provides the professional, enterprise level application for resource management
  • Including cloud support at many levels
    • Repository-in-the-cloud
    • Storage-in-the-cloud
    • Services-in-the-cloud