SlideShare a Scribd company logo
Web 2.0 and repositories… … have we got our repository architecture right?
Outline where are we now? what’s wrong with where we are now? what can we do about it? do we need a new vision?
Where are we now? where are we now?
What is a repository? a university-based  institutional repository  is a set of services that a university offers to the members of its community for the  management and dissemination of digital materials  created by the institution and its community members. It is most essentially an organizational commitment to the  stewardship of these digital materials, including long-term preservation where appropriate, as well as organization and access or distribution . … An institutional repository is  not simply a fixed set of software and hardware (Cliff Lynch, 2003)
Repository “doing” words manage deposit disclose make openly available curate preserve
Repository content all sorts… but most “academic” focus currently on scholarly publications learning objects research data
Repository content all sorts… but most “academic” focus currently on scholarly publications learning objects research data this talk focuses on the first of these, but with the intention that most of what I say will be generic
Repository architecture largely institutional focus though some exceptions – arXiv, RePEC, JORUM, etc. interoperability through centralised aggregators (national and global) search services (OAIster, Intute, …) registries (DOAR, ROAR, …) harvesting metadata about content using OAI-PMH (metadata = simple Dublin Core) content = PDF SWORD as deposit API
What’s “wrong” with where we are now? what’s “ wrong ” with where we are now?
#1 We talk about “repositories”…
…rather than “the Web” a focus on ‘ making content available on the Web’  would be more intuitive to researchers
Whatever happened to the CMS? a focus on ‘ content management ’ would change our emphasis OAI-PMH out… search engine optimisation, usability, accessibility, Web design, tagging, information architecture, cool URIs in…
#2 We don’t emphasise… Google indexing RSS feeds widget technology – embedding functionality into other sites
#3 Our focus is on sharing metadata… … even though we have full-text to share worse… the full-text we share tends to be PDF rather than native Web format the Web equivalent of a cul de sac and the metadata we share tends to be “simple Dublin Core” little consistency in approaches to describing ‘files’ vs. ‘documents’ little consistency in naming authors and subjects ultimately, it is both too simple and too complex!
#4 We ignore the Web Architecture we have tended to adopt service oriented approaches in line with long tradition from Z39.50 to SOAP/WSDL e.g. JISC eFramework focus is on building “services on content” rather than on the “content” pbo31 @ flickr
REST is good we don’t tend to adopt a resource oriented approach we don’t adopt REST – an architectural style with a focus on resources, their identifiers (e.g. URIs), and a simple uniform set of operations that each resource supports (e.g. GET, PUT, POST, DELETE) we don’t encourage a Web style “follow your nose” approach
#5 We are antisocial… …  at least, we tend to treat “content” in isolation from the “social networks” that need to grow around that content successful “repositories” (Flickr, YouTube, Slideshare, etc.) promote the social activity that takes place around content as well as the content management and disclosure activity friends, groups, social tagging, comments, embedding, re-purposing, etc.
But not just about functionality… the institutional approach has fundamental mismatch with the real-life social networks adopted by researchers subject-based cross-institutional global while institutional approach is good from perspective of institutional management, preservation, etc. globally “concentrated” repositories might better reflect the social networks that need to arise
The net effect… … is that there is no net effect repositories remain uncompelling places to disclose scholarly publications from POV of the researcher perceived cost of deposit remains higher than perceived benefits we resort to institutional or funder mandates, “thou shalt deposit”, to fill what would otherwise remain empty
Wait just a minute… didn’t we used to have globally “concentrated” repository services? arXiv – the first Web 2.0 service? invented before the Web unfortunately, also invented before Amazon S3 i.e. before we knew how to scale things
Wait just another minute… … doesn’t the blogsphere successfully layer a set of globally concentrated services over a distributed network of content? e.g. Technorati yes… but… the content is under the control of ‘individuals’ rather than ‘institutions’, and… the interoperability “glue” (RSS and tagging) is very lightweight and RESTful
Having the conversation is hard highly political space strong “open access” voices who, understandably, don’t want their agenda de-railed by discussion about preservation search engine optimisation Web 2.0 social networks semantic Web the future of peer review it can be hard to get the conversation started
What can we do about it? what can we do about it?
Things can go two ways… I think that things can go two ways… The Web 2.0 Way or The Semantic Web Way … possibly both
Things can go two ways… what would a Web 2.0 repository look like?
Like this?
A Web 2.0 repository? high-quality browser-based document viewer (not Acrobat!) tagging, commentary, more-like-this, favorites, … persistent (cool) URIs to content ability to form simple social groups ability to embed documents in other Web sites high visibility to Google offer RSS as primary API use of Amazon S3 to cope with scalability
In short… we go “simple” we develop simple(ish) repositories and complex aggregators and search engines RSS/Atom as primary “glue” social tagging as “description” full-text indexing microformats Google Sitemaps to guide harvesters to content complex functional requirements (e.g. author disambiguation) either ignored or met thru complexity in aggregators
Alternatively… we go “complex” …we look to the Semantic Web we create and share much richer metadata about scholarly publications than we do currently we explicitly model complexity (a la FRBR) and aggregations we expose resulting metadata thru the SW “graph”
We go “complex”... SWAP and ORE
We go “complex”… SWAP – Scholarly Works Application Profile an application of the Dublin Core Abstract Model and Application Profiles capturing relationships between works, expressions, manifestations, items and agents ORE – OAI Object Re-use and Exchange capturing relationships between aggregations and aggregated resources note that ORE not tied to specific entity in FRBR note that ORE implemented as profile of Atom
SWAP application profile model ScholarlyWork Expression 0..∞ isExpressedAs Manifestation isManifestedAs 0..∞ Copy isAvailableAs 0..∞ 0..∞ 0..∞ isCreatedBy isPublishedBy 0..∞ isEditedBy 0..∞ isFundedBy isSupervisedBy AffiliatedInstitution Agent
OAI ORE
Summary what can we learn from Web 2.0? user interface design matters global ‘concentration’ is an enabler of social interaction simple DC is both too simple and too complex richer DC application profiles such as SWAP and/or RDF applications like ORE may be a way forward but need to ensure that their use does not over-complicate user interfaces and workflows
A new vision? a new vision?
Flickr and digital cameras… didn’t just take the practice of photography and put it on the Web they fundamentally changed what photography was about
What’s our vision? the standards we adopt in the scholarly communication space… OAI-PMH, OpenURL, DOI, PDF, … are primarily about replicating in a Web world what we have always done on paper this is not surprising given the necessary inertia of the scholarly communication life-cycle but… do we need to re-envision scholarly communication as a true Web process? if so, what would a repository look like?
thank you

More Related Content

PPTX
Taylor & francis online ppt
PPT
Writing for emerald journals
PPTX
BIMM Sept 2021
PDF
Introduction to “Research Tools”: Tools for Collecting, Writing, Publishing, ...
PDF
Enhance Research Visibility by Tracking Citations
PDF
Analysis of Bibliometrics information for selecting the best field of study
PPTX
Improving the Transparency and Credibility of Open Access Publishing
PDF
Promote your research work on LinkedIn
Taylor & francis online ppt
Writing for emerald journals
BIMM Sept 2021
Introduction to “Research Tools”: Tools for Collecting, Writing, Publishing, ...
Enhance Research Visibility by Tracking Citations
Analysis of Bibliometrics information for selecting the best field of study
Improving the Transparency and Credibility of Open Access Publishing
Promote your research work on LinkedIn

What's hot (20)

PDF
Make your data discoverable on a data repository
PPTX
Publish or Perish - Realising Google Scholar's potential to democratise citat...
PPTX
Entomology 314 training
PDF
Introduction to the “Research Tools” for Research Methodology course
PDF
Establish your expertise with a science blog
PDF
Database & Academic Research
PPTX
Writing Seminar Rogers
PPTX
Writing Seminar Rogers Spring 2012
PDF
Microblogging for Enhancing the Research Accessibility
PPTX
Getting Published in academic journals: tips and tricks. 2015
PPTX
Citation metrics across disciplines - Google Scholar, Scopus, and the Web of ...
PDF
Elsevier researcher profiles and metrics that count - University of Balamand
PDF
Strategies to enhance research visibility, impact & citations by nader ale e...
PPTX
Library Services for Nursing Staff
PDF
Prepare a pre/post print of your documents for advertisement
PPTX
Literature Searching Tips and Tricks
PPT
Library Research 101
DOC
Prevod
PDF
Create and maintain an up-to-date ResearcherID profile
PPTX
Researcher Profiles and Networks Webinar
Make your data discoverable on a data repository
Publish or Perish - Realising Google Scholar's potential to democratise citat...
Entomology 314 training
Introduction to the “Research Tools” for Research Methodology course
Establish your expertise with a science blog
Database & Academic Research
Writing Seminar Rogers
Writing Seminar Rogers Spring 2012
Microblogging for Enhancing the Research Accessibility
Getting Published in academic journals: tips and tricks. 2015
Citation metrics across disciplines - Google Scholar, Scopus, and the Web of ...
Elsevier researcher profiles and metrics that count - University of Balamand
Strategies to enhance research visibility, impact & citations by nader ale e...
Library Services for Nursing Staff
Prepare a pre/post print of your documents for advertisement
Literature Searching Tips and Tricks
Library Research 101
Prevod
Create and maintain an up-to-date ResearcherID profile
Researcher Profiles and Networks Webinar
Ad

Viewers also liked (20)

PPT
SAGE Journals Online
PPTX
PPT
PPT
MLA International Bibliography
PPT
Engineer Inforamtion
PPT
PPTX
Faculty of 1000
PDF
Scholarly video journals to increase productivity in research and education
PPT
Mathscinet
DOC
Bmj
PPT
Mathscinet2
PDF
PsycINFO and PsycArticles
PPTX
PPT
Wolfram Alpha
PPT
BioMed Central's open data initiatives
PPT
MCC 2011 - Slide 24
PDF
Quick springer link training august 2010
SAGE Journals Online
MLA International Bibliography
Engineer Inforamtion
Faculty of 1000
Scholarly video journals to increase productivity in research and education
Mathscinet
Bmj
Mathscinet2
PsycINFO and PsycArticles
Wolfram Alpha
BioMed Central's open data initiatives
MCC 2011 - Slide 24
Quick springer link training august 2010
Ad

Similar to Web 2.0 and repositories - have we got our repository architecture right? (20)

PPT
Audio in a social Web of linked data
PPT
Open, social and linked - what do current Web trends tell us about the future...
PPT
Repositories thru the looking glass
PPT
2.0 for PLE
PPTX
Networking Repositories, Optimizing Impact: Georgia Knowledge Repository Meeting
PDF
From Open Access to Open Standards, (Linked) Data and Collaborations
PPT
Lk Sue Tr Embed
PPT
Lk Sue Tr Embed
PPT
Repositories and the Open Web
PPT
The Future for Educational Resource Repositories in a Web 2.0 World
PPTX
SPARC Repositories conference in Baltimore - Nov 2010
PPT
Web 2.0, Hip or Hype - A Library Perspective
PPTX
How do I aggregate oers
PPT
Re-usable metadata, re-usable content
PPT
Web 2.0 workshop
PPT
Hypertext2007 Carole Goble Keynote - "The Return of the Prodigal Web"
ODP
Web 2.0 Core Concepts, Applications, and Implications
PPTX
Cultural heritage collections in a web 2
PPTX
Next Generation Repositories
PPT
Semantic Web, NON-technically speaking
Audio in a social Web of linked data
Open, social and linked - what do current Web trends tell us about the future...
Repositories thru the looking glass
2.0 for PLE
Networking Repositories, Optimizing Impact: Georgia Knowledge Repository Meeting
From Open Access to Open Standards, (Linked) Data and Collaborations
Lk Sue Tr Embed
Lk Sue Tr Embed
Repositories and the Open Web
The Future for Educational Resource Repositories in a Web 2.0 World
SPARC Repositories conference in Baltimore - Nov 2010
Web 2.0, Hip or Hype - A Library Perspective
How do I aggregate oers
Re-usable metadata, re-usable content
Web 2.0 workshop
Hypertext2007 Carole Goble Keynote - "The Return of the Prodigal Web"
Web 2.0 Core Concepts, Applications, and Implications
Cultural heritage collections in a web 2
Next Generation Repositories
Semantic Web, NON-technically speaking

More from Eduserv Foundation (20)

PPT
Virtual worlds, real learning? The uptake of virtual worlds for teaching and ...
PPT
Virtual worlds in education - a story so far
PDF
OpenAthens LA 2.0: An introduction
PPT
The Eduserv Research Programme
PPT
Openid - an identity system for the open Web
PPT
Virtual worlds in education - a story so far
PPT
ORE and SWAP: Composition and Complexity
PPT
Using Second Life to support events
PPT
DC-2008 Tutorial: Basic Concepts
PPT
Both sides, now - are we builders or users of services in the cloud?
PPT
Does metadata matter?
PDF
Identity - Future Directions
PPT
Virtually where? Are 3-D virtual worlds such as Second Life having an impact ...
PPT
The JISC DC Application Profiles: Some thoughts on requirements and scope
PPT
Everything you wanted to know about Dublin Core metadata
PPT
Virtually educated - the reality of using Second Life and other virtual world...
PPT
Safer Children in a Digital World: a summary for children and young people
PPT
Digital Natives Run by Digital Immigrants: IT Services are Dead, Long Live IT...
PPT
The OAI ORE Project
PPT
Aggregation Identity & owl:sameAs
Virtual worlds, real learning? The uptake of virtual worlds for teaching and ...
Virtual worlds in education - a story so far
OpenAthens LA 2.0: An introduction
The Eduserv Research Programme
Openid - an identity system for the open Web
Virtual worlds in education - a story so far
ORE and SWAP: Composition and Complexity
Using Second Life to support events
DC-2008 Tutorial: Basic Concepts
Both sides, now - are we builders or users of services in the cloud?
Does metadata matter?
Identity - Future Directions
Virtually where? Are 3-D virtual worlds such as Second Life having an impact ...
The JISC DC Application Profiles: Some thoughts on requirements and scope
Everything you wanted to know about Dublin Core metadata
Virtually educated - the reality of using Second Life and other virtual world...
Safer Children in a Digital World: a summary for children and young people
Digital Natives Run by Digital Immigrants: IT Services are Dead, Long Live IT...
The OAI ORE Project
Aggregation Identity & owl:sameAs

Recently uploaded (20)

PDF
IFRS Notes in your pocket for study all the time
PDF
Ôn tập tiếng anh trong kinh doanh nâng cao
PPTX
Amazon (Business Studies) management studies
PDF
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
PDF
Business model innovation report 2022.pdf
PPTX
job Avenue by vinith.pptxvnbvnvnvbnvbnbmnbmbh
PDF
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
PDF
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
PPTX
Dragon_Fruit_Cultivation_in Nepal ppt.pptx
PDF
Nidhal Samdaie CV - International Business Consultant
PDF
Types of control:Qualitative vs Quantitative
PDF
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
PPTX
HR Introduction Slide (1).pptx on hr intro
PPTX
Business Ethics - An introduction and its overview.pptx
PPTX
New Microsoft PowerPoint Presentation - Copy.pptx
PPT
Chapter four Project-Preparation material
PDF
20250805_A. Stotz All Weather Strategy - Performance review July 2025.pdf
DOCX
Euro SEO Services 1st 3 General Updates.docx
PPTX
Probability Distribution, binomial distribution, poisson distribution
PDF
Dr. Enrique Segura Ense Group - A Self-Made Entrepreneur And Executive
IFRS Notes in your pocket for study all the time
Ôn tập tiếng anh trong kinh doanh nâng cao
Amazon (Business Studies) management studies
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
Business model innovation report 2022.pdf
job Avenue by vinith.pptxvnbvnvnvbnvbnbmnbmbh
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
Dragon_Fruit_Cultivation_in Nepal ppt.pptx
Nidhal Samdaie CV - International Business Consultant
Types of control:Qualitative vs Quantitative
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
HR Introduction Slide (1).pptx on hr intro
Business Ethics - An introduction and its overview.pptx
New Microsoft PowerPoint Presentation - Copy.pptx
Chapter four Project-Preparation material
20250805_A. Stotz All Weather Strategy - Performance review July 2025.pdf
Euro SEO Services 1st 3 General Updates.docx
Probability Distribution, binomial distribution, poisson distribution
Dr. Enrique Segura Ense Group - A Self-Made Entrepreneur And Executive

Web 2.0 and repositories - have we got our repository architecture right?

  • 1. Web 2.0 and repositories… … have we got our repository architecture right?
  • 2. Outline where are we now? what’s wrong with where we are now? what can we do about it? do we need a new vision?
  • 3. Where are we now? where are we now?
  • 4. What is a repository? a university-based institutional repository is a set of services that a university offers to the members of its community for the management and dissemination of digital materials created by the institution and its community members. It is most essentially an organizational commitment to the stewardship of these digital materials, including long-term preservation where appropriate, as well as organization and access or distribution . … An institutional repository is not simply a fixed set of software and hardware (Cliff Lynch, 2003)
  • 5. Repository “doing” words manage deposit disclose make openly available curate preserve
  • 6. Repository content all sorts… but most “academic” focus currently on scholarly publications learning objects research data
  • 7. Repository content all sorts… but most “academic” focus currently on scholarly publications learning objects research data this talk focuses on the first of these, but with the intention that most of what I say will be generic
  • 8. Repository architecture largely institutional focus though some exceptions – arXiv, RePEC, JORUM, etc. interoperability through centralised aggregators (national and global) search services (OAIster, Intute, …) registries (DOAR, ROAR, …) harvesting metadata about content using OAI-PMH (metadata = simple Dublin Core) content = PDF SWORD as deposit API
  • 9. What’s “wrong” with where we are now? what’s “ wrong ” with where we are now?
  • 10. #1 We talk about “repositories”…
  • 11. …rather than “the Web” a focus on ‘ making content available on the Web’ would be more intuitive to researchers
  • 12. Whatever happened to the CMS? a focus on ‘ content management ’ would change our emphasis OAI-PMH out… search engine optimisation, usability, accessibility, Web design, tagging, information architecture, cool URIs in…
  • 13. #2 We don’t emphasise… Google indexing RSS feeds widget technology – embedding functionality into other sites
  • 14. #3 Our focus is on sharing metadata… … even though we have full-text to share worse… the full-text we share tends to be PDF rather than native Web format the Web equivalent of a cul de sac and the metadata we share tends to be “simple Dublin Core” little consistency in approaches to describing ‘files’ vs. ‘documents’ little consistency in naming authors and subjects ultimately, it is both too simple and too complex!
  • 15. #4 We ignore the Web Architecture we have tended to adopt service oriented approaches in line with long tradition from Z39.50 to SOAP/WSDL e.g. JISC eFramework focus is on building “services on content” rather than on the “content” pbo31 @ flickr
  • 16. REST is good we don’t tend to adopt a resource oriented approach we don’t adopt REST – an architectural style with a focus on resources, their identifiers (e.g. URIs), and a simple uniform set of operations that each resource supports (e.g. GET, PUT, POST, DELETE) we don’t encourage a Web style “follow your nose” approach
  • 17. #5 We are antisocial… … at least, we tend to treat “content” in isolation from the “social networks” that need to grow around that content successful “repositories” (Flickr, YouTube, Slideshare, etc.) promote the social activity that takes place around content as well as the content management and disclosure activity friends, groups, social tagging, comments, embedding, re-purposing, etc.
  • 18. But not just about functionality… the institutional approach has fundamental mismatch with the real-life social networks adopted by researchers subject-based cross-institutional global while institutional approach is good from perspective of institutional management, preservation, etc. globally “concentrated” repositories might better reflect the social networks that need to arise
  • 19. The net effect… … is that there is no net effect repositories remain uncompelling places to disclose scholarly publications from POV of the researcher perceived cost of deposit remains higher than perceived benefits we resort to institutional or funder mandates, “thou shalt deposit”, to fill what would otherwise remain empty
  • 20. Wait just a minute… didn’t we used to have globally “concentrated” repository services? arXiv – the first Web 2.0 service? invented before the Web unfortunately, also invented before Amazon S3 i.e. before we knew how to scale things
  • 21. Wait just another minute… … doesn’t the blogsphere successfully layer a set of globally concentrated services over a distributed network of content? e.g. Technorati yes… but… the content is under the control of ‘individuals’ rather than ‘institutions’, and… the interoperability “glue” (RSS and tagging) is very lightweight and RESTful
  • 22. Having the conversation is hard highly political space strong “open access” voices who, understandably, don’t want their agenda de-railed by discussion about preservation search engine optimisation Web 2.0 social networks semantic Web the future of peer review it can be hard to get the conversation started
  • 23. What can we do about it? what can we do about it?
  • 24. Things can go two ways… I think that things can go two ways… The Web 2.0 Way or The Semantic Web Way … possibly both
  • 25. Things can go two ways… what would a Web 2.0 repository look like?
  • 27. A Web 2.0 repository? high-quality browser-based document viewer (not Acrobat!) tagging, commentary, more-like-this, favorites, … persistent (cool) URIs to content ability to form simple social groups ability to embed documents in other Web sites high visibility to Google offer RSS as primary API use of Amazon S3 to cope with scalability
  • 28. In short… we go “simple” we develop simple(ish) repositories and complex aggregators and search engines RSS/Atom as primary “glue” social tagging as “description” full-text indexing microformats Google Sitemaps to guide harvesters to content complex functional requirements (e.g. author disambiguation) either ignored or met thru complexity in aggregators
  • 29. Alternatively… we go “complex” …we look to the Semantic Web we create and share much richer metadata about scholarly publications than we do currently we explicitly model complexity (a la FRBR) and aggregations we expose resulting metadata thru the SW “graph”
  • 30. We go “complex”... SWAP and ORE
  • 31. We go “complex”… SWAP – Scholarly Works Application Profile an application of the Dublin Core Abstract Model and Application Profiles capturing relationships between works, expressions, manifestations, items and agents ORE – OAI Object Re-use and Exchange capturing relationships between aggregations and aggregated resources note that ORE not tied to specific entity in FRBR note that ORE implemented as profile of Atom
  • 32. SWAP application profile model ScholarlyWork Expression 0..∞ isExpressedAs Manifestation isManifestedAs 0..∞ Copy isAvailableAs 0..∞ 0..∞ 0..∞ isCreatedBy isPublishedBy 0..∞ isEditedBy 0..∞ isFundedBy isSupervisedBy AffiliatedInstitution Agent
  • 34. Summary what can we learn from Web 2.0? user interface design matters global ‘concentration’ is an enabler of social interaction simple DC is both too simple and too complex richer DC application profiles such as SWAP and/or RDF applications like ORE may be a way forward but need to ensure that their use does not over-complicate user interfaces and workflows
  • 35. A new vision? a new vision?
  • 36. Flickr and digital cameras… didn’t just take the practice of photography and put it on the Web they fundamentally changed what photography was about
  • 37. What’s our vision? the standards we adopt in the scholarly communication space… OAI-PMH, OpenURL, DOI, PDF, … are primarily about replicating in a Web world what we have always done on paper this is not surprising given the necessary inertia of the scholarly communication life-cycle but… do we need to re-envision scholarly communication as a true Web process? if so, what would a repository look like?