‘Aggregation as a tactic’ - to support discovery. Peter Burnhill & Stuart Macdonald, EDINA national data centre, University of Edinburgh. CERN workshop on Innovations in Scholarly Communication (OAI7), University of Geneva, 23 June 2011.
Context. The joint JISC / RLUK Resource Discovery Task Force (RDTF) Vision: “UK researchers and students will have easy, flexible, and ongoing access to content and services through a collaborative, aggregated and integrated resource discovery and delivery framework which is comprehensive, open and sustainable.” Making content more discoverable, both by people and machine, via a mixed economy of technological solutions. The Discovery Initiative aims to: engage stakeholders across libraries, archives and museums; build critical mass of open content to inspire others to participate; encourage development of ‘purposeful aggregations and compelling applications’ (mashing at the macro-level); exemplify what can be done across domains to free data and explore how to make that data work harder. No one-size-fits-all solution!
A key concept in the RDTF Vision is aggregation, directly or represented through metadata, to unlock the online & digital riches held in our organisations. Regard aggregation as intervention to exploit the telematic opportunity for things [that] are ‘remote, digital & published’: a phrase derived from an IASSIST conference in 1990 exploring what it meant with the Internet if we regarded all [content] as ‘remote and published’. The Web in the mid-1990s simplified and thus improved. Unfortunately, even now, much which is online and on the Web is badly or inadequately published … We have to improve, re-interpreting what it means to be ‘well-published’. ‘Aggregation as a tactic’: a phrase coined to end an impasse during a meeting to discuss technical aspects of the RDTF Vision statement to identify stakeholder groups.
The term aggregation is used a lot in computer science for: “objects … assembled or configured together to create a more complex object” (UML, IBM); “aggregating resources based on … properties. … they are owl:sameAs and their other properties can be intermixed.” For the purposes of RDTF, aggregation means: an assembly of data sources, more than a collection of objects (image banks, data services, catalogues, activity data), related or otherwise; for machine-as-user, independent of presentation layer. However, aggregation is neither a goal nor an end in itself. It is an intervention to be used for a twofold strategic purpose: ‘improvement’ (merge & match, customisation and consumption, multiple output formats, reduced duplication of effort) and ‘discoverability’ (via ‘promiscuous’ or ‘well-dressed’ metadata through e.g. Google or tailored services).
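The owl:sameAs idea quoted above can be illustrated with a minimal sketch. It assumes records are plain Python dictionaries of property sets and that sameAs assertions arrive as identifier pairs; the function name `merge_same_as` and the identifiers used with it are hypothetical, not part of any EDINA service or RDF library.

```python
from collections import defaultdict


def merge_same_as(records, same_as_pairs):
    """Merge records whose identifiers are asserted to be the same resource.

    records: dict mapping identifier -> dict of property name -> set of values
    same_as_pairs: iterable of (identifier, identifier) equivalence assertions
    """
    parent = {}

    def find(node):
        # Union-find lookup with path halving.
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    for left, right in same_as_pairs:
        root_left, root_right = find(left), find(right)
        if root_left != root_right:
            # Keep the lexically smaller identifier so the result is deterministic.
            keep, drop = sorted((root_left, root_right))
            parent[drop] = keep

    # Intermix the properties of every record in the same equivalence class.
    merged = defaultdict(lambda: defaultdict(set))
    for identifier, properties in records.items():
        root = find(identifier)
        for name, values in properties.items():
            merged[root][name].update(values)
    return {root: dict(props) for root, props in merged.items()}
```

Sets are used for property values so that duplicate statements from different sources collapse naturally, which is one small form of the ‘merge & match’ improvement named on the slide.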
Language & Perspectives. Digital Library has mixed parentage: a ‘re-mix’ of the document tradition & the computation tradition. “approaches based on a concern with documents, with signifying records: archives, bibliography, documentation, librarianship, records management, and the like …” [Content Provider speak]. “approaches based on uses of formal techniques, whether mechanical (such as punch cards and data-processing equipment) or mathematical/computational (as in algorithmic procedures).” [Developer speak]. Prof. Michael Buckland, Presidential Address, American Society for Information Science, JASIS’s 50th (1998) http://people.ischool.berkeley.edu/~buckland/asis62.html
Perspectives … as provider. EDINA develops and delivers JISC-sponsored national online services, adding value to data and content: Digimap Collections (OS mapping; SeaZone; BGS); NewsfilmOnline (various; digitised with JISC £); UK Access Management Federation (institutions; authentication). Data Library (move from support to middle folk): research data support for Edinburgh researchers; research data management guidelines, training, OER materials; Edinburgh DataShare (open data repository); RADAR (Researching A Data Asset Registry). Maybe as ‘middle folk’ (cf. those who deal in middleware): sometimes having the role of creator and supplier of some service, sometimes being the user of what others supply: ‘inter-operator’.
Perspective … as aggregator: developing and delivering JISC-sponsored aggregation services. JISC MediaHub: links to collections & hosted content (c. 1m resources); CultureGrid; First World War Poetry; Films of Scotland; Getty Images (all content searchable and viewable within JISC MediaHub). GoGeo!: metadata registry for spatially-referenced data; Geodoc metadata creation tool; ShareGeo Open. SUNCAT: serials union catalogue; 80 libraries; metadata/links to full text; download MARC records (& XML & SUTRS, Simple Unstructured Text Record Syntax, a data exchange format widely used in Z39.50). PEPRS: e-journal preservation registry, jointly led by EDINA with the ISSN International Centre; metadata registry of available back copy e-journals, aggregated from preservation agencies (incl. British Library, UK LOCKSS Alliance, CLOCKSS).
Some RDTF-related projects @ EDINA. GoGeo Linked Data (GOLD): triplify INSPIRE-compliant metadata to improve discoverability of metadata records via search engines. SUNCAT: Exploring Open [bibliographic] Metadata (working with OKF to open up data sent by contributing libraries; convert to RDF). Sharing OpenURL Activity Data: monthly usage data (date & time; anonymised IP address/inst. ID; title; author; ISSN; DOI). Uses: article/journal recommendations, publishers reviewing what content is of interest to specific communities, innovative services to meet users’ needs. CHALICE: use data mining to extract placenames from the English Place Name Survey to create a UK historic gazetteer published as Linked Data, and link it to the Geonames ontology on the semantic web. AddressingHistory: geo-parsing of Scottish Post Office Directories; API onto digitised content; output in XML, CSV, JSON. 3 further case studies on other EDINA services illustrating how other collections can benefit from the same techniques.
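To make ‘triplify’ concrete, here is a minimal sketch of turning one flat metadata record into N-Triples lines. It is not the GOLD project's code: the record shape, the `triplify` name, and the choice to map every field name directly onto a Dublin Core Terms property are assumptions made for illustration, and real INSPIRE metadata would need a proper crosswalk.

```python
DCTERMS = "http://purl.org/dc/terms/"


def escape_literal(value):
    """Escape the characters that N-Triples does not allow raw in a literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def triplify(subject_uri, record):
    """Return one N-Triples line per field of a flat metadata record.

    Assumption: each key in `record` is the local name of a Dublin Core
    Terms property (for example "title" or "spatial"). Values that look
    like HTTP URIs become resource links; everything else is a literal.
    """
    lines = []
    for field, value in sorted(record.items()):
        predicate = DCTERMS + field
        if value.startswith(("http://", "https://")):
            obj = f"<{value}>"
        else:
            obj = f'"{escape_literal(value)}"'
        lines.append(f"<{subject_uri}> <{predicate}> {obj} .")
    return lines
```

Emitting URIs rather than strings wherever a value identifies a place or thing is what lets a record link out to other datasets, as CHALICE does with Geonames.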
The end is the start of a new beginning … In earlier ‘web time’ we had the MODELS ‘user-verbs’: Discover -> Locate -> Request -> Access (Deliver). Dempsey, Russell & Murray (1999) http://www.ukoln.ac.uk/dlis/models/publications/utopia/ where Access was the end game for us ‘middle folk’, even if the beginning & part of a deeper process for researchers, students … Now there is a call for more than bilateral & negotiated interoperability, where Access is the beginning for developers and for other services. RDF/Linked Data enables information to be shared in a more Web-friendly way. RDF/Linked Data enables structure and content of those data sources to be explicit: vocabularies, ontologies, relationships. Exposing the complexity and relationship in the underlying data, hanging the insides on the outside!
The treasures are on show inside, but … Centre Pompidou
… and so to summarise. Early web approaches focused on making content accessible for humans: hiding the complexity and relationship in the underlying data; paying attention to the user interface (HCI & GUI; usability and accessibility). However, to ensure content gets noticed it must be made easier for machines to understand by: exposing the complexity and relationship in the underlying data; having in mind the machine-as-user (API as well as HCI). Aggregation should be seen as intervention, with strategic purpose: to engage in value-added improvement of content; to enhance the discoverability of that which is ‘aggregated’; to be a focus of attention (through promiscuous metadata!). If it is with RDF, then that’s good; don’t make a fuss if not. Publish RDBMS schemas, catalogue records, codebooks, and ancillary or related content in multiple, machine-readable formats.
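As a rough illustration of publishing the same content in multiple machine-readable formats, the sketch below serialises a list of flat records to JSON, CSV and XML using only the Python standard library. The function names and record shape are hypothetical, and the XML writer assumes every key is a valid XML element name.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET


def to_json(records):
    """Serialise records as a JSON array with stable key order."""
    return json.dumps(records, indent=2, sort_keys=True)


def to_csv(records):
    """Serialise records as CSV; the header is the sorted union of all keys."""
    fieldnames = sorted({key for record in records for key in record})
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)  # keys missing from a record are left empty
    return buffer.getvalue()


def to_xml(records):
    """Serialise records as <records><record>...</record></records>."""
    root = ET.Element("records")
    for record in records:
        element = ET.SubElement(root, "record")
        for key, value in sorted(record.items()):
            ET.SubElement(element, key).text = str(value)
    return ET.tostring(root, encoding="unicode")
```

Keeping one in-memory representation and several thin serialisers means a new output format can be added without touching the source data, which suits the ‘don’t make a fuss’ attitude to format choice.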
The Many Minds principle: “the coolest thing to do with your data will be thought of by someone else”. Using data as the building platform. Jo Walsh & Rufus Pollock (2007-05-17). Open Data and Componentization. XTech 2007 (slide 14). “Benefits of freeing data are many, arguably being the most relevant one the ‘Many Minds principle’: there’ll always be someone that will find out a way to reuse data that you wouldn’t have even figured.” José Manuel Alonso, Notes from the 5th Internet, Law and Politics Conference: The Pros and Cons of Social Networking Sites, organized by the Open University of Catalonia, School of Law and Political Science, and held in Barcelona, Spain, on July 6th and 7th, 2009.
http://edina.ac.uk/ Repository Fringe 2011, call for participants: http://www.repositoryfringe.org/ THANK YOU. CC BY-NC-ND 2.0, image by enggul courtesy of Flickr: http://www.flickr.com/photos/enggul/2361808668/

Aggregation as Tactic


Editor's Notes

  • #2: Based on a presentation by Peter Burnhill at the launch of Discovery, “a UK metadata ecology for UK education and research”, in May 2011. JISC consolidation of service.
  • #3: Creating new or novel knowledge products, increasing serendipity, cross-fertilisation of resources. A metadata ecology for UK education and research. No one-size-fits-all solutions: mixed economies (technical and subject expertise, infrastructures, cultural differences across domains and organisations).
  • #4: A key concept in the RDTF Vision is aggregation, directly or represented through metadata: to unlock riches held in our organisations, typically digital and online. This recognises the added value in assembly for tactical purpose: to improve ‘discoverability’. The ultimate aim is to improve access for research and educational purposes, for researchers, students and their teachers, with few barriers to potential take-up beyond that. Promiscuous metadata, washing your dirty metadata in public, making sure your metadata is well dressed! It’s very easy to publish on the web. Maximising the full potential of that content is another matter entirely.
  • #5: The audience for any given work, service or aggregation is now machine as well as human. Make content easier to use in the global information ecosystem, reducing technical and licensing barriers; potential to create new knowledge products or services; foster social innovation. Let your metadata speak for you when you have no one to speak for you. Removal of duplication of effort.
  • #6: The process of disintermediation
  • #7: Technical support provided by EDINA SDSS Expert Group in Identity and Access Management. Keep at a strategic level rather than diving into detailed issues and concerns. Orientate towards opportunities and feasible steps, however small those might be.
  • #8: Restricted and unrestricted content outside JISC MediaHub; collections inside JISC MediaHub available under subscription.
  • #9: SUNCAT: getting permission from contributing libraries to make data available under ODC PDDL. Carmichael Watson, Gaelic folklorist: digitising his greatest work, Carmina Gadelica, an anthology of Hebridean charms, hymns, songs and poetry.
  • #10: Aggregation should be regarded as intervention to achieve value-added improvement and context as an aid to discoverability. Among their recommendations is that “such Aggregations should have supported APIs which are attractive to and convenient for developers”. Interpretation of other recommendations suggests that the Framework should: (a) assist Aggregators (extant and as funded aggregation projects) to demonstrate value to Content Providers were they to progressively follow each of the four Linked Data steps; (b) encourage Content Providers to provide a semantic sitemap prior to data aggregation, e.g. publication of the RDF schema of the underlying database, from a registry of recommended schemas provided as guidance by Aggregation services. http://www.w3.org/DesignIssues/LinkedData
  • #12: Multi-part work: data is meaningless without ancillary material (which provides both context & meaning); ancillary material in machine-readable formats. Publish your DDI-compliant XML codebook.
  • #13: An extension of one of Tim O’Reilly’s 7 principles of Web 2.0 (2005): Harnessing Collective Intelligence. Bearing in mind that there is a cost to keeping data closed! Open data are really open when they can ‘always’ be reused!