SlideShare a Scribd company logo
Being a Good Data Provider  Alastair Dunning JISC Programme Manager - Digitisation a.dunning AT jisc.ac.uk , 0203 006 6065 March 2011, Oxford This presentation is intended to give some brief advice for those publishing digital content (digital images, cultural heritage, scholarly information etc.) on the Internet
Outline Being a Good Data Provider: A simple thing gets complex Cool URIs Being Friends with Google, Is Google Enough? International Portals Geographies Re-Use and APIs Licensing
Cool URIs http://www.ariadne.ac.uk/issue31/web-focus/ URI ( Uniform Resource Identifier ) refers to the "generic set of all names/addresses that are short strings that refer to resources" whereas URI ( Uniform Resource Locator ) is "an informal term (no longer used in technical specifications) associated with popular URI schemes: http, ftp, mailto, etc.“ Keep them  stable ,  memorable  and  consistent – develop a short URI policy
Cool URIs Where do URIs get quoted? – Often taken out of their environment Publicity material – expensive to reprint Academic Citations – damages scholarly trust (plus citation guidelines?) Bookmarks within browser or on social bookmarking sites Emails (therefore less than 76 characters, avoid underscores) Blogs and other URIs By search engines – loss will inhibit resource discovery  Guesswork – users make guesses at URIs – use redirects and good 404 pages Good Example – BBC website Bad Example …
Item level not  collection level Users may have no interest in the general resource but plenty of interest in a particular item Designing Shakespeare  – Shakespeare performed in London & Stratford, 1960 – 2000, 1000s of plays Researchers & teachers interested in general resource Actors interested in  specific performances . Needed  stable URIs  for cast lists and photos
Being friends with Google No need to explain the importance in exposing content to metadata –  many users have Google as their principal springboard  for digital information Even if using authentication,  expose metadata Make sure your database is  easily queried by robots  like Google  Optimisation is complex and depends  on good communications process Use established URIs – Ensure your website is trusted Get incoming links from other trusted sources – this drives up traffic via Google and via the original sites themselves Strategic Content Alliance / Netskills training and documentation
Being friends with Google Give distinctive <title> to each page – helps with clarity on Google Use Google Sitemaps to upload details of your pages Google Analytics can help with measuring web usage Google Maps, Google Scholar? http://www.google.com/publicsector
Is Google everything? Recommendation by peers and other respected persons gets resources used Marketing a resource is an integrated strategy to marketing which involves technical and ‘academic’ integration Workshop will be held in this area for all JISC projects in this programme Source – Lesly Huxley et al (2007):  Gathering evidence: Current ICT use and future needs for arts and humanities researchers
Is Google everything? How is your collection integrated into library catalogue? How does your resource fit in with other resources? Source – Mark Greengrass et al  (2007):  RePAH: A User Requirements Analysis for Portals in the Arts and Humanities “ Resource discovery and use would be increased by separate collections being aggregated logically based on their content” Recommendation 3 – Daisy Abbott (2008):  Digital Repositories and Archives Inventory
Working with Aggregators CultureGrid -  http://www.culturegrid.org.uk/ UK aggregator cultural heritage material Large-scale harvest of digital resources Works well for images and multimedia Culture grid then exposes metadata to Europeana WorldCat -  http://www.worldcat.org/librarians/default.jsp Bibliographic data  - both digital and not digital  Metadata exposed via Registry of Digital Masters Requires membership – so best done via institution
Aggregators Other options Archives Hub,  http://archiveshub.ac.uk/ Connected Histories , British History 1500 – 1900 JISC Historic Books, JISC MediaHub Other options exist and will emerge, particularly within specific subject fields and areas of interest.  Key is to have easily exposable or transferable metadata
Geographies “ 80% of data has a geographical component” … possibly Lists, text, word can be confusing to navigate Maps have a simplicity which many, but not all, find engaging Examples - BL Sound Archive, Population Reports online, Flickr It’s about visualising your data in different ways … time is also a powerful metaphor
Geographies
Application Programming Interfaces (API) “ The best use of your data will be thought of by someone else” Separating data from its interface Publishing each strand of metadata as a separate URI Allows others to build interfaces over your data (and edit / annotate your data, if you want) Requires certain amount of technical knowledge in setting up and institutional belief Good example –  http://www.vam.ac.uk/api
Licensing A different challenge for re-use – making sure people know what they can do with your content Licensing in – clearing third party rights Licensing out – what can your users do Possibilities – re-use in educational context, remashing (including editing, cropping, rearranging), commercial use, anything, attribution Various existing licence s– worth exploring Creative Commons Other options may be required for third-party material Clarity over this is essential to avoid user confusion and legal ramifications But all JISC projects must indicate what can be done to their content
In Summary  Irrespective of the type of content  ... Cool URIs Being Friends with Google, Is Google Enough? International Portals Geographies Re-Use and APIs Licensing

More Related Content

PPT
Being a Good Data Provider, by Alastair Dunning
PPTX
SEAD: Opening Data in the "Long Tail" for Active and Social Curation
PPTX
PDF
DataShare for UC Campuses
PPTX
A Lined Data Approach to Interoperability between Biomedical Resource Invento...
PDF
Mapping the Repository Landscape
PPTX
Collections meet the researcher. Digitalization, disintegration and disillusi...
PPTX
Data Library Services at the University of Edinburgh
Being a Good Data Provider, by Alastair Dunning
SEAD: Opening Data in the "Long Tail" for Active and Social Curation
DataShare for UC Campuses
A Lined Data Approach to Interoperability between Biomedical Resource Invento...
Mapping the Repository Landscape
Collections meet the researcher. Digitalization, disintegration and disillusi...
Data Library Services at the University of Edinburgh

What's hot (20)

PPTX
Research Data Management and Spatial Data
PPTX
PPT
Semantic.edu, an introduction
PPT
Kickoff Meeting for the Interoperable Geographic Information for Biosphere St...
PPTX
Geospatial metadata and spatial data workshop: 19 June 2014
PPTX
The role of virtual research environments (VRE's) within the context of an e-...
PPT
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
PDF
SDI – National to Global: perspectives from the UK academic sector
PPTX
AReS and Altmetrics: How we use them at ILRI
PPT
AddressingHistory: Lessons and Messages
PDF
Delivering Postgraduate Training - MANTRA
PPT
GeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
PPTX
Introduction to Edinburgh University Data Library and national data services
PPT
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
PPT
COBWEB, AIP-6, and Access Management Federations
PPT
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
PPTX
#1 FAIR: Into to FAIR and F for Findable
PPT
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
PPTX
Research Data and the Future of Software Engineering
PPTX
Fieldtrip GB: A customisable mapping and data capture app
Research Data Management and Spatial Data
Semantic.edu, an introduction
Kickoff Meeting for the Interoperable Geographic Information for Biosphere St...
Geospatial metadata and spatial data workshop: 19 June 2014
The role of virtual research environments (VRE's) within the context of an e-...
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
SDI – National to Global: perspectives from the UK academic sector
AReS and Altmetrics: How we use them at ILRI
AddressingHistory: Lessons and Messages
Delivering Postgraduate Training - MANTRA
GeoChronos: An On-line Collaborative Platform for Earth Observation Scientists
Introduction to Edinburgh University Data Library and national data services
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
COBWEB, AIP-6, and Access Management Federations
Some Academic Sector/NMCA outcomes from the OGC Web Service Shibboleth Intero...
#1 FAIR: Into to FAIR and F for Findable
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Research Data and the Future of Software Engineering
Fieldtrip GB: A customisable mapping and data capture app
Ad

Similar to Being A Good Data Provider (20)

PDF
Linked data and the future of scientific publishing
PPTX
Improving usage and impact of digitised resources
PPTX
Improving usage and impact of digitised resources
PDF
Digital Convergence at the AWM
PPTX
Swiss_GLAM_Survey_presentation_20130412
PPTX
Introduction to APIs and Linked Data
PDF
Open for Reuse: Library data and mashups
KEY
Technical standards & the RDTF Vision: some considerations
PPTX
User Participation in Digital Library Development
PPT
Digitisation Infrastructure - June 2007
PPT
Bits To Blogs 050209
PPT
24 Hour Museum - Portal To Publisher
PPT
The Archives Forum - The National Archives - 02 March 2011
PPT
Developments in Access to Art Information: EnCompass Digital Portal. 2003
PPT
Web2 Oct08
PPT
Web2 Oct08
PPT
Linking Data the ALM way (Boris Zetterlund)
PPTX
Boutiques, Shopping Malls and Specialist Shops
PPTX
Finding Resources for The MITCET Chemistry Bridge Project
PPT
5 steps to becoming a JISC IE content provider
Linked data and the future of scientific publishing
Improving usage and impact of digitised resources
Improving usage and impact of digitised resources
Digital Convergence at the AWM
Swiss_GLAM_Survey_presentation_20130412
Introduction to APIs and Linked Data
Open for Reuse: Library data and mashups
Technical standards & the RDTF Vision: some considerations
User Participation in Digital Library Development
Digitisation Infrastructure - June 2007
Bits To Blogs 050209
24 Hour Museum - Portal To Publisher
The Archives Forum - The National Archives - 02 March 2011
Developments in Access to Art Information: EnCompass Digital Portal. 2003
Web2 Oct08
Web2 Oct08
Linking Data the ALM way (Boris Zetterlund)
Boutiques, Shopping Malls and Specialist Shops
Finding Resources for The MITCET Chemistry Bridge Project
5 steps to becoming a JISC IE content provider
Ad

More from Alastair Dunning (20)

PPTX
The Landscape of Research Data Management
PPTX
Open Access in Architectural Research
PPTX
Winning the Tour de France, Research Data and Data Stewardship
PPT
Launch of Welsh Newspapers Online
PPT
Future Directions of the European Library
PPT
Digital Libraries: Local and Global
PPTX
Risk Management
PPTX
Crowdsourcing as Public Engagement
PPTX
JISC Digging into Data
PPTX
Digitised Content in an API world
PPTX
Joining Up Content
PPTX
Impact, Aggregation, Partnership
PPT
Paola Marchionni, Working with JISC
PPTX
Partnership and Collaboration, Sarah Fahmy
PPTX
Fahmy partnerships
PPT
Working with JISC
PPTX
Summary of the Programme Meeting by Catherine Grout
PPT
How JISC Projects are Funded and Sustained (2010 version)
PDF
How JISC projects are funded and sustained
PPT
Digital Content – The British Library Vision
The Landscape of Research Data Management
Open Access in Architectural Research
Winning the Tour de France, Research Data and Data Stewardship
Launch of Welsh Newspapers Online
Future Directions of the European Library
Digital Libraries: Local and Global
Risk Management
Crowdsourcing as Public Engagement
JISC Digging into Data
Digitised Content in an API world
Joining Up Content
Impact, Aggregation, Partnership
Paola Marchionni, Working with JISC
Partnership and Collaboration, Sarah Fahmy
Fahmy partnerships
Working with JISC
Summary of the Programme Meeting by Catherine Grout
How JISC Projects are Funded and Sustained (2010 version)
How JISC projects are funded and sustained
Digital Content – The British Library Vision

Recently uploaded (20)

PDF
HVAC Specification 2024 according to central public works department
PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
DOCX
Cambridge-Practice-Tests-for-IELTS-12.docx
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PPTX
Computer Architecture Input Output Memory.pptx
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
PDF
Uderstanding digital marketing and marketing stratergie for engaging the digi...
PPTX
History, Philosophy and sociology of education (1).pptx
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PPTX
Introduction to pro and eukaryotes and differences.pptx
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PDF
Hazard Identification & Risk Assessment .pdf
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PDF
advance database management system book.pdf
PDF
Environmental Education MCQ BD2EE - Share Source.pdf
HVAC Specification 2024 according to central public works department
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
Unit 4 Computer Architecture Multicore Processor.pptx
Cambridge-Practice-Tests-for-IELTS-12.docx
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
Computer Architecture Input Output Memory.pptx
FORM 1 BIOLOGY MIND MAPS and their schemes
Uderstanding digital marketing and marketing stratergie for engaging the digi...
History, Philosophy and sociology of education (1).pptx
AI-driven educational solutions for real-life interventions in the Philippine...
202450812 BayCHI UCSC-SV 20250812 v17.pptx
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Share_Module_2_Power_conflict_and_negotiation.pptx
B.Sc. DS Unit 2 Software Engineering.pptx
Introduction to pro and eukaryotes and differences.pptx
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
Hazard Identification & Risk Assessment .pdf
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
advance database management system book.pdf
Environmental Education MCQ BD2EE - Share Source.pdf

Being A Good Data Provider

  • 1. Being a Good Data Provider Alastair Dunning JISC Programme Manager - Digitisation a.dunning AT jisc.ac.uk , 0203 006 6065 March 2011, Oxford This presentation is intended to give some brief advice for those publishing digital content (digital images, cultural heritage, scholarly information etc.) on the Internet
  • 2. Outline Being a Good Data Provider: A simple thing gets complex Cool URIs Being Friends with Google, Is Google Enough? International Portals Geographies Re-Use and APIs Licensing
  • 3. Cool URIs http://www.ariadne.ac.uk/issue31/web-focus/ URI ( Uniform Resource Identifier ) refers to the &quot;generic set of all names/addresses that are short strings that refer to resources&quot; whereas URI ( Uniform Resource Locator ) is &quot;an informal term (no longer used in technical specifications) associated with popular URI schemes: http, ftp, mailto, etc.“ Keep them stable , memorable and consistent – develop a short URI policy
  • 4. Cool URIs Where do URIs get quoted? – Often taken out of their environment Publicity material – expensive to reprint Academic Citations – damages scholarly trust (plus citation guidelines?) Bookmarks within browser or on social bookmarking sites Emails (therefore less than 76 characters, avoid underscores) Blogs and other URIs By search engines – loss will inhibit resource discovery Guesswork – users make guesses at URIs – use redirects and good 404 pages Good Example – BBC website Bad Example …
  • 5. Item level not collection level Users may have no interest in the general resource but plenty of interest in a particular item Designing Shakespeare – Shakespeare performed in London & Stratford, 1960 – 2000, 1000s of plays Researchers & teachers interested in general resource Actors interested in specific performances . Needed stable URIs for cast lists and photos
  • 6. Being friends with Google No need to explain the importance in exposing content to metadata – many users have Google as their principal springboard for digital information Even if using authentication, expose metadata Make sure your database is easily queried by robots like Google Optimisation is complex and depends on good communications process Use established URIs – Ensure your website is trusted Get incoming links from other trusted sources – this drives up traffic via Google and via the original sites themselves Strategic Content Alliance / Netskills training and documentation
  • 7. Being friends with Google Give distinctive <title> to each page – helps with clarity on Google Use Google Sitemaps to upload details of your pages Google Analytics can help with measuring web usage Google Maps, Google Scholar? http://www.google.com/publicsector
  • 8. Is Google everything? Recommendation by peers and other respected persons gets resources used Marketing a resource is an integrated strategy to marketing which involves technical and ‘academic’ integration Workshop will be held in this area for all JISC projects in this programme Source – Lesly Huxley et al (2007): Gathering evidence: Current ICT use and future needs for arts and humanities researchers
  • 9. Is Google everything? How is your collection integrated into library catalogue? How does your resource fit in with other resources? Source – Mark Greengrass et al (2007): RePAH: A User Requirements Analysis for Portals in the Arts and Humanities “ Resource discovery and use would be increased by separate collections being aggregated logically based on their content” Recommendation 3 – Daisy Abbott (2008): Digital Repositories and Archives Inventory
  • 10. Working with Aggregators CultureGrid - http://www.culturegrid.org.uk/ UK aggregator cultural heritage material Large-scale harvest of digital resources Works well for images and multimedia Culture grid then exposes metadata to Europeana WorldCat - http://www.worldcat.org/librarians/default.jsp Bibliographic data - both digital and not digital Metadata exposed via Registry of Digital Masters Requires membership – so best done via institution
  • 11. Aggregators Other options Archives Hub, http://archiveshub.ac.uk/ Connected Histories , British History 1500 – 1900 JISC Historic Books, JISC MediaHub Other options exist and will emerge, particularly within specific subject fields and areas of interest. Key is to have easily exposable or transferable metadata
  • 12. Geographies “ 80% of data has a geographical component” … possibly Lists, text, word can be confusing to navigate Maps have a simplicity which many, but not all, find engaging Examples - BL Sound Archive, Population Reports online, Flickr It’s about visualising your data in different ways … time is also a powerful metaphor
  • 14. Application Programming Interfaces (API) “ The best use of your data will be thought of by someone else” Separating data from its interface Publishing each strand of metadata as a separate URI Allows others to build interfaces over your data (and edit / annotate your data, if you want) Requires certain amount of technical knowledge in setting up and institutional belief Good example – http://www.vam.ac.uk/api
  • 15. Licensing A different challenge for re-use – making sure people know what they can do with your content Licensing in – clearing third party rights Licensing out – what can your users do Possibilities – re-use in educational context, remashing (including editing, cropping, rearranging), commercial use, anything, attribution Various existing licence s– worth exploring Creative Commons Other options may be required for third-party material Clarity over this is essential to avoid user confusion and legal ramifications But all JISC projects must indicate what can be done to their content
  • 16. In Summary Irrespective of the type of content ... Cool URIs Being Friends with Google, Is Google Enough? International Portals Geographies Re-Use and APIs Licensing