The Names ProjectDan Needham & Phil Cross04-08-2011
BackgroundJohn SmithSmith, J.J C SmithJ Hall née Smith  ???Smith, J. B.Smith, JoanneJoanne Clare Hall
Data In‹‹#RAENamesDataZetocDisambiguationEPrints
DisambiguationCox, B.E.Physics“Double defractiveHiggs…”ManchesterProf, OBEMaxfield, S.J.Co-authorImage: Paul Clarke@flickr
Data OutJSONRDFAPINamesDataMARKXMLHTML
Image: Bob Lee@flickr
Use casesNames
Current recordsHow many records permanent and where from?46840 Records made permanent~ 30 MillionRecords inProgressThousands more on the way
What’s nextProcessing the zetoc data!ISNIORCID…Other IDsMore data!NamesUsage!Data Ownership
Thanks Brian!http://names.mimas.ac.uk/individual/35219Image: unimancschools@flickr
Names for Repositories- EPrints plugin- Extracting researcher data from EPrints- Manual submission of researcher data to Names
EPrints Plugin- Augments name auto-completion with 			Names API- Adds Person URI to repositoryCox, B.T.http://names.mimas.ac.uk/individual/38855Repository
Plugin in Action
Plugin in Action
Plugin in Action
Plugin in Action
Future PluginsResearcher: export your own information to NamesAdmin: global editing of existing info to Names formatRoberts, L.R.http://names.mimas.ac.uk/individual/38855http://names.mimas.ac.uk/individual/55665Repository
Importing data from EPrintsRDF
Submission to NamesDisambiguationNames Data
Infonames.mimas.ac.ukamanda.hill@manchester.ac.ukdaniel.needham@manchester.ac.ukphilip.cross@manchester.ac.uk

More Related Content

PDF
Science and Engineering Resources: Physics and Astronomy
PDF
Publication and Dissemination of Data
PPTX
KM SHOWCASE 2019 - Findability Strategies
PDF
Changing The Way We Discover Research
PDF
Class7 feb21-2011
PPT
Alan Cope (De Montfort University) – EXPLORER (create workflows and processes...
PPTX
Ben Ryan (University of Leeds) – Timescapes Project
PPTX
Jodie Double (Univ. Leeds) – RePosit
Science and Engineering Resources: Physics and Astronomy
Publication and Dissemination of Data
KM SHOWCASE 2019 - Findability Strategies
Changing The Way We Discover Research
Class7 feb21-2011
Alan Cope (De Montfort University) – EXPLORER (create workflows and processes...
Ben Ryan (University of Leeds) – Timescapes Project
Jodie Double (Univ. Leeds) – RePosit

More from Repository Fringe (20)

PPTX
Unlocking Thesis Data - Stephen Grace, University of East London
PPTX
Integration - the heart of researcher centric research data management system...
PPTX
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheon
PPTX
Repositories for OA, RDM and Beyond - Rory McNicholl
PDF
RSpace - Rory Macneil at Repository Fringe 2015
PPTX
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
PPTX
Building data networks: exploring trust and interoperability between authoris...
PPTX
Jisc on repositories unleashing data - Daniela Duca
PDF
IRUS-UK at Repository Fringe 2015 - Jo Alcock
PPTX
Impact and EPrints - Rosie-Marie Barbeau and Mick Eadie
PPTX
Open Data and Sharing Science - Graham Steel, Contentmine
PPTX
SHERPA Services breakout session - Bill Hubbard
PPTX
REF compliance - what Jisc is doing
PPTX
RCUK - what Jisc is doing
PPTX
Linking Software: citations, roles, references and more
PPTX
Jisc Publications Router
PDF
Linking Research Outputs - Rachel Kotarski
PPTX
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
PPTX
Latest developments in Hydra-land - Chris Awre, University of Hull
PDF
ArchivesSpace - Scott Renton, University of Edinburgh
Unlocking Thesis Data - Stephen Grace, University of East London
Integration - the heart of researcher centric research data management system...
Open Access workshop at Repository Fringe 2015 - Valerie McCutcheon
Repositories for OA, RDM and Beyond - Rory McNicholl
RSpace - Rory Macneil at Repository Fringe 2015
Repository Fringe 2015 - Jisc RDM Session, Linda Naughton, Jisc
Building data networks: exploring trust and interoperability between authoris...
Jisc on repositories unleashing data - Daniela Duca
IRUS-UK at Repository Fringe 2015 - Jo Alcock
Impact and EPrints - Rosie-Marie Barbeau and Mick Eadie
Open Data and Sharing Science - Graham Steel, Contentmine
SHERPA Services breakout session - Bill Hubbard
REF compliance - what Jisc is doing
RCUK - what Jisc is doing
Linking Software: citations, roles, references and more
Jisc Publications Router
Linking Research Outputs - Rachel Kotarski
HHuLO Access – Hull, Huddersfield and Lincoln explore open access good practi...
Latest developments in Hydra-land - Chris Awre, University of Hull
ArchivesSpace - Scott Renton, University of Edinburgh
Ad

Recently uploaded (20)

PPTX
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
PDF
Comparative analysis of machine learning models for fake news detection in so...
PPTX
Build Your First AI Agent with UiPath.pptx
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PDF
Developing a website for English-speaking practice to English as a foreign la...
DOCX
search engine optimization ppt fir known well about this
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
CloudStack 4.21: First Look Webinar slides
PDF
The influence of sentiment analysis in enhancing early warning system model f...
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
How IoT Sensor Integration in 2025 is Transforming Industries Worldwide
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PPTX
Configure Apache Mutual Authentication
PDF
sbt 2.0: go big (Scala Days 2025 edition)
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Flame analysis and combustion estimation using large language and vision assi...
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
Comparative analysis of machine learning models for fake news detection in so...
Build Your First AI Agent with UiPath.pptx
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
Developing a website for English-speaking practice to English as a foreign la...
search engine optimization ppt fir known well about this
1 - Historical Antecedents, Social Consideration.pdf
CloudStack 4.21: First Look Webinar slides
The influence of sentiment analysis in enhancing early warning system model f...
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
Getting started with AI Agents and Multi-Agent Systems
A review of recent deep learning applications in wood surface defect identifi...
How IoT Sensor Integration in 2025 is Transforming Industries Worldwide
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Configure Apache Mutual Authentication
sbt 2.0: go big (Scala Days 2025 edition)
Zenith AI: Advanced Artificial Intelligence
NewMind AI Weekly Chronicles – August ’25 Week III
Chapter 5: Probability Theory and Statistics
Flame analysis and combustion estimation using large language and vision assi...
Ad

Dan Needham & Phil Cross (mimas) – Names Project

Editor's Notes

  • #2: My name is Dan and this is Phil. We are from Mimas and we’re going to be giving an overview of the Names project and the work we’ve been doing with it in the repository area.
  • #3: Names is a jisc funded project, in partnership with the British Library. Working to uniquely identify individuals and institutions within UK acedimia, and investigating requirements for a name authority service. As part of this we have developed a software prototype that we are moving towards live service.
  • #4: For the prototype we needed to build our own record store. We have done this by pulling in data from a variety of sources and attempting to deduplicate and disambiguate the unique individuals contained to form our own records.
  • #5: Using a famous case study we can see that the information associated with Brian Cox can be used to help deduplicate and disambiguate him from individuals with similar names across a number of different data sources. In our case we’re looking at things like name form, fields of interest, institutional affiliation and collaborative relationships.
  • #6: Having built an initial set of records derived from external data we built a RESTful API which could be used to query and retrieve data in a variety of datas including JSOn, RDF, MARKXML and default html
  • #7: So here we have Brian’s names record represented using html. His identifier is a resolvable url which brings back the information associated with his record. You can see that it can be returned in the other formats mentioned, and can be filtered by different attributes.
  • #8: Individuals may use their identifiers to identify themselves for paper submission. External applications or repositories might use the API to retrieve data for their own purposes. Researchers, other academics or reporters may search the service to find details on an individual. Libraries my use it for cataloguing purposes. All of these sources might feed data back into the service.
  • #9: We currently have 46840 permanent records derived from RAE merit data and a submission for RGU. We are processing ~30 million records from Zetoc, UKPMC and Southampton Eprints. Thousands on the way from different repositories.
  • #10: There are several next steps for names: 1 – Increase and flesh out records within Names. 2 – develop interoperability with other identifier systems. 3 – Continue to develop connections with other applications that might use names data 3 – develop facilities for researcher ownership over their own data.
  • #11: So that’s a brief overview of the names prototype and where we’re up to with it. Thanks to Brian for unwittingly participating in my presentation. I’ll now pass you over to Phil who will discuss the work we’ve been doing with Names in the repository area.
  • #12: I’ll be talking about how Names is working with repositoriesMostly looking at EPrints software at the moment but the work should be extensible to other platformsSo I’ll be talking about an EPrints plugin we’re writingAbout how we are able to extract researcher information from EPrints repositoriesAnd about how you can manually submit researcher information to us
  • #13: The plugin augments the existing auto-complete on name entry but searches our API for matching names in addition to searching the local repositoryIt adds a preferred form for the name together with the unique person identifier supplied by Names
  • #14: First sceenshot shows the auto-complete drop down but showing results from Names – including the unique identifiersIt’s Brian again!
  • #15: You need to differentiate between people with the same name so a mouseover shows disambiguating information from Names.At present this is only the Field of Interest information. Will include other information such as example publications, co-authors, possibly a homepage
  • #16: Here we see another Roberts, A but with a different Field of Interest.Note that the URI is currently going into the Email field. This is the EPrints creator id field but we are intending to generate a new Names URI field with the plugin to avoid overwriting any existing internal identifiers
  • #17: Finally you can see that the plugin still searches across the local database – see the entries for Cross, PhilWe’ll be demoing the software if you want to have a better look
  • #18: Complimentary plugins that we are considering for the future are:a means for researchers to export info about themselves to Names via our API to create an entry and identifieran admin plugin to enable global editing of existing researcher names to the Names format, adding unique identifiers – would depend on there already being creator ids assigned internally
  • #19: Other ways we are interoperating with repositories:EPrints has the ability to export it’s entire set of metadata as an RDF graph. We have created a module that imports this data where an internal identifier has been used to automatically add researcher information to our database
  • #20: Finally, for institutions who wish to send us data about their researchers to generate Names URIs, we have published a spreadsheet format that people can email to us for bulk upload.
  • #21: We are very interested in receiving input from the repository community about what you would consider useful for our service to provide and from anyone who would be interested in working with us.