SlideShare a Scribd company logo
How many citations are there in the Data Citation Index? 
D Torres-Salinas, E Jiménez-Contreras & N Robinson-Garcia 
EC3 Research Group 
EC3Metrics 
University of Granada 
19th International Conference on Science and Technology Indicators 
3-5 September 2014 Leiden, The Netherlands
Outline 
Rationale 
Data and citations 
Data Citation Index 
Discussion
Rationale 
“The data deluge has arrived.[…] If the rewards of the data deluge are to be reaped, then researchers who produce those data must share them” 
Borgman, 2012 
Peng, 2011
Rationale 
“The ‘dirty little secret’ behind the promotion of data sharing is that not much sharing may be taking place” 
Borgman, 2012 
“The lack of recognition incentives is regarded as a crucial and unresolved obstacle to establishing a data sharing culture” 
Piwowar et al., 2008
Data and citations 
“A consistent, rigorous approach to data citation is lacking” 
Parsons et al., 2010 
What do we cite? 
 Original study <- Piwowar et al. 
 Data papers <- Scientific Data 
 Data sets <- Data Citation Index
Data Citation Index 
GENERAL DESCRIPTION 
 Multidisciplinary database launched in 2012 
 It indexes data repositories from all scientific fields along with citation data associated to them 
 Follows an evaluation and selection process at the level of repository based on: subject, editorial content and geographic origin and scope
Data Citation Index 
PUBLICATION TYPES 
Data repositories a database comprising datasets and data studies which stores and provides access to the raw data 
Datasets a single or coherent set of data or a data file provided by the repository, as part of a collection, data study or experiment. 
Data studies description of studies or experiments held in repositories with the associated data which have been used in the data study.
Data Citation Index 
DATA STUDY EXAMPLES
Data Citation Index 
DATA SET EXAMPLES
Data Citation Index 
MATERIAL AND METHODS 
 Data retrieval in May-June 2013 
 Analysis by areas: Science, Engineering & Technology, Social Sciences and Arts & Humanities 
arXiv:1306.6584
Data Citation Index 
GENERAL INDICATORS All Document TypesDatasetsData studiesTotal Citations404,211294,051106,895Total Records2,623,5282,468,736154,674Uncited Records2,311,5532,185,062126,428% Uncited88.1188.5181.74Citation Average0.150.120.69Standard Desviation3.060.369.56
Data Citation Index 
REPOSITORIES BY AREA Engineering & Technology1Science67Social Sciences19Humanities & Arts9
Datasets Citat ions Data studies Citat ions 
Engineering & Technology 1545 890 240 26 
Humanit ies & Arts 44588 1 6847 20459 
Science 2004449 293193 114338 26189 
Social Sciences 424952 7 37855 69659 
Data Citation Index 
RECORDS AND CITATIONS BY AREA AND TYPE
Data Citation Index 
TOP 10 CATEGORIES HIGHLY CITED FOR DATASETS 0.000.501.001.500% 10% 20% 30% 40% 50% CrystallographyBiochemistry & Mol. BiologyGenetics & HeredityGeosciencesPhysics, Atomic, MolecularEvolutionary BiologyCell BiologySpectroscopyMedical Laboratory Tech. Nanoscience & Nanotech. Citation average andstandard deviation% of total citations from DCI 47% 23% 16%
Data Citation Index 
TOP 10 CATEGORIES HIGHLY CITED FOR DATA STUDIES 051015202530350% 10% 20% 30% SociologyDemographyEconomicsBusinessPolitical ScienceBiochemistry & Mol. BiologyGenetics & HeredityHealth Care SciencesCriminology & PenologyFamily StudiesCitation average andstandard deviation% of total citations from DCI 30%
Data Citation Index 
MAIN REPOSITORIES IN THE DCI, CITATIONS & RECORDS 0200004000060000800001000001200001400001600000100000200000300000400000500000600000700000MiRBaseGene Expression UniProt knowledgebaseCrystallography Open DatabaseU.S. Census Bureau TIGERProteinData BankArrayExpress ArchivePANGEAUK DATAARCHIVEInter-university Consortium for Political and Social ResearchAnimal QTL Database TotalNumber of citations in the Data Citation Index TotalNumber of records indexed the Data Citation IndexSize= Total CitationsPie Chart= % of citationsLEGEND
Discussion 
I. High rate of uncitedness (88%) 
II.Biased towards the Science 
III.Data sets vs. Data studies (Two Cultures?) 
IV.Too soon or too presumptious?
THANK YOU D Torres-Salinas torressalinas@gmail.com N Robinson-Garcia elrobin@ugr.es E Jiménez Contreras evaristo@ugr.es 
19th International Conference on Science and Technology Indicators 
3-5 September 2014 Leiden, The Netherlands

More Related Content

PDF
Research Data Management Services at UWA (July 2015)
PPTX
RESEARCH PROTOCOL
PPTX
Research data: publishers, policies and patient privacy
PDF
Experimental research data quality in
PPTX
Impact of clinical research as a confounder for medical school rankings
PPTX
SPARC 2013 Data Management Presentation
PPTX
Searching for evidence - PTY5EHR
PDF
Digital Scholar Webinar: Understanding and using PROSPERO: International pros...
Research Data Management Services at UWA (July 2015)
RESEARCH PROTOCOL
Research data: publishers, policies and patient privacy
Experimental research data quality in
Impact of clinical research as a confounder for medical school rankings
SPARC 2013 Data Management Presentation
Searching for evidence - PTY5EHR
Digital Scholar Webinar: Understanding and using PROSPERO: International pros...

What's hot (20)

PPT
Research evaluation in the Netherlands : a library perspective
PDF
Is it appropriate to limit searches to prospective trials registries? Resear...
PDF
Developing a Replicable Methodology for Automated Identification of Emerging ...
PPTX
PPTX
Continued citation of bad science and what we can do about it--2021-04-20
PPTX
Effectiveness of New, Informationist-led Curriculum Changes at the College of...
PPTX
Transparency and reproducibility in research
PPT
Baljeet ppt(1)2
PDF
Validity of Instruments, Appropriateness of Designs and Statistics in Article...
PPTX
Pharmacy libguide slideshare
PPTX
Cochrane Library_intro
PPT
Why study Data Sharing? (+ why share your data)
PPTX
Research data and scholarly publications: going from casual acquaintances to ...
PPT
systematic reviews and what the library can do to help
PDF
Leaders in Science - A/Prof Shane Grey
PPTX
PPTX
Share & Flourish workshop, Leiden, August 2014
Research evaluation in the Netherlands : a library perspective
Is it appropriate to limit searches to prospective trials registries? Resear...
Developing a Replicable Methodology for Automated Identification of Emerging ...
Continued citation of bad science and what we can do about it--2021-04-20
Effectiveness of New, Informationist-led Curriculum Changes at the College of...
Transparency and reproducibility in research
Baljeet ppt(1)2
Validity of Instruments, Appropriateness of Designs and Statistics in Article...
Pharmacy libguide slideshare
Cochrane Library_intro
Why study Data Sharing? (+ why share your data)
Research data and scholarly publications: going from casual acquaintances to ...
systematic reviews and what the library can do to help
Leaders in Science - A/Prof Shane Grey
Share & Flourish workshop, Leiden, August 2014
Ad

Viewers also liked (13)

PDF
Exploring Citation Networks to Study Intertextuality in Classics
PDF
Cloud Deployments with Apache Hadoop and Apache HBase
PDF
Efficient blocking method for a large scale citation matching
PDF
Emerging sources citation index (esci)
PPT
Cited Reference Searching
PDF
Intelligent web crawling
PPTX
CSMR: A Scalable Algorithm for Text Clustering with Cosine Similarity and Map...
PDF
Towards a Semantic Citation Index for the German Social Sciences
PDF
How to build your own citation index
PPT
The Research Paper and Citation Methodology
PDF
Using HBase Coprocessors to implement Prospective Search - Berlin Buzzwords -...
PPTX
Building a Scalable Web Crawler with Hadoop
Exploring Citation Networks to Study Intertextuality in Classics
Cloud Deployments with Apache Hadoop and Apache HBase
Efficient blocking method for a large scale citation matching
Emerging sources citation index (esci)
Cited Reference Searching
Intelligent web crawling
CSMR: A Scalable Algorithm for Text Clustering with Cosine Similarity and Map...
Towards a Semantic Citation Index for the German Social Sciences
How to build your own citation index
The Research Paper and Citation Methodology
Using HBase Coprocessors to implement Prospective Search - Berlin Buzzwords -...
Building a Scalable Web Crawler with Hadoop
Ad

Similar to How many citations are there in the Data Citation Index? (20)

PDF
How many citations are there in the Data Citation Index
PDF
How many citations are there in the Data Citation Index?
PDF
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
PPTX
THOR Workshop - Data Publishing PLOS
PDF
Alain Frey Research Data for universities and information producers
PDF
NordForsk Open Access Reykjavik 14-15/8-2014:Status and-plans-norway
PDF
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
PDF
EXPERIMENTAL RESEARCH DATA QUALITY IN MATERIALS SCIENCE
PDF
EXPERIMENTAL RESEARCH DATA QUALITY IN MATERIALS SCIENCE
PDF
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
PPT
The Future: Overcoming the Barriers to Using NHS Clinical Data For Research P...
PDF
Gaining credit for sharing research data
PPTX
Research Data Management Services at UWA (November 2015)
PPTX
Tugas 1_Septiani Wulandari_engineering.pptx
PPTX
OER_s1220738.pptx
PPTX
Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...
PPTX
Research methodology
PPTX
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
PPT
Data Quality: Missing Data (PPT slides)
How many citations are there in the Data Citation Index
How many citations are there in the Data Citation Index?
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
THOR Workshop - Data Publishing PLOS
Alain Frey Research Data for universities and information producers
NordForsk Open Access Reykjavik 14-15/8-2014:Status and-plans-norway
Ontology-Driven Clinical Intelligence: Removing Data Barriers for Cross-Disci...
EXPERIMENTAL RESEARCH DATA QUALITY IN MATERIALS SCIENCE
EXPERIMENTAL RESEARCH DATA QUALITY IN MATERIALS SCIENCE
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
The Future: Overcoming the Barriers to Using NHS Clinical Data For Research P...
Gaining credit for sharing research data
Research Data Management Services at UWA (November 2015)
Tugas 1_Septiani Wulandari_engineering.pptx
OER_s1220738.pptx
Scott Edmunds: Channeling the Deluge: Reproducibility & Data Dissemination in...
Research methodology
NY Prostate Cancer Conference - P.A. Fearn - Session 1: Data management for p...
Data Quality: Missing Data (PPT slides)

More from Nicolas Robinson-Garcia (20)

PDF
Task specialization across research careers
PDF
Nuevas fuentes bibliométricas abiertas: Altmetrics y Acceso Abierto
PDF
Indicadores avanzados: Acceso Abierto y movilidad
PDF
Unveiling the Ecosystem of Science: How can we characterize and assess divers...
PDF
The effects of specialization on research careers
PDF
¿Cómo preparar y afrontar con éxito una estancia de investigación internacional?
PDF
Aligning scientific impact and societal relevance: The roles of academic enga...
PDF
Towards a multidimensional valuation model of scientists
PPTX
Breaking the Wall of Science Policy
PDF
Practical Applications of Altmetrics
PDF
Introduction to bibliometric data sources - Google Scholar
PDF
Aplicaciones prácticas de las Altmétricas
PDF
Curso básico de lenguaje R aplicado a las Ciencias Sociales
PDF
Altmétricas aplicadas a nivel institucional
PDF
From theory to practice: Operationalization of the GTEC framework
PDF
Practical applications of altmetrics
PDF
Disentangling gold open access
PDF
Making an impact: Scientific profiles and bibliometric indicators
PDF
The SSH conundrum: A matter of audiences
PDF
Indicadores de movilidad científica basados en datos bibliométricos
Task specialization across research careers
Nuevas fuentes bibliométricas abiertas: Altmetrics y Acceso Abierto
Indicadores avanzados: Acceso Abierto y movilidad
Unveiling the Ecosystem of Science: How can we characterize and assess divers...
The effects of specialization on research careers
¿Cómo preparar y afrontar con éxito una estancia de investigación internacional?
Aligning scientific impact and societal relevance: The roles of academic enga...
Towards a multidimensional valuation model of scientists
Breaking the Wall of Science Policy
Practical Applications of Altmetrics
Introduction to bibliometric data sources - Google Scholar
Aplicaciones prácticas de las Altmétricas
Curso básico de lenguaje R aplicado a las Ciencias Sociales
Altmétricas aplicadas a nivel institucional
From theory to practice: Operationalization of the GTEC framework
Practical applications of altmetrics
Disentangling gold open access
Making an impact: Scientific profiles and bibliometric indicators
The SSH conundrum: A matter of audiences
Indicadores de movilidad científica basados en datos bibliométricos

Recently uploaded (20)

PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Classroom Observation Tools for Teachers
PPTX
GDM (1) (1).pptx small presentation for students
PDF
Sports Quiz easy sports quiz sports quiz
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
01-Introduction-to-Information-Management.pdf
PPTX
master seminar digital applications in india
PDF
Pre independence Education in Inndia.pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Cell Types and Its function , kingdom of life
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Computing-Curriculum for Schools in Ghana
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
Institutional Correction lecture only . . .
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Basic Mud Logging Guide for educational purpose
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Classroom Observation Tools for Teachers
GDM (1) (1).pptx small presentation for students
Sports Quiz easy sports quiz sports quiz
STATICS OF THE RIGID BODIES Hibbelers.pdf
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
01-Introduction-to-Information-Management.pdf
master seminar digital applications in india
Pre independence Education in Inndia.pdf
Microbial disease of the cardiovascular and lymphatic systems
Cell Types and Its function , kingdom of life
Microbial diseases, their pathogenesis and prophylaxis
O7-L3 Supply Chain Operations - ICLT Program
Computing-Curriculum for Schools in Ghana
102 student loan defaulters named and shamed – Is someone you know on the list?
Institutional Correction lecture only . . .
PPH.pptx obstetrics and gynecology in nursing
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Basic Mud Logging Guide for educational purpose

How many citations are there in the Data Citation Index?

  • 1. How many citations are there in the Data Citation Index? D Torres-Salinas, E Jiménez-Contreras & N Robinson-Garcia EC3 Research Group EC3Metrics University of Granada 19th International Conference on Science and Technology Indicators 3-5 September 2014 Leiden, The Netherlands
  • 2. Outline Rationale Data and citations Data Citation Index Discussion
  • 3. Rationale “The data deluge has arrived.[…] If the rewards of the data deluge are to be reaped, then researchers who produce those data must share them” Borgman, 2012 Peng, 2011
  • 4. Rationale “The ‘dirty little secret’ behind the promotion of data sharing is that not much sharing may be taking place” Borgman, 2012 “The lack of recognition incentives is regarded as a crucial and unresolved obstacle to establishing a data sharing culture” Piwowar et al., 2008
  • 5. Data and citations “A consistent, rigorous approach to data citation is lacking” Parsons et al., 2010 What do we cite?  Original study <- Piwowar et al.  Data papers <- Scientific Data  Data sets <- Data Citation Index
  • 6. Data Citation Index GENERAL DESCRIPTION  Multidisciplinary database launched in 2012  It indexes data repositories from all scientific fields along with citation data associated to them  Follows an evaluation and selection process at the level of repository based on: subject, editorial content and geographic origin and scope
  • 7. Data Citation Index PUBLICATION TYPES Data repositories a database comprising datasets and data studies which stores and provides access to the raw data Datasets a single or coherent set of data or a data file provided by the repository, as part of a collection, data study or experiment. Data studies description of studies or experiments held in repositories with the associated data which have been used in the data study.
  • 8. Data Citation Index DATA STUDY EXAMPLES
  • 9. Data Citation Index DATA SET EXAMPLES
  • 10. Data Citation Index MATERIAL AND METHODS  Data retrieval in May-June 2013  Analysis by areas: Science, Engineering & Technology, Social Sciences and Arts & Humanities arXiv:1306.6584
  • 11. Data Citation Index GENERAL INDICATORS All Document TypesDatasetsData studiesTotal Citations404,211294,051106,895Total Records2,623,5282,468,736154,674Uncited Records2,311,5532,185,062126,428% Uncited88.1188.5181.74Citation Average0.150.120.69Standard Desviation3.060.369.56
  • 12. Data Citation Index REPOSITORIES BY AREA Engineering & Technology1Science67Social Sciences19Humanities & Arts9
  • 13. Datasets Citat ions Data studies Citat ions Engineering & Technology 1545 890 240 26 Humanit ies & Arts 44588 1 6847 20459 Science 2004449 293193 114338 26189 Social Sciences 424952 7 37855 69659 Data Citation Index RECORDS AND CITATIONS BY AREA AND TYPE
  • 14. Data Citation Index TOP 10 CATEGORIES HIGHLY CITED FOR DATASETS 0.000.501.001.500% 10% 20% 30% 40% 50% CrystallographyBiochemistry & Mol. BiologyGenetics & HeredityGeosciencesPhysics, Atomic, MolecularEvolutionary BiologyCell BiologySpectroscopyMedical Laboratory Tech. Nanoscience & Nanotech. Citation average andstandard deviation% of total citations from DCI 47% 23% 16%
  • 15. Data Citation Index TOP 10 CATEGORIES HIGHLY CITED FOR DATA STUDIES 051015202530350% 10% 20% 30% SociologyDemographyEconomicsBusinessPolitical ScienceBiochemistry & Mol. BiologyGenetics & HeredityHealth Care SciencesCriminology & PenologyFamily StudiesCitation average andstandard deviation% of total citations from DCI 30%
  • 16. Data Citation Index MAIN REPOSITORIES IN THE DCI, CITATIONS & RECORDS 0200004000060000800001000001200001400001600000100000200000300000400000500000600000700000MiRBaseGene Expression UniProt knowledgebaseCrystallography Open DatabaseU.S. Census Bureau TIGERProteinData BankArrayExpress ArchivePANGEAUK DATAARCHIVEInter-university Consortium for Political and Social ResearchAnimal QTL Database TotalNumber of citations in the Data Citation Index TotalNumber of records indexed the Data Citation IndexSize= Total CitationsPie Chart= % of citationsLEGEND
  • 17. Discussion I. High rate of uncitedness (88%) II.Biased towards the Science III.Data sets vs. Data studies (Two Cultures?) IV.Too soon or too presumptious?
  • 18. THANK YOU D Torres-Salinas torressalinas@gmail.com N Robinson-Garcia elrobin@ugr.es E Jiménez Contreras evaristo@ugr.es 19th International Conference on Science and Technology Indicators 3-5 September 2014 Leiden, The Netherlands