SlideShare a Scribd company logo
Peer review uncertainty at the
institutional level
V.A. Traag1, M. Malgarini2, T. Cicero2, S. Sarlo2, L. Waltman1
1Centre for Science and Technology Studies (CWTS), Leiden University, the Netherlands
2ANVUR, Rome, Italy
Comparing peer review with metrics
• Post-publication research evaluation
• Context: performance based research funding
• How well do metrics agree with peer review?
• Relevant aggregation level
– Institutional level
– Article level
• Consider peer review uncertainty
1
Third Italian Research Evaluation Exercise
(VQR 2011-2014)
• VQR launched in June 2015, covering the period 2011—
2014.
• VQR covered
• 114 431 publications by
• 60 455 researchers working in
• 96 Universities,
• 12 Public Research Organizations (PRO) , and
• 27 other research Institutes (participating on a voluntary basis).
• in 16 research areas (GEVs)
• of which 11 bibliometric GEVs.
• VQR executed by the Italian National Agency for the
evaluation of Universities and Research Institutes
(ANVUR, ENQA Affiliate from September 2013).
2
Final Result
VQR research outputs evaluation method
3
Peer-review
method
Informed peer-review
method
GEV final approval
Research Outputs to be evaluated
GEV assigns each output to two GEV members
GEV members may choose evaluation method on the basis of:
• GEV rules (stated into GEV criteria documents)
• Output characteristics
Excellent
Good
Fair
Acceptable
Limited
Ineligible
Sample journal articles
• Random sample of 10% of all output of bibliometric GEV.
• The sample has been stratified on the basis of GEV.
• Selected articles were submitted for peer review using the
same process as in the VQR exercise.
• The number of articles effectively peer reviewed covers 9.3%
of all articles submitted to bibliometric evaluation.
• No substantial selection biases (language, number of pages,
bibliometric evaluation) emerged from a post stratification
analysis.
• The final sample includes 7164 publications.
4
Sample shares at the institutional level
5
Bibliometric Methodology
• Sampling 7164 papers, matched 5183 to WoS.
• 4560 from 78 Italian universities, almost 8%.
• Two peer reviews of same article
– Randomly designated reviewer 1 and reviewer 2
– Score between 3 – 30 (covering originality, rigour and impact)
• Institutional scores
– Average of peer review scores
– Average of normalised WoS citations
– Average of normalised WoS journal impact
– Average of VQR citation percentile (various data sources)
– Average of VQR journal percentile (various data sources)
6
Metrics and peer review uncertainty
• Peer review uncertainty
– Compare institutional score of reviewer 2 to reviewer 1.
• Metrics
– Compare institutional metric score to reviewer 1.
7
Article Spearman correlations (WoS)
8
Institutional Spearman correlations
(WoS)
9
Institutional Spearman correlations
(VQR)
10
Conclusions
• Institutional level agreement higher than article level
agreement.
• Agreement with peer review is higher for journal
indicators than for citation indicators.
• Internal peer review agreement comparable to journal
indicator agreement.
• Do reviewers base evaluation largely on journal?
11

More Related Content

PPTX
83341 ch25 jacobsen
PPTX
Research-only rankings of HEIs: Is it possible to measure scientific performa...
PPTX
Responsible use of university rankings
PPTX
Scientific information retrieval: Challenges and opportunities
PPTX
From econometrics to bibliometrics
PPTX
Comparing scientific performance across disciplines: Methodological and conce...
PPTX
Contextualized scientometrics: What's behind the numbers?
PPTX
An in-depth bibliometric perspective on China’s scientific performance
83341 ch25 jacobsen
Research-only rankings of HEIs: Is it possible to measure scientific performa...
Responsible use of university rankings
Scientific information retrieval: Challenges and opportunities
From econometrics to bibliometrics
Comparing scientific performance across disciplines: Methodological and conce...
Contextualized scientometrics: What's behind the numbers?
An in-depth bibliometric perspective on China’s scientific performance

What's hot (20)

PPTX
Responsible metrics: One size doesn't fit all
PPTX
Scientometrics for research assessment
PPTX
Comparing bibliographic data sources
PPTX
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
PPTX
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
PPTX
Ranking universities responsibly
PDF
A systematic empirical comparison of different approaches for normalizing cit...
PPTX
Slide share wheretopublish2015
PPTX
Bibliometrics in the library Wageningen UR Library experience
PPTX
PPTX
My Research Impact
PPTX
New developments in the CWTS Leiden Ranking
PDF
Lecture workshop 2 am open access and altmetrics
PDF
Advanced citation matching and large-scale cited reference extraction
PDF
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
PDF
Clinical Trial Information at Crossref
PPTX
Slide share wheretopublish2015
PPTX
Open science: Implications for bibliometrics and scientometrics
PPT
Cochrane for Librarians: An update on searching and specialised registers
PPTX
Semantometrics: Towards Fulltext-based Research Evaluation
Responsible metrics: One size doesn't fit all
Scientometrics for research assessment
Comparing bibliographic data sources
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Ranking universities responsibly
A systematic empirical comparison of different approaches for normalizing cit...
Slide share wheretopublish2015
Bibliometrics in the library Wageningen UR Library experience
My Research Impact
New developments in the CWTS Leiden Ranking
Lecture workshop 2 am open access and altmetrics
Advanced citation matching and large-scale cited reference extraction
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
Clinical Trial Information at Crossref
Slide share wheretopublish2015
Open science: Implications for bibliometrics and scientometrics
Cochrane for Librarians: An update on searching and specialised registers
Semantometrics: Towards Fulltext-based Research Evaluation
Ad

More from Vincent Traag (20)

PDF
Replacing peer review by metrics in the UK REF?
PDF
Use of the journal impact factor for assessing individual articles need not b...
PDF
Uncovering important intermediate publications
PDF
Complex contagion of campaign donations
PDF
Polarization and consensus in citation networks
PDF
Community structure in complex networks
PDF
Introduction to complex networks
PDF
Public thesis defence: groups and reputation in social networks
PDF
Structure of media attention
PDF
Dynamics of Media Attention
PDF
Dynamical Models Explaining Social Balance
PDF
Significant scales in community structure
PDF
Reconstructing Third World Elite Rotation Events from Newspapers
PDF
Reputation Dynamics Through Gossiping
PDF
Limits of community detection
PDF
Cooperation, Reputation & Gossiping
PDF
Resolution-free community detection
PDF
Cooperation, Reputation & Gossiping
PDF
Exponential Ranking: Taking into account negative links.
PDF
Social Event Detection
Replacing peer review by metrics in the UK REF?
Use of the journal impact factor for assessing individual articles need not b...
Uncovering important intermediate publications
Complex contagion of campaign donations
Polarization and consensus in citation networks
Community structure in complex networks
Introduction to complex networks
Public thesis defence: groups and reputation in social networks
Structure of media attention
Dynamics of Media Attention
Dynamical Models Explaining Social Balance
Significant scales in community structure
Reconstructing Third World Elite Rotation Events from Newspapers
Reputation Dynamics Through Gossiping
Limits of community detection
Cooperation, Reputation & Gossiping
Resolution-free community detection
Cooperation, Reputation & Gossiping
Exponential Ranking: Taking into account negative links.
Social Event Detection
Ad

Recently uploaded (20)

PDF
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
PPTX
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat
PPT
Mutation in dna of bacteria and repairss
PPTX
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PPTX
PMR- PPT.pptx for students and doctors tt
PDF
GROUP 2 ORIGINAL PPT. pdf Hhfiwhwifhww0ojuwoadwsfjofjwsofjw
PPTX
Understanding the Circulatory System……..
PPTX
perinatal infections 2-171220190027.pptx
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PDF
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
PPT
LEC Synthetic Biology and its application.ppt
PDF
The Land of Punt — A research by Dhani Irwanto
PPTX
gene cloning powerpoint for general biology 2
PDF
Placing the Near-Earth Object Impact Probability in Context
PPTX
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
PPT
veterinary parasitology ````````````.ppt
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PPT
Presentation of a Romanian Institutee 2.
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PDF
CHAPTER 2 The Chemical Basis of Life Lecture Outline.pdf
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat
Mutation in dna of bacteria and repairss
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PMR- PPT.pptx for students and doctors tt
GROUP 2 ORIGINAL PPT. pdf Hhfiwhwifhww0ojuwoadwsfjofjwsofjw
Understanding the Circulatory System……..
perinatal infections 2-171220190027.pptx
lecture 2026 of Sjogren's syndrome l .pdf
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
LEC Synthetic Biology and its application.ppt
The Land of Punt — A research by Dhani Irwanto
gene cloning powerpoint for general biology 2
Placing the Near-Earth Object Impact Probability in Context
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
veterinary parasitology ````````````.ppt
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
Presentation of a Romanian Institutee 2.
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
CHAPTER 2 The Chemical Basis of Life Lecture Outline.pdf

Peer review uncertainty at the institutional level

  • 1. Peer review uncertainty at the institutional level V.A. Traag1, M. Malgarini2, T. Cicero2, S. Sarlo2, L. Waltman1 1Centre for Science and Technology Studies (CWTS), Leiden University, the Netherlands 2ANVUR, Rome, Italy
  • 2. Comparing peer review with metrics • Post-publication research evaluation • Context: performance based research funding • How well do metrics agree with peer review? • Relevant aggregation level – Institutional level – Article level • Consider peer review uncertainty 1
  • 3. Third Italian Research Evaluation Exercise (VQR 2011-2014) • VQR launched in June 2015, covering the period 2011— 2014. • VQR covered • 114 431 publications by • 60 455 researchers working in • 96 Universities, • 12 Public Research Organizations (PRO) , and • 27 other research Institutes (participating on a voluntary basis). • in 16 research areas (GEVs) • of which 11 bibliometric GEVs. • VQR executed by the Italian National Agency for the evaluation of Universities and Research Institutes (ANVUR, ENQA Affiliate from September 2013). 2
  • 4. Final Result VQR research outputs evaluation method 3 Peer-review method Informed peer-review method GEV final approval Research Outputs to be evaluated GEV assigns each output to two GEV members GEV members may choose evaluation method on the basis of: • GEV rules (stated into GEV criteria documents) • Output characteristics Excellent Good Fair Acceptable Limited Ineligible
  • 5. Sample journal articles • Random sample of 10% of all output of bibliometric GEV. • The sample has been stratified on the basis of GEV. • Selected articles were submitted for peer review using the same process as in the VQR exercise. • The number of articles effectively peer reviewed covers 9.3% of all articles submitted to bibliometric evaluation. • No substantial selection biases (language, number of pages, bibliometric evaluation) emerged from a post stratification analysis. • The final sample includes 7164 publications. 4
  • 6. Sample shares at the institutional level 5
  • 7. Bibliometric Methodology • Sampling 7164 papers, matched 5183 to WoS. • 4560 from 78 Italian universities, almost 8%. • Two peer reviews of same article – Randomly designated reviewer 1 and reviewer 2 – Score between 3 – 30 (covering originality, rigour and impact) • Institutional scores – Average of peer review scores – Average of normalised WoS citations – Average of normalised WoS journal impact – Average of VQR citation percentile (various data sources) – Average of VQR journal percentile (various data sources) 6
  • 8. Metrics and peer review uncertainty • Peer review uncertainty – Compare institutional score of reviewer 2 to reviewer 1. • Metrics – Compare institutional metric score to reviewer 1. 7
  • 12. Conclusions • Institutional level agreement higher than article level agreement. • Agreement with peer review is higher for journal indicators than for citation indicators. • Internal peer review agreement comparable to journal indicator agreement. • Do reviewers base evaluation largely on journal? 11