SlideShare a Scribd company logo
Reproducibility, Argument and Data in
Translational Medicine
© 2015 Massachusetts General Hospital
Tim Clark, Ph.D.
Assistant Professor of Neurology
Massachusetts General Hospital & Harvard Medical School
Massachusetts Alzheimer Disease Research Center
seminar presentation at the Biostatistics Department,
Harvard T.H. Chan School of Public Health,
February 4, 2015
“It has become apparent that an alarming number of
published results cannot be reproduced by other people.
That is what caused John Ioannidis to write his now
famous paper, Why Most Published Research Findings Are
False [1].
That sounds very strong. But in some areas of science it is
probably right.”
- David Colquhoun [2]
1. Ioannidis, J.P.A. (2005) Why Most Published Research Findings Are False, PLoS Med, 2, e124.
2. Colquhoun, D. (2014) An investigation of the false discovery rate and the misinterpretation of p-
values, Royal Society Open Science, 1.
Outline
• The translation gap
• The false reported discovery rate
• Attrition in pharmaceutical pipelines
• Historical background on reproducibility
• Logical status of scientific articles
• Coping strategies at the ecosystem level
• The global argument graph
• Conclusions & postscript
T1
T2
Scannell et al. 2012. Nat Rev Drug Discov, 2012;11(3):191–200.
T2
• ~ 80% to 90% of top-tier academic research is non-
reproducible in pharma target discovery labs.
• All phases in pharma discovery, development,
preclinical and clinical have significant attrition.
• ~ 90% attrition in clinical trials has huge financial
and social impact: risk avoidance.
T1
Hay et al.(2014) Nature Biotechnology 32,40–51.
Begley and Ellis (2012) Nature, 483, 531-533.
Prinz et al. (2011) Nat Rev Drug Discov, 10, 712.
Non-reproduciblity
11%
Begley CG and Ellis LM, Nature 2012, 483(7391):531-533
Obokata et al. 2014 Nature
On Obokata et al. 2014 from
pubpeer.com & imgur.com
[…]
http://imgur.com/1nBfKT
r
• Obakata et al. received extraordinary scrutiny
because of its surprising conclusions.
But what proportion of more “ordinary” papers receive
this type of scrutiny?
It received further scrutiny because upon examination
there turned out to be fraud.
What about non-fraudulent, but incorrect papers?
• Furthermore…
(1) It seems possible that Obokata’s fraudulent use of
data came from her inability to reproduce the
original Vacanti lab experiments in the RIKEN
environment.
(2) We do not know whether the technique began with
fraud at Harvard, or was simply “reproduced by
fraud” when legitimate reproduction failed at RIKEN.
Colquhoun 2014
• “Almost universal failure of biomedical papers to
appreciate what governs the false discovery rate.”
• “If you use p=0.05 to suggest that you have made a
discovery, you will be wrong at least 30% of the time.”
• “If, as is often the case, experiments are underpowered,
you will be wrong most of the time.”
• “To keep your false discovery rate below 5%, you need
to use a three-sigma rule, or to insist on p≤0.001.”
False discovery rate in
diagnostic tests
• For disorder X, a test correctly diagnoses
• 95% of people without X as “false(X)” (specificity =
.95) and
• 80% with X as “true(X)” (sensitivity = .80).
• Prevalence of X in the population = 1%
Diagnostic tests (contd.)
Colquhoun 2014, “An investigation of the false discovery rate and the misinterpretation of p-values”, Royal Society Open Science, 1.
False discovery rate: 86%
Drug screening
• Assume drug candidates work in 10% of cases.
• Power = 0.8, sig level 0.05
• False discovery rate = 45/(45+80)=36%
False discovery rate: 36%
“We optimistically estimate the median statistical
power of studies in the neuroscience field to be
between about 8% and about 31%.”
Button et al. 2013 Nature Reviews Neuroscience 14: 365-376
Underpowered
sensitivity=0.2 20% test positive
(20 true pos tests)
80% test negative
(80 false neg tests)
• False discovery rate = 45/(45+20)=69%
False discovery rate: 69%
Pharma attrition &
productivity
attrition = 95.9%
$1.78 billion per new drug
Paul, S.M., et al. (2010) How to improve R&D productivity: the pharmaceutical industry's grand challenge, Nat Rev Drug Discov, 9, 203-214.
Pharma attrition &
productivity
attrition = 95.9%
$1.78 billion per new drug
Paul, S.M., et al. (2010) How to improve R&D productivity: the pharmaceutical industry's grand challenge, Nat Rev Drug Discov, 9, 203-214.
target
selection
?
“Improving the quality of target selection is
the single most important factor to transform
industry productivity and bring innovative
new medicines to patients.”
Bunnage, M.E. (2011) Getting pharmaceutical R&D back on target, Nat
Chem Biol, 7, 335-339.
Historical background
Reproducibility
“Virtual witnessing” for those not present
using new the information technology of
the scientific journal &
the scientific article.
c. 1660: Robert Boyle and colleagues
concerned with scientific vlidity of claims,
e.g. “transformation of lead into gold”…
Scientific facts will now be established
by reproducible demonstration before a
“jury of one’s peers”.
adapted from [1] Steven Shapin 1984,
Pump and Circumstance:
Robert Boyle’s Literary Technology.
Social Studies of Science 14(4):481-520
BOYLE: “We took a large and lusty frog and having
included him in a small receiver we drew out the air
not very much and left him very much swelled and
able to move his throat from time to time - though
not so fast as when he freely breathed before the
exsuction (extraction) of the air. He continued alive
about two hours that we took notice of, sometimes
removing from one side of the receiver to the other,
but he swelled more than before, and did not
appear by any motion of his throat or thorax (chest)
to exercise respiration. But his head was not very
much swelled, nor his mouth forced open. After he
had remained there somewhat above 3 hours, for it
was not 3 hours and an half, perceiving noe signe of
life in him, we let in the air upon him, at which the
formerly tumid (swelled) body shrunk very much,
but seemed not to have any other change wrought
in it and though we took him out of the receiver yet
in the free air it self, he continued to appear stark
dead nevertheless to see the utmost of the
experiment having caused him to be carried into a
garden and layd upon the grass all night, the next
morning we found him perfectly alive again.” (BP 18,
fol. 127r)
adapted from Carusi 2015, “Virtual Witnessing”, in Future of Research Communications
& eScholarship, Mathematical Institute, Oxford UK, 11-12 January 2015.
Definition: A scientific article is a
1. defeasible argument for claims; supported by
2. exhibited, reproducible data and methods, and
3. explicit references to other work in the domain;
4. described using domain-agreed technical
terminology.
5. It exists in a complex ecosystem of technologies,
people and activities.
Logical status of a scientific article
16th Century: Phil. Trans. Royal Society v.1 (1665-6)
21st Century: J Immunology v.187 (2010)
Efforts to improve the
ecosystem
• Mandatory open access
• Direct data citation & archiving
• Methods cataloging & ID
• Open annotation (W3C OA)
• Micro- & nano-publications μPub
• Reproducibility initiative
Joint Declaration of Data Citation Principles
endorsed by over 80 scholarly organizations
Direct deposition and citation of primary research data
“Micropublications” may be used to
construct a graph of the discussion
and evidence including challenges.
Clark, Ciccarese & Goble: Micropublications: a Semantic Model of
Claims, Evidence, Argument and Annotation for Biomedical
Communication. Journal of Biomedical Semantics 2014 5:28
(http://www.jbiomedsem.com/content/5/1/28/abstract).
IPS: http://www.ebi.ac.uk/efo/EFO_0004905
Stem Cell: http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C
Semantic
Tags
http://purl.org/mp/mp:claim
http://purl.org/mp/mp:supportedBy
http://purl.org/mp/mp:data
Micropublication
Creating micropublications
Micropublication semantic
summary{
:MP3 rdf:type mp:Micropublication;
mp:name "MP(a3)";
mp:description "Digital summary of Spillman et al. 2010";
pav:authoredBy [ a foaf:Person ; foaf:name "Tim Clark" ];
pav:createdBy [ a foaf:Person ; foaf:name "Tim Clark" ];
pav:createdOn "2013-03-06T09:49:12-05:00"^^xsd:dateTime ;
mp:argues :C3;
mp:supportedBy <info:doi:10.1371/journal.pone.0009979> .
} .
:MP3 = {
:S1 rdf:type mp:Statement;
mp:hasContent "Rapamycin [is] an inhibitor of the mTOR pathway." ;
mp:supportedBy <info:doi/10.1038/nature08221> .
:S2 rdf:type mp:Statement;
mp:hasContent "PDAPP mice accumulate soluble and deposited Aβ and develop AD-like synaptic deficits as well as cognitive
impairment and hippocampal atrophy." ;
mp:supportedBy <info:doi/10.1073/pnas.96.6.3228> .
:S3 rdf:type mp:Statement;
mp:hasContent "Rapamycin-fed transgenic PDAPP mice showed improved learning (Figure 1a) and memory (Figure 1b). We
observed significant deficits in learning and memory in control-fed transgenic PDAPP animals." ;
mp:supportedBy <http://www.jneurosci.org/content/20/11/4050> .
:M1 rdf:type mp:Procedure;
mp:hasName "Rapamycin-supplemented mouse diet protocol" ;
mp:hasContent "We fed a rapamycin-supplemented diet... or control chow to groups of PDAPP mice and littermate non-
transgenic controls for 13 weeks. At the end of treatment (7 mo), learning and memory were tested using the Morris water maze." .
:M2 rdf:type mp:Material;
mp:hasName "PDAPP J20";
mp:hasDescription "Lennart Mucke's PDAPP J20 transgenic mice, as obtained from JAX, stock#006293" ;
mp:describedBy: <http://jaxmice.jax.org/strain/006293.html> .
:D1 rdf:type mp:Data;
pav:retrievedFrom <http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0009979#pone-0009979-g001>;
mp:supportedBy :M1, :M2 .
:C3 rdf:type mp:Claim;
mp:hasContent "Inhibition of mTOR by rapamycin can slow or block AD progression in a transgenic mouse model of the
disease." ;
mp:supportedBy :S1, :S2, :S3, :D1.
} .
Navigable claim-evidence
networks
Figure from Greenberg SA, British Medical Journal 2009, 339:b2680
Micro-pubs + Logical formalisms
coming soon: Open BEL…
(Biological Expression Language)
W3C Open Annotation Model
<body1> a cnt:ContentAsText, dctypes:Text ;
cnt:chars "content" ;
dc:format "text/plain" .
<target1> dc:format “application/pdf”
<anno1> a oa:Annotation ;
oa:hasBody <body1> ;
oa:hasTarget <target1> .
RDF
Micropublication of Obakata’s
original claims & data
Micropublication of
discussion from
PubPeer & Riken
Reproducibility, argument and data in translational medicine
But is this really such a great idea?
Does failure to reproduce
invalidate the original experiment,
or the reproduction experiment?
Transparency vs. Reproducibility
• Require significant effort to achieve progress but transparency
is more pragmatic.
• Transparency should naturally lead to more rapid
correction/validation/responsibility.
• Open licenses will facilitate assessment of reproducibility in
transparent content.
• Innovation and standardization needed in filtering and
identification of most reproducible works.
42
adapted with thanks, from a talk by Iain Hrynaszekwicz, Nature Publishing Group,
on “Transparency vs. Reproducibility”, Mathematical Institute, Oxford UK, Jan. 11, 2015
Should Scholarly Research Aim for
Reproducibility or Robustness?
Reproducibility: The ability of an entire experiment or study to be
reproduced, ideally according to the same reproducible experimental
description and procedure
Robustness: A characteristic describing a phenomenon / finding to be
detected effectively while the variables of a test system are altered
 A robust concept can be observed without failure under a variety of
conditions
 A robust finding may be (biologically) more relevant than reproducibility
⇨ Robustness of data may be key
adapted with thanks, from a talk by Thomas Steckler, Janssen Pharmaceuticals,
on “Reproducibility vs. Robustness”, Mathematical Institute, Oxford UK, Jan. 11, 2015
Conclusions
• False reported discovery rate (FRDR) is a systemic
problem in biomedical research and communication.
• FRDR drives up pharmaceutical attrition, cost of
health care; negatively impacts translation T1-T4.
• There are statistical, ethical, informatics and social
components to scientific reproducibility - all of which
need to be addressed.
Postscript
Ernest Rutherford: “All science is either physics
or stamp collecting.”
Paraphrase: Physics is the best and most rigorous
of all scientific enterprises, i.e., the “gold standard”.
Historical values of the speed of light
• pre-17th century: ∞ (instantaneous)
• 1638 Galileo: at least 10 times faster than sound
• 1675 Ole Roemer: 200,000 Km/sec
• 1728 James Bradley: 301,000 Km/sec
• 1849 Hippolyte Louis Fizeau: 313,300 Km/s
• 1862 Leon Foucault 299,796 Km/s
• Today: 299,792.458 km/s
Acknowledgements
• Sudeshna Das
• Paolo Ciccarese
• Emily Merrill
• Stian Soiland-Reyes
• Carole Goble
• Maryann Martone
• Annamaria Carusi
• Iain Hrynaskiewicz
• Thomas Steckler
• Brad Hyman

More Related Content

PPTX
2014 CrossRef Annual Meeting Keynote: Ways and Needs to Promote Rapid Data Sh...
PDF
What is the reproducibility crisis in science and what can we do about it?
PDF
References on Reproducibility Crisis in Science by D.V.M. Bishop
DOCX
Bishop reproducibility references nov2016
PPTX
Laurie Goodman at the BMC Roadshow: Transparency in Publishing and Being an O...
PDF
Fixing Science: The Replicability Crisis
PDF
NCI systems epidemiology 03012019
PPTX
COMPLETE GUIDE ON HOW TO DEBATE ABOUT ANIMAL EXPERIMENTATION
2014 CrossRef Annual Meeting Keynote: Ways and Needs to Promote Rapid Data Sh...
What is the reproducibility crisis in science and what can we do about it?
References on Reproducibility Crisis in Science by D.V.M. Bishop
Bishop reproducibility references nov2016
Laurie Goodman at the BMC Roadshow: Transparency in Publishing and Being an O...
Fixing Science: The Replicability Crisis
NCI systems epidemiology 03012019
COMPLETE GUIDE ON HOW TO DEBATE ABOUT ANIMAL EXPERIMENTATION

What's hot (17)

PDF
Casey Greene's Keynote for Rocky 2015
PDF
HRB-Health Research In Action booklet (feat. NICB)
PDF
Searching for predictors of male fecundity
PDF
FERASAT: A Serendipity-Fostering Faceted Browser for Linked Data
PDF
Op-Ed writing sample - NEAVS
PDF
The Modern Internist: Using Technology to Your Advantage
PDF
Use of Social Media for your Journal
PDF
Chirag patel unite for sight 041418
PPTX
HRA - Lush Prize Conference 2014
PPTX
Hypothesis woisri
PDF
Everything wrong with statistics (and how to fix it)
PPTX
Pepe "Enriching Preprints with Provenance, Reproducibility, and Trustworthiness"
PPTX
Eto when all else fails 002
PPTX
Laurie Goodman at #crossref14: Ways and Needs to Promote Rapid Data Sharing
PDF
Chevron Case: Re 21 - Public - Grandjean Expert Report (nov. 7, 2014)
PDF
Making Sense of Uncertainty
PDF
Is it appropriate to limit searches to prospective trials registries? Resear...
Casey Greene's Keynote for Rocky 2015
HRB-Health Research In Action booklet (feat. NICB)
Searching for predictors of male fecundity
FERASAT: A Serendipity-Fostering Faceted Browser for Linked Data
Op-Ed writing sample - NEAVS
The Modern Internist: Using Technology to Your Advantage
Use of Social Media for your Journal
Chirag patel unite for sight 041418
HRA - Lush Prize Conference 2014
Hypothesis woisri
Everything wrong with statistics (and how to fix it)
Pepe "Enriching Preprints with Provenance, Reproducibility, and Trustworthiness"
Eto when all else fails 002
Laurie Goodman at #crossref14: Ways and Needs to Promote Rapid Data Sharing
Chevron Case: Re 21 - Public - Grandjean Expert Report (nov. 7, 2014)
Making Sense of Uncertainty
Is it appropriate to limit searches to prospective trials registries? Resear...
Ad

Viewers also liked (9)

PPTX
Self-Service Linked Government Data with dcat and Gridworks
PPT
eXframe: A Semantic Web Platform for Genomic Experiments
PDF
201502 cmu portugal_highlights
KEY
Dagstuhl "Future" sesssion intro slides
PPT
Semantic Water Quality - Ping Wang
PPT
Annotopia open annotation services platform
PPT
Ngsp
PPT
exFrame: a Semantic Web Platform for Genomics Experiments
PPTX
Fairport domain specific metadata using w3 c dcat & skos w ontology views
Self-Service Linked Government Data with dcat and Gridworks
eXframe: A Semantic Web Platform for Genomic Experiments
201502 cmu portugal_highlights
Dagstuhl "Future" sesssion intro slides
Semantic Water Quality - Ping Wang
Annotopia open annotation services platform
Ngsp
exFrame: a Semantic Web Platform for Genomics Experiments
Fairport domain specific metadata using w3 c dcat & skos w ontology views
Ad

Similar to Reproducibility, argument and data in translational medicine (20)

PDF
P-values in crisis
PPT
Data collection methods to improve reproducibility
DOCX
NHS FPX 4000 Assessment 2 Applying Research Skills Answered.docx
PDF
Critical appraisal: How to read a scientific paper?
PDF
Visualization Tools for the Refinery Platform - Supporting reproducible resea...
PPT
How Do We Know What We Know?
PPTX
Selecting a research question
PPT
abstracts lecture--aiello--14 july 2011.ppt
PPT
Cullen: Why most published research is wrong
PPT
Velterop 2 a ssp arlington may 2015
PPT
The Reproducibility Crises in Biomedical Research and its Impact on Pharmaceu...
PPTX
Research Question and Hypothesis
PPT
Velterop 2 a ssp arlington may 2015
PDF
Biomedical Research_House of Cards_Editorial
PDF
08.11.08: The Information Cycle
PPTX
Biomedical Literature
PPT
Advice to junior researchers: High or low road to success?
PDF
Automatically Detecting Scientific Misinformation
P-values in crisis
Data collection methods to improve reproducibility
NHS FPX 4000 Assessment 2 Applying Research Skills Answered.docx
Critical appraisal: How to read a scientific paper?
Visualization Tools for the Refinery Platform - Supporting reproducible resea...
How Do We Know What We Know?
Selecting a research question
abstracts lecture--aiello--14 july 2011.ppt
Cullen: Why most published research is wrong
Velterop 2 a ssp arlington may 2015
The Reproducibility Crises in Biomedical Research and its Impact on Pharmaceu...
Research Question and Hypothesis
Velterop 2 a ssp arlington may 2015
Biomedical Research_House of Cards_Editorial
08.11.08: The Information Cycle
Biomedical Literature
Advice to junior researchers: High or low road to success?
Automatically Detecting Scientific Misinformation

Recently uploaded (20)

PPTX
AI_in_Pharmaceutical_Technology_Presentation.pptx
PPTX
Infection prevention and control for medical students
PPT
Recent advances in Diagnosis of Autoimmune Disorders
PDF
Assessment of Complications in Patients Maltreated with Fixed Self Cure Acryl...
PPT
Parental-Carer-mental-illness-and-Potential-impact-on-Dependant-Children.ppt
PPTX
Pulmonary Circulation PPT final for easy
PPTX
Vaginal Bleeding and Uterine Fibroids p
PDF
Dermatology diseases Index August 2025.pdf
PPTX
Nancy Caroline Emergency Paramedic Chapter 8
PDF
Structure Composition and Mechanical Properties of Australian O.pdf
PPT
Adrenergic drugs (sympathomimetics ).ppt
PPTX
Nursing Care Aspects for High Risk newborn.pptx
PPTX
ABG advance Arterial Blood Gases Analysis
PPTX
Basics of pharmacology (Pharmacology I).pptx
PPTX
NUTRITIONAL PROBLEMS, CHANGES NEEDED TO PREVENT MALNUTRITION
PPTX
Bronchial_Asthma_in_acute_exacerbation_.pptx
PPTX
Genaralised anxiety disorder presentation
PPTX
community services team project 2(4).pptx
PDF
2E-Learning-Together...PICS-PCISF con.pdf
PPTX
Current Treatment Of Heart Failure By Dr Masood Ahmed
AI_in_Pharmaceutical_Technology_Presentation.pptx
Infection prevention and control for medical students
Recent advances in Diagnosis of Autoimmune Disorders
Assessment of Complications in Patients Maltreated with Fixed Self Cure Acryl...
Parental-Carer-mental-illness-and-Potential-impact-on-Dependant-Children.ppt
Pulmonary Circulation PPT final for easy
Vaginal Bleeding and Uterine Fibroids p
Dermatology diseases Index August 2025.pdf
Nancy Caroline Emergency Paramedic Chapter 8
Structure Composition and Mechanical Properties of Australian O.pdf
Adrenergic drugs (sympathomimetics ).ppt
Nursing Care Aspects for High Risk newborn.pptx
ABG advance Arterial Blood Gases Analysis
Basics of pharmacology (Pharmacology I).pptx
NUTRITIONAL PROBLEMS, CHANGES NEEDED TO PREVENT MALNUTRITION
Bronchial_Asthma_in_acute_exacerbation_.pptx
Genaralised anxiety disorder presentation
community services team project 2(4).pptx
2E-Learning-Together...PICS-PCISF con.pdf
Current Treatment Of Heart Failure By Dr Masood Ahmed

Reproducibility, argument and data in translational medicine

  • 1. Reproducibility, Argument and Data in Translational Medicine © 2015 Massachusetts General Hospital Tim Clark, Ph.D. Assistant Professor of Neurology Massachusetts General Hospital & Harvard Medical School Massachusetts Alzheimer Disease Research Center seminar presentation at the Biostatistics Department, Harvard T.H. Chan School of Public Health, February 4, 2015
  • 2. “It has become apparent that an alarming number of published results cannot be reproduced by other people. That is what caused John Ioannidis to write his now famous paper, Why Most Published Research Findings Are False [1]. That sounds very strong. But in some areas of science it is probably right.” - David Colquhoun [2] 1. Ioannidis, J.P.A. (2005) Why Most Published Research Findings Are False, PLoS Med, 2, e124. 2. Colquhoun, D. (2014) An investigation of the false discovery rate and the misinterpretation of p- values, Royal Society Open Science, 1.
  • 3. Outline • The translation gap • The false reported discovery rate • Attrition in pharmaceutical pipelines • Historical background on reproducibility • Logical status of scientific articles • Coping strategies at the ecosystem level • The global argument graph • Conclusions & postscript
  • 5. Scannell et al. 2012. Nat Rev Drug Discov, 2012;11(3):191–200. T2
  • 6. • ~ 80% to 90% of top-tier academic research is non- reproducible in pharma target discovery labs. • All phases in pharma discovery, development, preclinical and clinical have significant attrition. • ~ 90% attrition in clinical trials has huge financial and social impact: risk avoidance. T1 Hay et al.(2014) Nature Biotechnology 32,40–51. Begley and Ellis (2012) Nature, 483, 531-533. Prinz et al. (2011) Nat Rev Drug Discov, 10, 712.
  • 7. Non-reproduciblity 11% Begley CG and Ellis LM, Nature 2012, 483(7391):531-533
  • 8. Obokata et al. 2014 Nature
  • 9. On Obokata et al. 2014 from pubpeer.com & imgur.com
  • 11. • Obakata et al. received extraordinary scrutiny because of its surprising conclusions. But what proportion of more “ordinary” papers receive this type of scrutiny? It received further scrutiny because upon examination there turned out to be fraud. What about non-fraudulent, but incorrect papers?
  • 12. • Furthermore… (1) It seems possible that Obokata’s fraudulent use of data came from her inability to reproduce the original Vacanti lab experiments in the RIKEN environment. (2) We do not know whether the technique began with fraud at Harvard, or was simply “reproduced by fraud” when legitimate reproduction failed at RIKEN.
  • 13. Colquhoun 2014 • “Almost universal failure of biomedical papers to appreciate what governs the false discovery rate.” • “If you use p=0.05 to suggest that you have made a discovery, you will be wrong at least 30% of the time.” • “If, as is often the case, experiments are underpowered, you will be wrong most of the time.” • “To keep your false discovery rate below 5%, you need to use a three-sigma rule, or to insist on p≤0.001.”
  • 14. False discovery rate in diagnostic tests • For disorder X, a test correctly diagnoses • 95% of people without X as “false(X)” (specificity = .95) and • 80% with X as “true(X)” (sensitivity = .80). • Prevalence of X in the population = 1%
  • 15. Diagnostic tests (contd.) Colquhoun 2014, “An investigation of the false discovery rate and the misinterpretation of p-values”, Royal Society Open Science, 1. False discovery rate: 86%
  • 16. Drug screening • Assume drug candidates work in 10% of cases. • Power = 0.8, sig level 0.05 • False discovery rate = 45/(45+80)=36% False discovery rate: 36%
  • 17. “We optimistically estimate the median statistical power of studies in the neuroscience field to be between about 8% and about 31%.” Button et al. 2013 Nature Reviews Neuroscience 14: 365-376
  • 18. Underpowered sensitivity=0.2 20% test positive (20 true pos tests) 80% test negative (80 false neg tests) • False discovery rate = 45/(45+20)=69% False discovery rate: 69%
  • 19. Pharma attrition & productivity attrition = 95.9% $1.78 billion per new drug Paul, S.M., et al. (2010) How to improve R&D productivity: the pharmaceutical industry's grand challenge, Nat Rev Drug Discov, 9, 203-214.
  • 20. Pharma attrition & productivity attrition = 95.9% $1.78 billion per new drug Paul, S.M., et al. (2010) How to improve R&D productivity: the pharmaceutical industry's grand challenge, Nat Rev Drug Discov, 9, 203-214. target selection ?
  • 21. “Improving the quality of target selection is the single most important factor to transform industry productivity and bring innovative new medicines to patients.” Bunnage, M.E. (2011) Getting pharmaceutical R&D back on target, Nat Chem Biol, 7, 335-339.
  • 23. Reproducibility “Virtual witnessing” for those not present using new the information technology of the scientific journal & the scientific article. c. 1660: Robert Boyle and colleagues concerned with scientific vlidity of claims, e.g. “transformation of lead into gold”… Scientific facts will now be established by reproducible demonstration before a “jury of one’s peers”.
  • 24. adapted from [1] Steven Shapin 1984, Pump and Circumstance: Robert Boyle’s Literary Technology. Social Studies of Science 14(4):481-520
  • 25. BOYLE: “We took a large and lusty frog and having included him in a small receiver we drew out the air not very much and left him very much swelled and able to move his throat from time to time - though not so fast as when he freely breathed before the exsuction (extraction) of the air. He continued alive about two hours that we took notice of, sometimes removing from one side of the receiver to the other, but he swelled more than before, and did not appear by any motion of his throat or thorax (chest) to exercise respiration. But his head was not very much swelled, nor his mouth forced open. After he had remained there somewhat above 3 hours, for it was not 3 hours and an half, perceiving noe signe of life in him, we let in the air upon him, at which the formerly tumid (swelled) body shrunk very much, but seemed not to have any other change wrought in it and though we took him out of the receiver yet in the free air it self, he continued to appear stark dead nevertheless to see the utmost of the experiment having caused him to be carried into a garden and layd upon the grass all night, the next morning we found him perfectly alive again.” (BP 18, fol. 127r) adapted from Carusi 2015, “Virtual Witnessing”, in Future of Research Communications & eScholarship, Mathematical Institute, Oxford UK, 11-12 January 2015.
  • 26. Definition: A scientific article is a 1. defeasible argument for claims; supported by 2. exhibited, reproducible data and methods, and 3. explicit references to other work in the domain; 4. described using domain-agreed technical terminology. 5. It exists in a complex ecosystem of technologies, people and activities. Logical status of a scientific article
  • 27. 16th Century: Phil. Trans. Royal Society v.1 (1665-6)
  • 28. 21st Century: J Immunology v.187 (2010)
  • 29. Efforts to improve the ecosystem • Mandatory open access • Direct data citation & archiving • Methods cataloging & ID • Open annotation (W3C OA) • Micro- & nano-publications μPub • Reproducibility initiative
  • 30. Joint Declaration of Data Citation Principles endorsed by over 80 scholarly organizations
  • 31. Direct deposition and citation of primary research data
  • 32. “Micropublications” may be used to construct a graph of the discussion and evidence including challenges. Clark, Ciccarese & Goble: Micropublications: a Semantic Model of Claims, Evidence, Argument and Annotation for Biomedical Communication. Journal of Biomedical Semantics 2014 5:28 (http://www.jbiomedsem.com/content/5/1/28/abstract).
  • 33. IPS: http://www.ebi.ac.uk/efo/EFO_0004905 Stem Cell: http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C Semantic Tags http://purl.org/mp/mp:claim http://purl.org/mp/mp:supportedBy http://purl.org/mp/mp:data Micropublication
  • 35. Micropublication semantic summary{ :MP3 rdf:type mp:Micropublication; mp:name "MP(a3)"; mp:description "Digital summary of Spillman et al. 2010"; pav:authoredBy [ a foaf:Person ; foaf:name "Tim Clark" ]; pav:createdBy [ a foaf:Person ; foaf:name "Tim Clark" ]; pav:createdOn "2013-03-06T09:49:12-05:00"^^xsd:dateTime ; mp:argues :C3; mp:supportedBy <info:doi:10.1371/journal.pone.0009979> . } . :MP3 = { :S1 rdf:type mp:Statement; mp:hasContent "Rapamycin [is] an inhibitor of the mTOR pathway." ; mp:supportedBy <info:doi/10.1038/nature08221> . :S2 rdf:type mp:Statement; mp:hasContent "PDAPP mice accumulate soluble and deposited Aβ and develop AD-like synaptic deficits as well as cognitive impairment and hippocampal atrophy." ; mp:supportedBy <info:doi/10.1073/pnas.96.6.3228> . :S3 rdf:type mp:Statement; mp:hasContent "Rapamycin-fed transgenic PDAPP mice showed improved learning (Figure 1a) and memory (Figure 1b). We observed significant deficits in learning and memory in control-fed transgenic PDAPP animals." ; mp:supportedBy <http://www.jneurosci.org/content/20/11/4050> . :M1 rdf:type mp:Procedure; mp:hasName "Rapamycin-supplemented mouse diet protocol" ; mp:hasContent "We fed a rapamycin-supplemented diet... or control chow to groups of PDAPP mice and littermate non- transgenic controls for 13 weeks. At the end of treatment (7 mo), learning and memory were tested using the Morris water maze." . :M2 rdf:type mp:Material; mp:hasName "PDAPP J20"; mp:hasDescription "Lennart Mucke's PDAPP J20 transgenic mice, as obtained from JAX, stock#006293" ; mp:describedBy: <http://jaxmice.jax.org/strain/006293.html> . :D1 rdf:type mp:Data; pav:retrievedFrom <http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0009979#pone-0009979-g001>; mp:supportedBy :M1, :M2 . :C3 rdf:type mp:Claim; mp:hasContent "Inhibition of mTOR by rapamycin can slow or block AD progression in a transgenic mouse model of the disease." ; mp:supportedBy :S1, :S2, :S3, :D1. } .
  • 36. Navigable claim-evidence networks Figure from Greenberg SA, British Medical Journal 2009, 339:b2680
  • 37. Micro-pubs + Logical formalisms coming soon: Open BEL… (Biological Expression Language)
  • 38. W3C Open Annotation Model <body1> a cnt:ContentAsText, dctypes:Text ; cnt:chars "content" ; dc:format "text/plain" . <target1> dc:format “application/pdf” <anno1> a oa:Annotation ; oa:hasBody <body1> ; oa:hasTarget <target1> . RDF
  • 39. Micropublication of Obakata’s original claims & data Micropublication of discussion from PubPeer & Riken
  • 41. But is this really such a great idea? Does failure to reproduce invalidate the original experiment, or the reproduction experiment?
  • 42. Transparency vs. Reproducibility • Require significant effort to achieve progress but transparency is more pragmatic. • Transparency should naturally lead to more rapid correction/validation/responsibility. • Open licenses will facilitate assessment of reproducibility in transparent content. • Innovation and standardization needed in filtering and identification of most reproducible works. 42 adapted with thanks, from a talk by Iain Hrynaszekwicz, Nature Publishing Group, on “Transparency vs. Reproducibility”, Mathematical Institute, Oxford UK, Jan. 11, 2015
  • 43. Should Scholarly Research Aim for Reproducibility or Robustness? Reproducibility: The ability of an entire experiment or study to be reproduced, ideally according to the same reproducible experimental description and procedure Robustness: A characteristic describing a phenomenon / finding to be detected effectively while the variables of a test system are altered  A robust concept can be observed without failure under a variety of conditions  A robust finding may be (biologically) more relevant than reproducibility ⇨ Robustness of data may be key adapted with thanks, from a talk by Thomas Steckler, Janssen Pharmaceuticals, on “Reproducibility vs. Robustness”, Mathematical Institute, Oxford UK, Jan. 11, 2015
  • 44. Conclusions • False reported discovery rate (FRDR) is a systemic problem in biomedical research and communication. • FRDR drives up pharmaceutical attrition, cost of health care; negatively impacts translation T1-T4. • There are statistical, ethical, informatics and social components to scientific reproducibility - all of which need to be addressed.
  • 46. Ernest Rutherford: “All science is either physics or stamp collecting.” Paraphrase: Physics is the best and most rigorous of all scientific enterprises, i.e., the “gold standard”.
  • 47. Historical values of the speed of light • pre-17th century: ∞ (instantaneous) • 1638 Galileo: at least 10 times faster than sound • 1675 Ole Roemer: 200,000 Km/sec • 1728 James Bradley: 301,000 Km/sec • 1849 Hippolyte Louis Fizeau: 313,300 Km/s • 1862 Leon Foucault 299,796 Km/s • Today: 299,792.458 km/s
  • 48. Acknowledgements • Sudeshna Das • Paolo Ciccarese • Emily Merrill • Stian Soiland-Reyes • Carole Goble • Maryann Martone • Annamaria Carusi • Iain Hrynaskiewicz • Thomas Steckler • Brad Hyman

Editor's Notes

  • #26: Boyle has in common with NGSP: Creating a multiplicity of witnesses. In principle ‘all men …’ Universal language / display of trustworthiness (diligence and modesty); elaborate sentences, plus a lot of circumstantial details. EXAMPLE Matters of fact -- this is what could be agreed upon. Management of dispute (supported by/ refutes) FIND PREVIOUS SLIDE Distinction of roles: direct witnessing; potential replicators; virtual witnessing --- not aimed at replication/ reproducion ----- Meeting Notes (09/09/2014 09:27) ----- NEXT: EXAMPLE NEXT: HOW SCIENCE GOES WRONG