SlideShare a Scribd company logo
An Analysis and Characterization of
DMPs in NSF Proposals from the
University of Illinois
RDAP14 Research Data Access & Preservation
Summit
March 26, 2014
William H. Mischo, Mary C. Schlembach, &
Megan N. O’Donnell
University of Illinois at Urbana-Champaign
Iowa State University
NSF Data Management Plans
• Data Management Plans (DMPs): required
element in NSF proposals, January 2011
• July 2011: the Library, working with the campus
Office of Sponsored Programs and Research
Administration (OSPRA) began an analysis of
DMPs in submitted NSF grant proposals
• Currently, looked at 1,600 grants with 1,260 in
the analysis.
Reasons for Analysis
•What storage venues and mechanisms for
sharing and reuse are being used?
•Are the PI’s using local templates and local
campus resources such as the IDEALS?
Follow-on
• Develop campus-wide infrastructure (Research
Data Service - RDS)
• Assist in compliance with federal agencies
• Develop important partnerships with campus
units (CITES, NCSA, Colleges) and national
entities
• Develop best practices and standard approaches
Analysis
• Analysis attempts to characterize and classify
DMPs into categories
• DMPs assigned multiple categories
• 1,260 DMPs from July 2011 to November 2013
Categories
• PI Server – Servers and workstations that the PIs
(and their students/staff) use to store project
data.
laboratory server/workstations, external hard drives, group
computer
• PI Website – Websites edited or administered
by the PI or a group they belong to.
Examples: lab website, project website, wiki, PI’s website
Categories
• Campus – Services located, operated by, run by
or endorsed by Illinois.
IDEALS, Netfiles and Box.net, NCSA, and Beckman
Institute.
• Department – Used when a department was
specifically mentioned as providing a storage or
hosting resource.
Departmental website, departmental server, departmental
backup service or a web address traced back to an
academic department (also given the “campus” label)
Categories
• Remote – Services and sites not located on the
Illinois campus.
NASA, other campuses, collaborative projects, non-Illinois
institutes
• Disciplinary – Disciplinary repositories.
GenBank, arXiv, ICPSR, SEAD, Nanohub, and Dryad
• Cloud – Storage services using cloud technology.
Google Drive, Google Code, Box.net, Amazon, Microsoft,
Dropbox
Categories
• Publication - Scholarly outputs.
Journal articles, workshops, and conference
presentations/posters.
• Analog - Physical records/data.
Lab notebooks, photographs, files
• Specimens - Physical specimens.
Usually biological or artifacts
Categories
• Optical Disc - DVD, CD, and Blu-ray discs.
• Not specified – the DMP was not specific
enough for us to categorize further.
• No Data – Indicated the proposal will produce
no data products.
• Local Template Used – used a library authored
template.
Category Number Percent
PI Server 503 39.9%
PI Website 529 41.9%
Campus 667 52.9%
Department 142 11.2%
Remote 353 28%
Disciplinary 275 21.8%
Publication 556 44.1%
Cloud 63 5%
Optical Disc 56 4%
Analog 131 10.4%
Specimens 111 8.8%
Not Specified 66 5.2%
Collaborative 164 13%
No Data 103 8.2%
ALL DMPs (n=1,260)
Data Venue and Risk
Data Location
Submitted Proposals Funded Proposals
Risk of Loss/Corruption/ Breach
n=1260 n=298
PI Server/Website 64% High 61% High
Departmental
Server/Website
11.2% Medium to High 7% Medium to High
Campus-Wide Resource 52.9%
Low
45%
LowIDEALS (Institutional
Repos.)
21.9% 19.8%
NCSA 4.3% 16.4%
Disciplinary
Repository/Cloud
25.8% Medium to Low 21.4% Medium to Low
Remote Repository 28% Medium to High 22.8% Medium to High
Optical Disk, Specimens,
Analog
19.4% Out of Scope 11% Out of Scope
Notables
• Funded: 298
• Used local
template: 254
• Only 87 DMPS
contained
information about
file types
• IDEALS: 275
• NCSA/XSEDE: 55
• Dryad: 22
• ICPSR: 17
• GenBank: 55
• ArX: 61
Analysis
• Any differences in storage venue or technologies
between the unfunded proposals and the funded
proposals?
• Any differences between the proposals from the
first year and the more current proposals?
• Other differences in proposal categories
between funded and unfunded
• 734 active NSF awards, $861.8 million
Analysis: Funded vs. Not-funded
• IDEALS institutional repository:
62 funded, 197 not funded: chi-square: 0.17
• Storing data on PI server or website:
183 funded, 569 not funded: chi-square: 0.7
• Disciplinary or Cloud:
67 funded, 241 not funded: chi-square: 0.85
• Remote storage:
68 funded, 267 not funded: chi-square: 3.01
Analysis
• Use of IDEALS
before August 2012 = 108
after (thru November 2013) = 166
chi-square: 4.59, p < .05
• Use of Disciplinary or Cloud
before August 2012 = 121
after = 182
chi-square: 4.33, p < .05
Implications and Conclusions
1. No significant differences between
funded/unfunded proposals in storage venues -
no advantage in IDEALS, Disciplinary.
2. More recent proposals suggest IDEALS and
disciplinary repositories included at a
significantly higher level
• What is the role of the library? The campus?
The subject discipline?
• Connecting data to the literature important

More Related Content

PPTX
Data management plans archeology class 10 18 2012
PPTX
NISO Training Thursday Crafting a Scientific Data Management Plan
PDF
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
PDF
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
PDF
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
PPTX
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
PDF
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
PPTX
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Data management plans archeology class 10 18 2012
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
RDAP 15 Navigating the Rocky Road to Research Data Acceptance
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research

What's hot (20)

PDF
Poster RDAP13: Data information literacy multiple paths to a single goal
PPTX
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
PPTX
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
PDF
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
PDF
Integration of research literature and data (InFoLiS)
PPT
Data Management for Undergraduate Researchers (updated - 02/2016)
PPTX
Repository Fringe 2016 - Survey Documentation and Analysis
PPTX
The Data Management Ecosystem
PPTX
Re tooling for data management-support
PDF
Who owns the data? Intellectual property considerations for academic research...
PPTX
RDAP13 Elizabeth Moss: The impact of data reuse
PDF
Research Data Management and Sharing for the Social Sciences and Humanities
PDF
Poster: Very Open Data Project
PPTX
DMPTool webinar 2011-10-19
PPTX
Next generation data services at the Marriott Library
PPT
Ownership, intellectual property, and governance considerations for academic ...
PPTX
Research Data Services at the University of Utah
PPTX
ICPSR Data Exploration Tools
PPTX
Data Management Plans: Tips, Tricks and Tools
PPTX
Best practices data collection
Poster RDAP13: Data information literacy multiple paths to a single goal
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Integration of research literature and data (InFoLiS)
Data Management for Undergraduate Researchers (updated - 02/2016)
Repository Fringe 2016 - Survey Documentation and Analysis
The Data Management Ecosystem
Re tooling for data management-support
Who owns the data? Intellectual property considerations for academic research...
RDAP13 Elizabeth Moss: The impact of data reuse
Research Data Management and Sharing for the Social Sciences and Humanities
Poster: Very Open Data Project
DMPTool webinar 2011-10-19
Next generation data services at the Marriott Library
Ownership, intellectual property, and governance considerations for academic ...
Research Data Services at the University of Utah
ICPSR Data Exploration Tools
Data Management Plans: Tips, Tricks and Tools
Best practices data collection
Ad

Viewers also liked (6)

PPTX
Copyright basics and update 5 9 2013
PPTX
Harp 101 9 16 2013
PPTX
Copyright eab 11 21 2014
PPTX
Harp 101 9 12 2013
PPTX
Beyond the shelves e brown
PPTX
Leveraging and interpreting library assessment data 4 17 2016
Copyright basics and update 5 9 2013
Harp 101 9 16 2013
Copyright eab 11 21 2014
Harp 101 9 12 2013
Beyond the shelves e brown
Leveraging and interpreting library assessment data 4 17 2016
Ad

Similar to An analysis and characterization of DMPs in NSF proposals from the University of Illinois (#RDAP14) (20)

PPT
RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...
PPTX
Meeting the NSF DMP Requirement June 13, 2012
PPTX
Meeting the NSF DMP Requirement: March 7, 2012
PPTX
NSF Data Policies webcast February 29, 2012
PDF
Introduction to Data Management Planning
PPT
Data management plans
PDF
Dc sheridan dlf_2011_final
PDF
Quality analysis of NSF DMP plans - Wayne State University
PDF
Defining the Libraries' Role in Research: A Needs Assessment  Case Study
PPTX
Enhancing DMPTool: Further Streamlineing Data Mangement Planning Process
PDF
DMPTool: a community tool
PPTX
Data Management Planning at the DCC
PPTX
Iassist 2012 dms public version
PDF
Praetzellis "Data Management Planning and Tools"
PPT
Introduction to Data Management Planning
PDF
Researh data management
PDF
Figshare for institutions - Jisc Digifest 2016
PPT
Data management plans
PPTX
Data Management Planning for Engineers
PPTX
International approached to DMPs - Sarah Jones
RDAP14: An analysis and characterization of DMPs in NSF proposals from the Un...
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement: March 7, 2012
NSF Data Policies webcast February 29, 2012
Introduction to Data Management Planning
Data management plans
Dc sheridan dlf_2011_final
Quality analysis of NSF DMP plans - Wayne State University
Defining the Libraries' Role in Research: A Needs Assessment  Case Study
Enhancing DMPTool: Further Streamlineing Data Mangement Planning Process
DMPTool: a community tool
Data Management Planning at the DCC
Iassist 2012 dms public version
Praetzellis "Data Management Planning and Tools"
Introduction to Data Management Planning
Researh data management
Figshare for institutions - Jisc Digifest 2016
Data management plans
Data Management Planning for Engineers
International approached to DMPs - Sarah Jones

Recently uploaded (20)

PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPT
Quality review (1)_presentation of this 21
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Foundation of Data Science unit number two notes
PPTX
Introduction to machine learning and Linear Models
PDF
Fluorescence-microscope_Botany_detailed content
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
Mega Projects Data Mega Projects Data
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Introduction to Knowledge Engineering Part 1
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Quality review (1)_presentation of this 21
Data_Analytics_and_PowerBI_Presentation.pptx
Reliability_Chapter_ presentation 1221.5784
Foundation of Data Science unit number two notes
Introduction to machine learning and Linear Models
Fluorescence-microscope_Botany_detailed content
Galatica Smart Energy Infrastructure Startup Pitch Deck
1_Introduction to advance data techniques.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Clinical guidelines as a resource for EBP(1).pdf
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Miokarditis (Inflamasi pada Otot Jantung)
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Mega Projects Data Mega Projects Data
Introduction-to-Cloud-ComputingFinal.pptx
Introduction to Knowledge Engineering Part 1

An analysis and characterization of DMPs in NSF proposals from the University of Illinois (#RDAP14)

  • 1. An Analysis and Characterization of DMPs in NSF Proposals from the University of Illinois RDAP14 Research Data Access & Preservation Summit March 26, 2014 William H. Mischo, Mary C. Schlembach, & Megan N. O’Donnell University of Illinois at Urbana-Champaign Iowa State University
  • 2. NSF Data Management Plans • Data Management Plans (DMPs): required element in NSF proposals, January 2011 • July 2011: the Library, working with the campus Office of Sponsored Programs and Research Administration (OSPRA) began an analysis of DMPs in submitted NSF grant proposals • Currently, looked at 1,600 grants with 1,260 in the analysis.
  • 3. Reasons for Analysis •What storage venues and mechanisms for sharing and reuse are being used? •Are the PI’s using local templates and local campus resources such as the IDEALS?
  • 4. Follow-on • Develop campus-wide infrastructure (Research Data Service - RDS) • Assist in compliance with federal agencies • Develop important partnerships with campus units (CITES, NCSA, Colleges) and national entities • Develop best practices and standard approaches
  • 5. Analysis • Analysis attempts to characterize and classify DMPs into categories • DMPs assigned multiple categories • 1,260 DMPs from July 2011 to November 2013
  • 6. Categories • PI Server – Servers and workstations that the PIs (and their students/staff) use to store project data. laboratory server/workstations, external hard drives, group computer • PI Website – Websites edited or administered by the PI or a group they belong to. Examples: lab website, project website, wiki, PI’s website
  • 7. Categories • Campus – Services located, operated by, run by or endorsed by Illinois. IDEALS, Netfiles and Box.net, NCSA, and Beckman Institute. • Department – Used when a department was specifically mentioned as providing a storage or hosting resource. Departmental website, departmental server, departmental backup service or a web address traced back to an academic department (also given the “campus” label)
  • 8. Categories • Remote – Services and sites not located on the Illinois campus. NASA, other campuses, collaborative projects, non-Illinois institutes • Disciplinary – Disciplinary repositories. GenBank, arXiv, ICPSR, SEAD, Nanohub, and Dryad • Cloud – Storage services using cloud technology. Google Drive, Google Code, Box.net, Amazon, Microsoft, Dropbox
  • 9. Categories • Publication - Scholarly outputs. Journal articles, workshops, and conference presentations/posters. • Analog - Physical records/data. Lab notebooks, photographs, files • Specimens - Physical specimens. Usually biological or artifacts
  • 10. Categories • Optical Disc - DVD, CD, and Blu-ray discs. • Not specified – the DMP was not specific enough for us to categorize further. • No Data – Indicated the proposal will produce no data products. • Local Template Used – used a library authored template.
  • 11. Category Number Percent PI Server 503 39.9% PI Website 529 41.9% Campus 667 52.9% Department 142 11.2% Remote 353 28% Disciplinary 275 21.8% Publication 556 44.1% Cloud 63 5% Optical Disc 56 4% Analog 131 10.4% Specimens 111 8.8% Not Specified 66 5.2% Collaborative 164 13% No Data 103 8.2% ALL DMPs (n=1,260)
  • 12. Data Venue and Risk Data Location Submitted Proposals Funded Proposals Risk of Loss/Corruption/ Breach n=1260 n=298 PI Server/Website 64% High 61% High Departmental Server/Website 11.2% Medium to High 7% Medium to High Campus-Wide Resource 52.9% Low 45% LowIDEALS (Institutional Repos.) 21.9% 19.8% NCSA 4.3% 16.4% Disciplinary Repository/Cloud 25.8% Medium to Low 21.4% Medium to Low Remote Repository 28% Medium to High 22.8% Medium to High Optical Disk, Specimens, Analog 19.4% Out of Scope 11% Out of Scope
  • 13. Notables • Funded: 298 • Used local template: 254 • Only 87 DMPS contained information about file types • IDEALS: 275 • NCSA/XSEDE: 55 • Dryad: 22 • ICPSR: 17 • GenBank: 55 • ArX: 61
  • 14. Analysis • Any differences in storage venue or technologies between the unfunded proposals and the funded proposals? • Any differences between the proposals from the first year and the more current proposals? • Other differences in proposal categories between funded and unfunded • 734 active NSF awards, $861.8 million
  • 15. Analysis: Funded vs. Not-funded • IDEALS institutional repository: 62 funded, 197 not funded: chi-square: 0.17 • Storing data on PI server or website: 183 funded, 569 not funded: chi-square: 0.7 • Disciplinary or Cloud: 67 funded, 241 not funded: chi-square: 0.85 • Remote storage: 68 funded, 267 not funded: chi-square: 3.01
  • 16. Analysis • Use of IDEALS before August 2012 = 108 after (thru November 2013) = 166 chi-square: 4.59, p < .05 • Use of Disciplinary or Cloud before August 2012 = 121 after = 182 chi-square: 4.33, p < .05
  • 17. Implications and Conclusions 1. No significant differences between funded/unfunded proposals in storage venues - no advantage in IDEALS, Disciplinary. 2. More recent proposals suggest IDEALS and disciplinary repositories included at a significantly higher level • What is the role of the library? The campus? The subject discipline? • Connecting data to the literature important

Editor's Notes

  • #4: Took out (covered in keynote) - Make key research data available and sharable - Allow the use of data for verification of results and reproducibility of research work - Agency can show significant return on investment to justify funding
  • #5: to support Illinois researchers in managing their data
  • #10: Very few DMPs were explicit as to how their “publications” and data were related or separated.
  • #11: No data: Many were theoretical studies (math), travel grants, or workshop planning sessions.