Where’s Your Data?
             What's so important about
                 managing data?


Sherry Lake
University of Virginia Library
Curry Research Conference           February 1, 2013
Why are you here?
• You’re managing data (your own or your
  lab’s)
• Or you think maybe you should be
  – Who has told you to?
• Or you think you will have to in the future
• You’re not sure why it matters
• You’re curious and want to know how
  managing data affects you
Discussion
• Stories about poor data management
• In groups, assess & report out (2 minutes
  max)
  – What bad data-related practices are talked
    about?
  – What problems did the bad practice(s) cause?
  – What happened to the researcher because of
    poor practice?
  – How could this have been prevented?
Why Manage Research Data?
• Saves Time
  – Simplifies your research & increases your
    research efficiency
• Makes preserving data for the long term
  easier
  – Takes less time to get data ready to share
• Supports Sharing
  – Can focus on research not user requests
  – Lets others understand your data
  – Increases Research Impact
• Meets grant requirements
Who Cares?
From Flickr by Redden-McAllister
From Flickr by AJC1
www.rba.gov.au
Why Share Data?
• Required by funding agencies
• Reinforces open scientific inquiry
• Increases the visibility of your research &
  your own research reputation
• Facilitates new discoveries
• Reduces costs by avoiding duplication
• Makes it easier to re-use and verify data sets

Who is Requiring Data Sharing?
• Publishers
  –   Nature Publishing Group
  –   American Naturalist
  –   Evolution
  –   Journal of Evolutionary Biology
  –   ESA journals
• Funding agencies
  –   National Institutes of Health (NIH)
  –   NIH Public Access Mandate (for publications)
  –   National Science Foundation (NSF)
  –   Institute of Museum and Library Services (IMLS)
Dissemination & Sharing of Research Results:
 “Investigators are expected to share with other
    researchers, at no more than incremental
    cost and within a reasonable time, the
    primary data, samples, physical collections
    and other supporting materials created or
    gathered in the course of work under NSF
    grants. Grantees are expected to encourage
    and facilitate such sharing.”

     NSF: Award & Administration Guide (AAG)
      Chapter VI.D.4
Plans for Data Management & Sharing of the
            Products of Research
• Proposals must include a supplementary
  document of no more than two pages labeled:
  “Data Management Plan”
• Document should describe how the proposal will
  conform to NSF sharing policy

NSF: Grant Proposal Guide (GPG) Chapter II.C.2.j
What is a Data Management Plan?
• Brief description of how you will comply
  with funder’s data sharing policy
• Reviewed as part of a grant application

    • How do I create one?
    • What is included?


DMPTool
• Online Data Management Plan creation tool
• Helps researchers meet requirements of NSF
  and other U.S. funding agencies
• Guides researchers through the process of
  creating a data management plan
• Is available to everyone
• Provides additional help for researchers at
  UVa
https://dmptool.org/
Parts of an NSF Data Management Plan
I.    Products of the Research: The types of data, samples,
      physical collections, software, curriculum materials, and
      other materials to be produced in the course of the project.
II.   Data Formats: The standards to be used for data and
      metadata format and content (where existing standards are
      absent or deemed inadequate, this should be documented
      along with any proposed solutions or remedies).
III. Access to Data and Data Sharing Practices and Policies:
     Policies for access and sharing including provisions for
     appropriate protection of privacy, confidentiality, security,
     intellectual property, or other rights or requirements.
IV. Policies for Re-Use, Re-Distribution, and Production of
    Derivatives.
V.    Archiving of Data: Plans for archiving data, samples, and
      other research products, and for preservation of access to
      them.
             Grant Proposal Guide (GPG) Chapter II.C.2.j
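
One way to use the five parts above is as a checklist against a draft plan. The sketch below is only an illustration, not an NSF or DMPTool utility: the helper name `missing_sections` and the sample `draft` text are hypothetical, and the section names are copied from the list above.

```python
# Hypothetical checklist helper (not part of any NSF or DMPTool software).
# Section names are taken from the five parts listed on this slide.
NSF_DMP_SECTIONS = [
    "Products of the Research",
    "Data Formats",
    "Access to Data and Data Sharing Practices and Policies",
    "Policies for Re-Use, Re-Distribution, and Production of Derivatives",
    "Archiving of Data",
]


def missing_sections(plan: dict[str, str]) -> list[str]:
    """Return the section names that are absent or empty in a draft plan."""
    return [name for name in NSF_DMP_SECTIONS if not plan.get(name, "").strip()]


# Made-up draft text, for illustration only.
draft = {
    "Products of the Research": "Survey responses and interview transcripts.",
    "Data Formats": "CSV for tabular data; plain text for transcripts.",
    "Archiving of Data": "",  # present but still empty
}

for name in missing_sections(draft):
    print("Still to write:", name)
```

A check like this only confirms that each part has some text. It says nothing about whether the plan fits the two-page limit or satisfies a reviewer.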
What is a Data Management Plan?
A comprehensive plan of how you will
manage your research data throughout the
lifecycle of your research project

1. Project description
2. Survey of existing data
3. Data to be created
   a. Data organization methods (optional)
4. Data sharing and archiving
5. Data administration issues:
   a. Funding and legislative requirements
   b. Data owners and stakeholders
   c. Access and security
   d. Backups
6. Responsibilities
7. Data documentation and metadata
8. Budget
Where to Get More Help
http://www.lib.virginia.edu/brown/data/DMP_Support.html
http://pages.shanti.virginia.edu/SciDaC_Grad_Training/
https://dmptool.org

           Scientific Data Consulting Group
             University of Virginia Library
                 scidac@virginia.edu
Links to Discussion Stories
• http://arstechnica.com/science/2012/03/99-percent-of-nasas-portable-devices-are-unencrypted/
• http://www.nature.com/news/psychology-must-learn-a-lesson-from-fraud-case-1.9513
• http://www.bizjournals.com/atlanta/news/2012/04/18/data-loss-at-emory-healthcare-exposes.html?s=print
• http://gawker.com/5625139/grad-students-thesis-dreams-on-stolen-laptop
• http://blog.dansimons.com/2012/09/the-fog-of-data-secrecy-and-science.html
• http://www.eurekalert.org/pub_releases/2012-09/uol-idm091812.php
• http://retractionwatch.wordpress.com/2012/08/16/hypertension-retracts-paper-over-data-glitch/


Editor's Notes

  • #5: Now let’s focus on why everyone is here: to learn about data management. Why should you care about data management? Why should you manage your data? Saving time: How many of you have had trouble finding your own data files, understanding them, or telling which file is the most recent? If you keep all your data organized and documented as you go through your research, it is easier to share your data at the end. When you share (i.e., in an archive), others can access your data without having to bother you, and if it is well documented, others can understand it. Research whose data is easy to find and easy to reuse has been shown to be associated with an increased citation rate: Piwowar, Heather A., Roger S. Day, and Douglas B. Fridsma. “Sharing Detailed Research Data Is Associated With Increased Citation Rate.” PLoS ONE 2.3 (2007): e308. As you will see, a key benefit of having a Data Management Plan is that it helps with sharing data, which is the main goal for grants.
  • #6: If you have a journal article, you have a record of what you did stored in journals. But the data underlying the results are really important. Who cares about them?
    – Funders care.
    – Colleagues: potential collaborators.
    – Institutions (not shown here).
    – Tenure committees, more so in the future.
    – You: you need to care because you might need to go back to the data in a few years, so you need a good description.
    – Future scientists: they could potentially use your data to discover important things. You need to be thinking about the future (providing data for them).
  • #7: In recent years, several national scientific organizations have issued statements and policies underscoring the need for prompt archiving of data, and some funding agencies have started to require that the data they fund be deposited in a public archive (http://www.carlboettiger.info/archives/905). Making your data available to other researchers through repositories can increase your prominence and show continued use of the data and the relevance of your research. Enabling others to use your data reinforces open scientific inquiry, can lead to new discoveries (maybe in a different discipline) and new collaborations, and prevents duplication of effort. NYTimes, on Alzheimer’s research: “parked our egos and intellectual-property noses outside the door and agreed that all of our data would be public immediately.”
  • #8: If you look at this list you will see journals in the life sciences, but it is a trend. The Office of Management and Budget (OMB) Circular A-110 provides the federal administrative requirements for grants and agreements with institutions of higher education, hospitals, and other non-profit organizations. In 1999, it was revised to provide public access under some circumstances to research data through the Freedom of Information Act (FOIA). Funding agencies have implemented the OMB requirement in various ways, such as PubMed for NIH papers. NIH 2003 Data Sharing Policy: In NIH's view, all data should be considered for data sharing. Data should be made as widely and freely available as possible while safeguarding the privacy of participants and protecting confidential and proprietary data. To facilitate data sharing, investigators submitting a research application requesting $500,000 or more of direct costs in any single year to NIH on or after October 1, 2003 are expected to include a plan for sharing final research data for research purposes, or state why data sharing is not possible. The NIH Public Access Policy ensures that the public has access to the published results of NIH-funded research. It requires scientists to submit final peer-reviewed journal manuscripts that arise from NIH funds to the digital archive PubMed Central upon acceptance for publication. To help advance science and improve human health, the Policy requires that these papers are accessible to the public on PubMed Central no later than 12 months after publication. Remember: managing your data will make it easier to share!
  • #9: The rest of this talk will focus on the Data Management Plan requirements for the NSF. This sharing expectation has been in the Grant Policy Manual since 2002. Even though this “sharing” requirement was in the Admin Guide, there had been little if any enforcement. There was only a “check box” in the FastLane system.
  • #10: The DMP should describe how the proposal will conform to NSF policy on the dissemination and sharing of research results (see AAG Chapter VI.D.4), and may include the parts from the generic guidelines. A valid Data Management Plan may include only the statement that no detailed plan is needed, as long as the statement is accompanied by a clear justification.
  • #11: As we will see, the NSF is really concerned with managing data in order to share it. Currently, it is not interested in researchers providing a more comprehensive Data Management Plan throughout the research life cycle. We recommend initiating a more comprehensive Data Management Plan to see the full benefits of managing your data. But to comply with the NSF mandate, all you need is a 2-page description of what data you have and how you will share it.
  • #13: Home page. Slide is clickable to go to the DMPTool page.
  • #16: Many ways to get started & logged in
  • #17: UVa is set up to use NetBadge for authentication, so you do not need to create a separate account (or password).
  • #24: The DMP should describe how the proposal will conform to NSF policy on the dissemination and sharing of research results (see AAG Chapter VI.D.4), and may include the parts from the generic guidelines. A valid Data Management Plan may include only the statement that no detailed plan is needed, as long as the statement is accompanied by a clear justification.
  • #25: Topics a comprehensive plan covers:
    – Data types: choosing file formats
    – Organizing your files: file naming conventions, version control
    – Access control & security: physical, network, computer systems and files
    – Backup & storage: media
    – Data quality control: collection, preparation
    – Data processing & analysis
    – File format conversions: accessible in the future, not software/hardware dependent
    – Document all data details
    As we will see, the NSF is really concerned with managing data in order to share it. Currently, it is not interested in researchers providing a more comprehensive Data Management Plan throughout the research life cycle. We recommend initiating a more comprehensive Data Management Plan to see the full benefits of managing your data. But to comply with the NSF mandate, all you need is a 2-page description of what data you have and how you will share it.
  • #26:
    – DIY, just-in-time design
    – Two navigation paths: grad student lifecycle, data needs
    – Different types of content delivery (videos, instructions, how-to, descriptions, case studies)
    – Opportunity to engage and own via social media bars and comments
    – Learn about good data management practices