Research Data Access & Preservation Summit 2013, April 4-5, 2013, Baltimore, MD


Can Quantitative Social Scientists Get Data Reuse Satisfaction?

Ixchel M. Faniel, Ph.D., Postdoctoral Researcher, OCLC Research, fanieli@oclc.org
Elizabeth Yakel, Ph.D., Professor, University of Michigan, yakel@umich.edu
Adam Kriesberg, Ph.D. Student, University of Michigan, akriesbe@umich.edu
Morgan Daniels, Ph.D. Student, University of Michigan, mgdaniel@umich.edu



        The world’s libraries. Connected.
Agenda

• Introduction to the DIPIR Project
• Survey of ICPSR Data Reusers
   • Theoretical Frame
   • Our Model
   • Findings
   • Discussion
• Next Steps


• Institute of Museum and Library Services (IMLS) funded project led by Drs.
  Ixchel Faniel (PI) & Elizabeth Yakel (co-PI)


• Studying the intersection between data reuse and digital preservation in
  three academic disciplines to identify how contextual information about the
  data that supports reuse can best be created and preserved.


• Focuses on research data produced and used by quantitative social
  scientists, archaeologists, and zoologists.


• The intended audiences of this project are researchers who use secondary
  data and the digital curators, digital repository managers, data center
  staff, and others who collect, manage, and store digital information.
           For more information, please visit http://www.dipir.org

The Research Team

• Ixchel Faniel, OCLC Research (PI)
• Elizabeth Yakel, University of Michigan (Co-PI)
• Nancy McGovern, ICPSR/MIT
• William Fink, UM Museum of Zoology
• Eric Kansa, Open Context




Research Motivations & Questions



   1. What are the significant properties of quantitative social science,
      archaeological, and zoological data that facilitate reuse?

   2. How can these significant properties be expressed as representation
      information to ensure the preservation of meaning and enable data reuse?

                                                         (Faniel & Yakel, 2011)
Methods Overview


 Method (participants)             ICPSR          Open Context    UMMZ

 Phase 1: Project start-up
 Interviews (staff)                10             4               10
                                   Winter 2011    Winter 2011     Spring 2011

 Phase 2: Collecting and analyzing user data
 Interviews (data consumers)       44             22              27
                                   Winter 2012    Winter 2012     Fall 2012
 Survey (data consumers)           Over 1,600
                                   Summer 2012
 Web analytics (data consumers)                   Server logs
                                                  Ongoing
 Observations (data consumers)                                    10
                                                                  Ongoing

 Phase 3: Mapping significant properties as representation information

Measuring Data Repository Success


A Survey of ICPSR
Data Reusers


Theoretical Framework

  DeLone and McLean Information Systems (IS) Success Model (DeLone & McLean, 2003)

  [Figure: Information Quality, System Quality, and Service Quality lead to
  Intention to Use / Use and to User Satisfaction, which in turn lead to
  Net Benefits.]

Survey of ICPSR Data Reusers - Part 1


 Measuring Repository Success

What data quality
indicators contribute
to quantitative social
scientists’ data reuse
satisfaction?


ICPSR Survey of Data Reusers – Part 1


Data Quality Indicators

• Completeness – sufficiency, breadth, depth, and scope of
  the data for the task
• Relevancy – applicability and helpfulness of data for the
  task
• Accessibility – ease and speed with which data were retrieved
• Ease of Operation – ease with which data were managed and
  manipulated
• Credibility – correctness, reliability, impartiality of data
                                          (Wang and Strong, 1996; Lee et al., 2002)

ICPSR Survey of Data Reusers – Part 1


Additional Quality Indicators

• Data Producer Reputation – regard for a data producer’s
  work
• Documentation Quality – sufficiency and ability to
  facilitate use of the data




ICPSR Survey of Data Reusers – Part 1 (The Conceptual Model)


[Figure: each of the seven indicators below has a hypothesized positive (+)
relationship with Data Reuse Satisfaction.]

• Data Producer Reputation
• Data Ease of Operation
• Data Credibility
• Data Accessibility
• Data Completeness
• Data Relevancy
• Documentation Quality


Survey Methodology


 Data Collection
     1,632 first authors of published journal articles 2008-2012 surveyed

 The Survey
     Part 1: inquire about data reuse experience
     Part 2: inquire about experience using ICPSR repository and intention
             to continue use

Findings: Descriptive Statistics


  Variable Name                   Mean    Std. Deviation    Cronbach’s Alpha
  Data Completeness               5.68    1.07              0.76
  Data Relevancy                  6.50    0.58              0.75
  Data Accessibility              5.95    1.15              0.87
  Data Ease of Operation          5.93    1.14              0.86
  Data Credibility                6.23    0.66              0.79
  Data Producer Reputation        6.27    0.91              0.84
  Documentation Quality           6.04    0.77              0.84
  Data Reuse Satisfaction         6.30    0.89              0.80
                                                                     n = 254
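
For readers unfamiliar with the reliability column, the following is a minimal sketch of how Cronbach's alpha is commonly computed for a multi-item scale. The survey items behind each variable are not shown in the slides, so the `responses` matrix below is hypothetical illustration data, and `cronbach_alpha` is an illustrative helper name, not the authors' code.

```python
import numpy as np


def cronbach_alpha(items) -> float:
    """Cronbach's alpha for a (respondents x items) score matrix."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]                          # number of items in the scale
    item_vars = items.var(axis=0, ddof=1)       # sample variance of each item
    total_var = items.sum(axis=1).var(ddof=1)   # variance of the summed scale
    return (k / (k - 1)) * (1.0 - item_vars.sum() / total_var)


# Hypothetical responses: 5 respondents, 3 items on a numeric rating scale.
responses = np.array([
    [6, 7, 6],
    [5, 5, 6],
    [7, 7, 7],
    [4, 5, 4],
    [6, 6, 5],
])

alpha = cronbach_alpha(responses)
print(f"Cronbach's alpha: {alpha:.2f}")
```

Values closer to 1 indicate that the items of a scale vary together, which is why alpha is reported alongside each variable's mean and standard deviation.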

Findings: Multiple Regression Analysis


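 [Figure: multiple regression of Data Reuse Satisfaction on the seven
 indicators: Data Producer Reputation, Data Ease of Operation, Data
 Credibility, Data Accessibility, Data Completeness, Data Relevancy,
 Documentation Quality.]

 Path coefficients shown in the figure (in the order extracted from the slide;
 the layout that paired each value with its indicator is not preserved):
 .110*, .098, .034, .303***, .278***, .113, .118*

                                                         *p < .05, ***p < .001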
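
As a rough illustration of the kind of analysis named on this slide, the sketch below fits a multiple regression and reports standardized coefficients. It is not the authors' analysis: the slide does not say whether its coefficients are standardized, the data here are randomly generated, and the predictor names and effect sizes are assumptions chosen only for the example.

```python
import numpy as np


def standardized_betas(X, y) -> np.ndarray:
    """Ordinary least squares on z-scored predictors and outcome."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    Xz = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    yz = (y - y.mean()) / y.std(ddof=1)
    # After z-scoring, every variable has mean zero, so no intercept is needed.
    betas, *_ = np.linalg.lstsq(Xz, yz, rcond=None)
    return betas


# Synthetic data only: three hypothetical predictors, 254 simulated respondents.
rng = np.random.default_rng(0)
n = 254
predictors = ["completeness", "accessibility", "producer_reputation"]
X = rng.normal(size=(n, len(predictors)))
# Assumed effects: the first two predictors matter, the third does not.
y = 0.3 * X[:, 0] + 0.3 * X[:, 1] + rng.normal(scale=0.8, size=n)

betas = standardized_betas(X, y)
for name, beta in zip(predictors, betas):
    print(f"{name}: {beta:.3f}")
```

Significance tests such as the starred p-values on the slide would require standard errors as well, which a statistics package normally reports alongside the coefficients.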


ICPSR Survey of Data Reusers - Part 1


Discussion

• Tested measures of repository success
• Extended ideas about data quality beyond credibility and
  relevance of data
    • Data reuse satisfaction requires data that are
      complete, accessible, and easy to operate
• Data producer reputation was not significant
• Documentation quality played a role in data reuse
  satisfaction



ICPSR Survey of Data Reusers – Part 1


Next Steps – Continued Analysis

• How do other variables impact our model?
    • Journal impact factor
    • Prior data reuse experience
    • Nature of reuse
    • Prior ICPSR contributions
    • Data scarcity
    • Reuse dependence




Acknowledgements



• Institute of Museum and Library Services


• Partners: Nancy McGovern, Ph.D. (MIT), Eric Kansa,
  Ph.D. (Open Context), William Fink, Ph.D. (University of
  Michigan Museum of Zoology)


• Students: Adam Kriesberg, Morgan Daniels, Rebecca
  Frank, Julianna Barrera-Gomez, Jessica Schaengold,
  Gavin Strassel, Michele DeLia, Kathleen Fear, Mallory
  Hood, Molly Haig, Annelise Doll, Monique Lowe


Ixchel Faniel
fanieli@oclc.org



Questions?


