SlideShare a Scribd company logo
Online Data Analysis
for Librarians
Maine Academic Libraries Day 2015
Celia Emmelhainz
Colby College
Christine Murray
Bates College
Goals of this Training
At the end of this session, you should:
• know your online data analysis
options
• create a simple table from data
• be able to use SDA and the General
Social Survey to answer a reference
question
DATA?
What are
o Numbers
o Quotes
o Text
o GPS Points
o Materials
Social Science Data
“…the digital resources out of which social and
economic statistics are produced. The data do
not spontaneously spring into existence but are
produced from an intentional research
methodology.”
Geraci, D., Humphrey, C., Jacobs J. (2012) Data Basics.
http://3stages.org/class/2012/pdf/data_basics_2012.pdf
Microdata Aggregate data
Values for individual
observations
Summarized by
geography, groups, etc.
Source: US Census Bureau, 2005 PUMS data sample; PUMS estimates
Microdata
Advantages
• Allows statistical
analysis
• Greatest level of
detail
Disadvantages
• Must be processed
to be useful
• May need
expensive
statistical software
• Large,
cumbersome files
Advantages of Online Data
Analysis
• Pedagogical tool
• No assembly required
• No need for statistical software
• Do need to understand what data ‘mean’
What is SDA?
• Survey Documentation and Analysis, at
http://sda.berkeley.edu/archive.htm
• Web interface for analyzing data, creating
tables, and even some statistical analysis
• Berkeley archive contains General Social
Survey, National Election Survey, and
others
• Also used by other data archives (e.g.
ICPSR)
Published statistics vs. do it
yourself
Many data sources
will publish ready-
made tables of
statistics that you
can find online.
But what if it doesn’t
have the information
that you need?
http://www.norc.org/PDFs/GSS%20Reports/GSS_Trends%20in%20Gun%20Ownership_US_1972-2014.pdf
General Social Survey
a walkthrough
What is the GSS?
• Long-running opinion survey (since 1972)
• U.S. national sample
• Wide variety of attitudes on social issues,
plus some demographic info
• Useful for trends in public opinion
• Free to download or analyze online!
Making a table
Q: Are younger people more or less likely
to be concerned about racial issues?
1. Select variables
Here, AGE copied to Row:
2. View variable details in SDA
• Question
wording
• Frequency
table
3. Pull up a table in SDA
BUT… each age
is separated!
4: Recode (group) responses
Solution: recode the
variable and re-run!
5a: Interpreting the results
5b: Interpreting the results
Interesting, but…
why so many
missing cases?
5c: Check the year!
Look up
variable in
‘Codebook
by Year’
Result: this question was
only asked in one year.
Trends
Q: Have Americans become more
accepting of working mothers?
Multi-Year: Trends Over Time
Same steps as in
the Tables exercise,
but using the YEAR
variable
SDA: Charts
Chart Options allows you to create a simple line
graph.
Comparing Means
Q: Are people with higher income less
likely to support redistributive government
policies?
SDA: Comparing Means
“Means” tab
lets you
compare mean
of e.g. income
to how a
question is
answered.
Other
suggested exercises
Try Out the GSS yourself!
Play with any variables, or try to answer these:
• Are those who served longer in the military
(VETYEARS) more likely to support the DRAFT?
• Does your ZODIAC sign relate to your MARITAL
status?
• Does your REGION impact your thoughts on gun
control (GUNLAW)?
• Does household size (HOMPOP) relate to how
rushed you feel in life (RUSHED)?
• Does job satisfaction (SATJOB) vary according to
college DEGREE, SEX or RACE?
Questions?
Comments?
Contact Us
Christine Murray
cmurray2@bates.edu
Celia Emmelhainz
celia.emmelhainz@colby.edu

More Related Content

PPTX
Social Science Students: Making Census Data Work for You
PPTX
Ps rwebinar january2019final
PDF
Slides | Targeting the librarian’s role in research services
PDF
SLIDES | 12 time-saving tips for research support
PPTX
Data in The Classroom: It's Not Just for Nerds Anymore!
PPTX
ICPSR Data Services
PPTX
TeachingWithData.org -- Faculty Presentation
PPTX
Data matters-bournemouth-2015
Social Science Students: Making Census Data Work for You
Ps rwebinar january2019final
Slides | Targeting the librarian’s role in research services
SLIDES | 12 time-saving tips for research support
Data in The Classroom: It's Not Just for Nerds Anymore!
ICPSR Data Services
TeachingWithData.org -- Faculty Presentation
Data matters-bournemouth-2015

What's hot (20)

PDF
2-6-14 ESI Supplemental Webinar: The Data Information Literacy Project
PPT
SES grad student presentation
PPTX
Research Data Management in Academic Libraries: Meeting the Challenge
PDF
Data Services at the American University of Beirut: Challenges and Opportunit...
PPTX
Research Data Management in the Humanities and Social Sciences
PPTX
Expert engagement: practical researcher digital literacy provision at City - ...
PPTX
Open Data Bay Area: Interesting Problems in Academic Data
PPTX
Reputation Management for Early Career Researchers
PPTX
Xiao Hu "Overview of the Space of Learning Analytics and Educational Data Min...
PPT
Mike Thelwall: Introduction to Webometrics
PPTX
Developing a multiple-document-processing performance assessment for epistem...
PDF
Holmes "Institutional Infrastructure for Data Sharing"
PPTX
Instructional Data Sets from Q-step Launch Event (Univ of Exeter) 3-20-2014
PPTX
Herzog Building New Faculty Services: Altmetric Adoption
PPTX
Lee "Supporting Research Data is a Group Effort"
PPTX
Critical infrastructure to promote data synthesis
PPT
Data Library Instruction
PPTX
The Future of Data Science @ UVA
PPTX
The Power of Stories: Engaging your American Government Students
2-6-14 ESI Supplemental Webinar: The Data Information Literacy Project
SES grad student presentation
Research Data Management in Academic Libraries: Meeting the Challenge
Data Services at the American University of Beirut: Challenges and Opportunit...
Research Data Management in the Humanities and Social Sciences
Expert engagement: practical researcher digital literacy provision at City - ...
Open Data Bay Area: Interesting Problems in Academic Data
Reputation Management for Early Career Researchers
Xiao Hu "Overview of the Space of Learning Analytics and Educational Data Min...
Mike Thelwall: Introduction to Webometrics
Developing a multiple-document-processing performance assessment for epistem...
Holmes "Institutional Infrastructure for Data Sharing"
Instructional Data Sets from Q-step Launch Event (Univ of Exeter) 3-20-2014
Herzog Building New Faculty Services: Altmetric Adoption
Lee "Supporting Research Data is a Group Effort"
Critical infrastructure to promote data synthesis
Data Library Instruction
The Future of Data Science @ UVA
The Power of Stories: Engaging your American Government Students
Ad

Similar to Online Data Analysis for Librarians using SDA and the General Social Survey (20)

PPTX
Asa integrating data 2 19-2014 with cites
PDF
Using Data for Informed Decision Making
PPT
Presentation For Gene S Revision 3
PDF
Analysing students' digital experience: personas and key drivers
PPTX
Improving and Demonstrating Impact for Youth Using Qualitative Data
PDF
How data informs decision making 2
PPTX
Student Activity Hub community Meeting 10-25-2017
PPTX
G-51-Collecting-Effective-Data-in-Counseling.pptx
PPTX
Data Driven Dialogue
PPTX
Glfes summer institute2013_raleigh_final
PDF
CCSS 2014 Annual Conference
PPTX
Learning Analytics Primer: Getting Started with Learning and Performance Anal...
PPT
Epub compass 2012 ace_conference
PPT
MI-LIFE School Improvement Conference Preso
PPTX
Data informed decision-making
PPT
Dai karen
PDF
0602DATASUMMITTSTEWART(1).PDF
PPTX
Community needs assessment.pla_2014.handout
PPTX
From novice to expert: A critical evaluation of direct instruction
PPTX
Getting started with your 2020/21 digital experience insights surveys
Asa integrating data 2 19-2014 with cites
Using Data for Informed Decision Making
Presentation For Gene S Revision 3
Analysing students' digital experience: personas and key drivers
Improving and Demonstrating Impact for Youth Using Qualitative Data
How data informs decision making 2
Student Activity Hub community Meeting 10-25-2017
G-51-Collecting-Effective-Data-in-Counseling.pptx
Data Driven Dialogue
Glfes summer institute2013_raleigh_final
CCSS 2014 Annual Conference
Learning Analytics Primer: Getting Started with Learning and Performance Anal...
Epub compass 2012 ace_conference
MI-LIFE School Improvement Conference Preso
Data informed decision-making
Dai karen
0602DATASUMMITTSTEWART(1).PDF
Community needs assessment.pla_2014.handout
From novice to expert: A critical evaluation of direct instruction
Getting started with your 2020/21 digital experience insights surveys
Ad

More from Celia Emmelhainz (20)

PPTX
The Limits of Open Access
PPTX
Creating libraries where neurodiverse workers can thrive
PPTX
Holding Context: Anthropological Archives for the 21st Century
PPTX
Tips on Transcribing Qualitative Interviews
PPTX
Cleaning Quantitative Data and Coding Qualitative Data
PPTX
Finding Site Reports in Archaeology
PPTX
Organizing and Securing Ethnographic Field Materials.pptx
PPTX
Listening to Community Users in Academic Libraries.pptx
PPTX
Есімдер туралы жобасы
PPTX
Using Workforce Analytics to Improve Our Libraries
PPTX
Building out a cooperative digital humanities for Central Asia
PPTX
Examining the Scholarly Information Economy in America and Kazakhstan
PPTX
Video as Research Data: challenges and solutions in video data preservation
PPTX
Doing Ethnographic Research in Libraries (UCSD)
PPTX
From Conclusions to Community Impact
PPTX
Coding Your Results
PPTX
Using Ethnographic Methods in the Library
PPTX
Asking Anthropological Questions
PPTX
Questions to Ask Across the Ethnographic Lifecycle
PPTX
Thinking Critically about the Information Economy in America and Kazakhstan
The Limits of Open Access
Creating libraries where neurodiverse workers can thrive
Holding Context: Anthropological Archives for the 21st Century
Tips on Transcribing Qualitative Interviews
Cleaning Quantitative Data and Coding Qualitative Data
Finding Site Reports in Archaeology
Organizing and Securing Ethnographic Field Materials.pptx
Listening to Community Users in Academic Libraries.pptx
Есімдер туралы жобасы
Using Workforce Analytics to Improve Our Libraries
Building out a cooperative digital humanities for Central Asia
Examining the Scholarly Information Economy in America and Kazakhstan
Video as Research Data: challenges and solutions in video data preservation
Doing Ethnographic Research in Libraries (UCSD)
From Conclusions to Community Impact
Coding Your Results
Using Ethnographic Methods in the Library
Asking Anthropological Questions
Questions to Ask Across the Ethnographic Lifecycle
Thinking Critically about the Information Economy in America and Kazakhstan

Recently uploaded (20)

PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
Foundation of Data Science unit number two notes
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Business Analytics and business intelligence.pdf
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Mega Projects Data Mega Projects Data
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
IB Computer Science - Internal Assessment.pptx
ISS -ESG Data flows What is ESG and HowHow
Supervised vs unsupervised machine learning algorithms
oil_refinery_comprehensive_20250804084928 (1).pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Foundation of Data Science unit number two notes
climate analysis of Dhaka ,Banglades.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Business Analytics and business intelligence.pdf
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
STUDY DESIGN details- Lt Col Maksud (21).pptx
Introduction-to-Cloud-ComputingFinal.pptx
.pdf is not working space design for the following data for the following dat...
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Reliability_Chapter_ presentation 1221.5784
Business Acumen Training GuidePresentation.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Miokarditis (Inflamasi pada Otot Jantung)
Mega Projects Data Mega Projects Data
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
IB Computer Science - Internal Assessment.pptx

Online Data Analysis for Librarians using SDA and the General Social Survey

  • 1. Online Data Analysis for Librarians Maine Academic Libraries Day 2015 Celia Emmelhainz Colby College Christine Murray Bates College
  • 2. Goals of this Training At the end of this session, you should: • know your online data analysis options • create a simple table from data • be able to use SDA and the General Social Survey to answer a reference question
  • 4. o Numbers o Quotes o Text o GPS Points o Materials
  • 5. Social Science Data “…the digital resources out of which social and economic statistics are produced. The data do not spontaneously spring into existence but are produced from an intentional research methodology.” Geraci, D., Humphrey, C., Jacobs J. (2012) Data Basics. http://3stages.org/class/2012/pdf/data_basics_2012.pdf
  • 6. Microdata Aggregate data Values for individual observations Summarized by geography, groups, etc. Source: US Census Bureau, 2005 PUMS data sample; PUMS estimates
  • 7. Microdata Advantages • Allows statistical analysis • Greatest level of detail Disadvantages • Must be processed to be useful • May need expensive statistical software • Large, cumbersome files
  • 8. Advantages of Online Data Analysis • Pedagogical tool • No assembly required • No need for statistical software • Do need to understand what data ‘mean’
  • 9. What is SDA? • Survey Documentation and Analysis, at http://sda.berkeley.edu/archive.htm • Web interface for analyzing data, creating tables, and even some statistical analysis • Berkeley archive contains General Social Survey, National Election Survey, and others • Also used by other data archives (e.g. ICPSR)
  • 10. Published statistics vs. do it yourself Many data sources will publish ready- made tables of statistics that you can find online. But what if it doesn’t have the information that you need? http://www.norc.org/PDFs/GSS%20Reports/GSS_Trends%20in%20Gun%20Ownership_US_1972-2014.pdf
  • 11. General Social Survey a walkthrough
  • 12. What is the GSS? • Long-running opinion survey (since 1972) • U.S. national sample • Wide variety of attitudes on social issues, plus some demographic info • Useful for trends in public opinion • Free to download or analyze online!
  • 13. Making a table Q: Are younger people more or less likely to be concerned about racial issues?
  • 14. 1. Select variables Here, AGE copied to Row:
  • 15. 2. View variable details in SDA • Question wording • Frequency table
  • 16. 3. Pull up a table in SDA BUT… each age is separated!
  • 17. 4: Recode (group) responses Solution: recode the variable and re-run!
  • 19. 5b: Interpreting the results Interesting, but… why so many missing cases?
  • 20. 5c: Check the year! Look up variable in ‘Codebook by Year’ Result: this question was only asked in one year.
  • 21. Trends Q: Have Americans become more accepting of working mothers?
  • 22. Multi-Year: Trends Over Time Same steps as in the Tables exercise, but using the YEAR variable
  • 23. SDA: Charts Chart Options allows you to create a simple line graph.
  • 24. Comparing Means Q: Are people with higher income less likely to support redistributive government policies?
  • 25. SDA: Comparing Means “Means” tab lets you compare mean of e.g. income to how a question is answered.
  • 27. Try Out the GSS yourself! Play with any variables, or try to answer these: • Are those who served longer in the military (VETYEARS) more likely to support the DRAFT? • Does your ZODIAC sign relate to your MARITAL status? • Does your REGION impact your thoughts on gun control (GUNLAW)? • Does household size (HOMPOP) relate to how rushed you feel in life (RUSHED)? • Does job satisfaction (SATJOB) vary according to college DEGREE, SEX or RACE?
  • 29. Contact Us Christine Murray cmurray2@bates.edu Celia Emmelhainz celia.emmelhainz@colby.edu

Editor's Notes

  • #4: Ask people in the audience to start defining data Source: http://upload.wikimedia.org/wikipedia/commons/7/7b/Price-Earnings_Ratios_as_a_Predictor_of_Ten-Year_Returns_(Shiller_Data).png source of graphic
  • #5: The raw materials – numbers, spoken or written quotes, text in context, data points on a map–from which you can make arguments about the world Why do we need it? Empirical research depends on good data for our insights
  • #7: Microdata is a term for data available at the individual level. For example, if you had Census microdata, each line of the dataset would represent an individual person. Aggregate data, on the other hand, is summarized—each line of the data represents a sum, average, etc. of some characteristic, for a geographic area, or age range, or some other way of grouping individual observations together.
  • #12: Depends on depth + how skillled
  • #16: “View” will let you see the question as asked in the survey. This is important for understanding what the data mean. The frequency table tells you how many time a given answer to the question was found in the survey. In the case, there are very few people that said “Yes.”
  • #23: May add example using selection filter and control.
  • #24: Also can cut and past into Excel.
  • #25: Comparing means is appropriate if you have one variable that is quantitative—for example, income—and you want to see if it varies by another variable that is categorical.
  • #27: Depends on depth + how skillled
  • #28: CM: Income variable is hard to use b/c groupings created in 1970s and not adjusted for inflation. CE: Changed CONINC (income) to household size HOMPOP(r: 1-2; 3-4; 5-6; 7-8; 10-16) for comparison with RUSHED.
  • #29: Depends on depth + how skillled
  • #30: Count data is found in decennial census (every 10 years) and related business censuses Every ten years – surveys mailed plus 6-8 hh visits. Used Ffr distributing government money, knowing who’s where in an emergency, adjusting political districts to match changing population Characteristic data about the whole population is collected through ACS – American community survey. This is a “continuous” survey – 3.54 million residents a year (about 1% a year) of social and economic characteristics. Team up for fams divers council http://teamupforfamilies.com/wp-content/uploads/2013/02/iStock_000019224493Medium3.jpg Blackfam https://nomorerace.files.wordpress.com/2011/12/untitled9.png
  • #31: Where to get all census data – interactive tool for viewing reports and downloading data
  • #32: www2.census.gov Could go in and download the PUMS sample for 2005-2009 and 2008-2012 Esp. if need multiple variables – so if I want to calculate date of migration vs. age vs. language spoken at home, I could run it on statistical software and calculate it myself However for simple one-variable study like this, can also use aggregated tables
  • #33: Other sources – Statistical Abstracts Census.gov OECD iLibrary also has world data (zack osborne)
  • #37: Canada statistics – ODESI for social science and polling data UK DataService – includes census records and Qualidata USA – US Census Bureau Find data/ or let data find your topic. Searching/browsing (Stat Abs vs. Statistical Abstracts – good place to look
  • #38: Can you tell me what the census is used for? Census bureau collects data about “count” and “characteristics” of the population
  • #39: Can you tell me what the census is used for? Census bureau collects data about “count” and “characteristics” of the population