16/05/2016 Quantitative Methods II (#SOC2031) 1
Quantitative Methods II
Seminar #11: Secondary analysis. Big data and open data
David Rozas
(drozas@surrey.ac.uk || @drozas || 26AD03)
Outline
● Big data
● A sociological perspective
● Open data
● Some examples
● Q&A
What is big data?
● Large and complex datasets that are difficult to process with traditional database systems
● Just the beginning!
Source: http://image.slidesharecdn.com/03-130821133227-phpapp02/95/cis13-big-data-analytics-vendor-perspective-insights-from-the-bleeding-edge-5-638.jpg?cb=1377092026
What is big data? 3Vs
● Challenges (Douglas, 2001) come from the expansion of:
– Volume: the amount of data
– Variety: the number of types of data
– Velocity: the speed at which data is produced and processed, often in real time
Source: http://itknowledgeexchange.techtarget.com/writing-for-business/files/2013/02/BigData.001.jpg
Only technical?
● Used by researchers, companies, governments, ...
● Incredible possibilities (OKFN, 2016)
– Understanding global challenges (e.g. climate change)
– Democratic accountability and governance (e.g. scrutinising governments)
– Science: free access to the sum of human knowledge, improving our understanding of the world
But...
● Privacy
● Surveillance
● Commodification of human behaviour: data selling
Sociological perspectives are necessary
Source: http://bilerico.lgbtqnation.com/images/freefood.jpg
Open data
● Some data should be freely available to everyone to use and republish as they wish
● In the context of Free Software, Open Hardware, Open Access, Free Culture, etc.
Source: https://nomadicutopianism.files.wordpress.com/2011/04/picture-71.png
Open definition
● Open Definition (2016):
– Availability and access: the data must be available as a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet.
– Reuse and redistribution: the data must be provided under terms that permit reuse and redistribution, including the intermixing with other datasets.
– Universal participation: everyone must be able to use, reuse and redistribute; there should be no discrimination against fields of endeavour or against persons or groups.
Source: https://en.wikipedia.org/wiki/Open_data#/media/File:Open_Data_stickers.jpg
Some examples
● Mapping, Consumer Data Research Centre
● Data from data.gov.uk
● http://maps.cdrc.ac.uk/#/geodemographics/imdcrimee10to15/default/
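As a rough illustration of what secondary analysis of an open dataset can look like in practice, the sketch below counts records per area and per category. It is only a minimal example under stated assumptions: the rows, the column names (`area`, `category`) and the helper `count_by` are made up for illustration and do not come from data.gov.uk or the CDRC maps.

```python
import csv
import io
from collections import Counter

# Made-up rows standing in for a CSV file downloaded from an open data portal.
# The column names and values are hypothetical.
SAMPLE_CSV = """area,category
Guildford,burglary
Guildford,vehicle crime
Woking,burglary
Guildford,burglary
"""


def count_by(rows, column):
    """Count how many records share each value of `column`."""
    return Counter(row[column] for row in rows)


# Parse the CSV text into a list of dictionaries, one per record.
rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

per_area = count_by(rows, "area")
per_category = count_by(rows, "category")

print(per_area)
print(per_category)
```

With a real download, `io.StringIO(SAMPLE_CSV)` would be replaced by an opened file, and the column names would need to match those published with the dataset.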
Some examples
● Sentiment analysis (computational linguistics to identify subjective information in source materials), NC State University
● Data from dev.twitter.com
● https://www.csc.ncsu.edu/faculty/healey/tweet_viz/tweet_app/
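To give a feel for how sentiment analysis can identify subjective information in text, here is a minimal lexicon-based sketch. It is a toy under stated assumptions: the word lists, the example tweets and the functions `sentiment_score` and `label` are invented for illustration, and this is not the method used by the NC State University tool.

```python
import re

# Tiny hand-made lexicon, purely illustrative (not a published resource).
POSITIVE = {"good", "great", "love", "happy", "excellent"}
NEGATIVE = {"bad", "terrible", "hate", "sad", "awful"}


def sentiment_score(text):
    """Return (number of positive words) - (number of negative words)."""
    words = re.findall(r"[a-z']+", text.lower())
    positives = sum(word in POSITIVE for word in words)
    negatives = sum(word in NEGATIVE for word in words)
    return positives - negatives


def label(text):
    """Map the numeric score to a coarse sentiment label."""
    score = sentiment_score(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


# Invented example tweets.
tweets = [
    "I love this seminar, great examples!",
    "Terrible traffic today, I hate Mondays",
    "The lecture starts at 10am",
]
labels = [label(tweet) for tweet in tweets]
print(labels)
```

Counting words like this ignores negation ("not good"), sarcasm and context, which is one reason real tools rely on larger lexicons or trained models.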
References
● Douglas, L. (2001). 3D data management: Controlling data volume, velocity and variety. Gartner. Retrieved, 6.
● OKFN (2016). Open Knowledge. Retrieved 13th May 2016, from https://okfn.org/
● Open Definition (2016). The Open Definition. Retrieved 13th May 2016, from http://opendefinition.org/od/2.1/en/
That's all! Questions?
Thanks!
Danke!
Grazie!
¡Gracias!
Obrigado!
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.
To view a copy of this license, please visit: http://creativecommons.org/licenses/by-sa/4.0/
contact:
● drozas@surrey.ac.uk || www.davidrozas.com
● @drozas
