SlideShare a Scribd company logo
Big Data
Claire Choong
Learning & Research Librarian
(Scholarly Communications)
Etymology
Definitions: 3 Vs?
“huge in volume – consisting of terabytes or
petabytes of data
high in velocity – being created in or near
real-time
diverse in variety in type – being structured
and unstructured in nature, and often
temporally and spatially referenced”
(Kitchin, 2014)
Other key characteristics
exhaustive in scope ( n=all)
fine-grained in resolution
indexical in identification (able to be uniquely labelled and identified)
relational in nature (different datasets can be conjoined)
flexible – can add new fields easily
scalable - can expand in size rapidly
Small and Big Data
Small data Big Data
Volume Limited to large Very large
Velocity Slow, freeze-framed,
bundled
Fast, continuous
Variety Limited to wide Wide
Exhaustivity Samples Entire populations
Resolutions and
indexicality
Course and weak to
tight and strong
Tight and strong
Relationality Weak to strong Strong
Extensionability and
scalability
Low to middling High
Vagaries
The mythology of Big Data
“the widespread belief that large
data sets offer a higher form of
intelligence and knowledge that
can generate insights that were
previously impossible, with the
aura of truth, objectivity and
accuracy.”
boyd & Crawford
Ethics
Inequalities
Practicalities
Implications for the training of future academics – that’s you!
Institutional and cross-institutional infrastructures to support data
storage and processing capacity
Agreements and incentives for sharing data need to be drawn up
(e.g. Concordat on Open Research Data)
Ethical guidelines and protocols are needed
What do Big Data actually tell us?
what people
actually do (not
what they say
they do)
patterns of
behaviour
boyd with a small b
Big Data changes the definition of knowledge
Claims to objectivity and accuracy are misleading
Bigger data are not always better data
Taken out of context, Big Data loses its meaning
Just because it is accessible does not make it ethical
Limited access to Big Data creates new digital divides
These points should be carefully considered before utilising Big Data in research.
Conclusions
“Data should be cooked with care”
(Bowker (2005) in boyd and Crawford, 2012)
Big Data in practice
Fast food
Beer
Casinos
Supermarkets
Healthcare
Zooniverse
Researcher Development (Vitae) Framework
Sources
• boyd, d. and Crawford, K. (2012) ‘Critical questions for Big Data’, Information,
Communication & Society, 15(5), pp. 662-679.
• Davidag. (2011) ‘Drive Thru’. Available at: http://flic.kr/p/9X8hpQ. Accessed 9th
August 2017.
• Dinnen, P. (2010) ‘Sketch of Twitter Data Visualization’. Available at:
http://flic.kr/p/7MH2rf. Accessed 8th August 2017.
• Eynon, R. (2013) ‘The rise of Big Data: what does it mean for education, technology,
and media research?’, Learning, Media and Technology, 30(3), pp. 237-240.
• G4ll4is. (2013) ‘Privacy’. Available at: http://flic.kr/p/dZ2y6b. Accessed 8th August
2017.
• Kitchin, R. (2014) The Data Revolution, London: SAGE.
• Kitchin, R. and McArdle, G. (2016) ‘What makes Big Data, Big Data? Exploring the
ontological characteristics of 26 datasets’, Big Data & Society, January-June 2016, pp.
1-10.
Sources (2)
• Lebied, M. (2017) ‘5 big data examples in your real life at bars, restaurants and casinos’,
Datapine. Available at: http://www.datapine.com/blog/big-data-examples-in-real-life/.
Accessed 9th August 2017.
• Marr, B. (2016) ‘The most practical big data use cases of 2016’, Forbes. Available at:
https://www.forbes.com/sites/bernardmarr/2016/08/25/the-most-practical-big-data-use-
cases-of-2016. Accessed 9th August 2017.
• System of Ideas. (2012) ‘V’. Available at: http://flic.kr/p/bi2CPn. Accessed 8th August 2017.
• Yassan Yukky. (2011) ‘Cooking’. Available at: http://flic.kr/p/9tU7BB. Accessed 9th August
2017.

More Related Content

PDF
Understanding big data and data analytics big data
PPTX
Big data ppt
PPT
Big data
PPTX
Data science
PPTX
Big Data and Classification
PPTX
Big data
PPTX
Introduction to Big Data Analytics
PPTX
Big data
Understanding big data and data analytics big data
Big data ppt
Big data
Data science
Big Data and Classification
Big data
Introduction to Big Data Analytics
Big data

What's hot (19)

PPTX
PPTX
What is big data ? | Big Data Applications
PPTX
Data mining with big data implementation
PDF
Data minig with Big data analysis
PPTX
Big data
PPTX
Big data
 
PPTX
Big Data, NoSQL, NewSQL & The Future of Data Management
PDF
Big Data
PPTX
Big data Presentation
PPSX
Applications of Big Data Analytics in Businesses
PPTX
Big Data for Beginners
PPTX
Big data mining
PPTX
Big data
PPTX
Big data
PPTX
Introduction to big data
PPTX
An Introduction to Big Data
PDF
Big Data analytics best practices
PPTX
Big Data Applications | Big Data Application Examples | Big Data Use Cases | ...
What is big data ? | Big Data Applications
Data mining with big data implementation
Data minig with Big data analysis
Big data
Big data
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data
Big data Presentation
Applications of Big Data Analytics in Businesses
Big Data for Beginners
Big data mining
Big data
Big data
Introduction to big data
An Introduction to Big Data
Big Data analytics best practices
Big Data Applications | Big Data Application Examples | Big Data Use Cases | ...
Ad

Similar to Big data (20)

PPTX
Big data divided (24 march2014)
PPTX
Open data: Enhancing preservation, reproducibility, and innovation
 
PDF
AnalĂ­ticas del aprendizaje: una perspectiva crĂ­tica
 
PDF
Data Management and Broader Impacts: a holistic approach
PPTX
One View of Data Science
PPT
Data Sharing & Data Citation
PPTX
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
PPTX
“Big data” in human services organisations: Practical problems and ethical di...
 
PDF
Big data for qualitative research by kathy a. mills (z lib.org)
PPTX
Data as a research output and a research asset: the case for Open Science/Sim...
PPTX
Data Science and AI in Biomedicine: The World has Changed
PDF
CODATA International Training Workshop in Big Data for Science for Researcher...
PPT
Linking Data to Publications through Citation and Virtual Archives
PDF
Managing Metadata for Science and Technology Studies: the RISIS case
PPTX
Data Services at a Liberal Arts College Library
PPTX
Data Communities - reusable data in and outside your organization.
PPTX
Accessing and Using Big Data to Advance Social Science Knowledge
PPTX
State of the Art Informatics for Research Reproducibility, Reliability, and...
PPTX
Exploring human behaviour in interdisciplinary learning environments - Ali Fi...
PPTX
Next generation data services at the Marriott Library
Big data divided (24 march2014)
Open data: Enhancing preservation, reproducibility, and innovation
 
AnalĂ­ticas del aprendizaje: una perspectiva crĂ­tica
 
Data Management and Broader Impacts: a holistic approach
One View of Data Science
Data Sharing & Data Citation
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
“Big data” in human services organisations: Practical problems and ethical di...
 
Big data for qualitative research by kathy a. mills (z lib.org)
Data as a research output and a research asset: the case for Open Science/Sim...
Data Science and AI in Biomedicine: The World has Changed
CODATA International Training Workshop in Big Data for Science for Researcher...
Linking Data to Publications through Citation and Virtual Archives
Managing Metadata for Science and Technology Studies: the RISIS case
Data Services at a Liberal Arts College Library
Data Communities - reusable data in and outside your organization.
Accessing and Using Big Data to Advance Social Science Knowledge
State of the Art Informatics for Research Reproducibility, Reliability, and...
Exploring human behaviour in interdisciplinary learning environments - Ali Fi...
Next generation data services at the Marriott Library
Ad

Recently uploaded (20)

PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PPTX
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
PDF
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
PPTX
TNA_Presentation-1-Final(SAVE)) (1).pptx
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PDF
My India Quiz Book_20210205121199924.pdf
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
Practical Manual AGRO-233 Principles and Practices of Natural Farming
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
PDF
IGGE1 Understanding the Self1234567891011
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
 
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
Trump Administration's workforce development strategy
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
CHAPTER IV. MAN AND BIOSPHERE AND ITS TOTALITY.pptx
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
Unit 4 Computer Architecture Multicore Processor.pptx
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
TNA_Presentation-1-Final(SAVE)) (1).pptx
Share_Module_2_Power_conflict_and_negotiation.pptx
My India Quiz Book_20210205121199924.pdf
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
Paper A Mock Exam 9_ Attempt review.pdf.
Practical Manual AGRO-233 Principles and Practices of Natural Farming
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
IGGE1 Understanding the Self1234567891011
202450812 BayCHI UCSC-SV 20250812 v17.pptx
 
What if we spent less time fighting change, and more time building what’s rig...
Trump Administration's workforce development strategy
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
LDMMIA Reiki Yoga Finals Review Spring Summer

Big data

  • 1. Big Data Claire Choong Learning & Research Librarian (Scholarly Communications)
  • 3. Definitions: 3 Vs? “huge in volume – consisting of terabytes or petabytes of data high in velocity – being created in or near real-time diverse in variety in type – being structured and unstructured in nature, and often temporally and spatially referenced” (Kitchin, 2014)
  • 4. Other key characteristics exhaustive in scope ( n=all) fine-grained in resolution indexical in identification (able to be uniquely labelled and identified) relational in nature (different datasets can be conjoined) flexible – can add new fields easily scalable - can expand in size rapidly
  • 5. Small and Big Data Small data Big Data Volume Limited to large Very large Velocity Slow, freeze-framed, bundled Fast, continuous Variety Limited to wide Wide Exhaustivity Samples Entire populations Resolutions and indexicality Course and weak to tight and strong Tight and strong Relationality Weak to strong Strong Extensionability and scalability Low to middling High
  • 7. The mythology of Big Data “the widespread belief that large data sets offer a higher form of intelligence and knowledge that can generate insights that were previously impossible, with the aura of truth, objectivity and accuracy.” boyd & Crawford
  • 10. Practicalities Implications for the training of future academics – that’s you! Institutional and cross-institutional infrastructures to support data storage and processing capacity Agreements and incentives for sharing data need to be drawn up (e.g. Concordat on Open Research Data) Ethical guidelines and protocols are needed
  • 11. What do Big Data actually tell us? what people actually do (not what they say they do) patterns of behaviour
  • 12. boyd with a small b Big Data changes the definition of knowledge Claims to objectivity and accuracy are misleading Bigger data are not always better data Taken out of context, Big Data loses its meaning Just because it is accessible does not make it ethical Limited access to Big Data creates new digital divides These points should be carefully considered before utilising Big Data in research.
  • 13. Conclusions “Data should be cooked with care” (Bowker (2005) in boyd and Crawford, 2012)
  • 14. Big Data in practice Fast food Beer Casinos Supermarkets Healthcare Zooniverse
  • 16. Sources • boyd, d. and Crawford, K. (2012) ‘Critical questions for Big Data’, Information, Communication & Society, 15(5), pp. 662-679. • Davidag. (2011) ‘Drive Thru’. Available at: http://flic.kr/p/9X8hpQ. Accessed 9th August 2017. • Dinnen, P. (2010) ‘Sketch of Twitter Data Visualization’. Available at: http://flic.kr/p/7MH2rf. Accessed 8th August 2017. • Eynon, R. (2013) ‘The rise of Big Data: what does it mean for education, technology, and media research?’, Learning, Media and Technology, 30(3), pp. 237-240. • G4ll4is. (2013) ‘Privacy’. Available at: http://flic.kr/p/dZ2y6b. Accessed 8th August 2017. • Kitchin, R. (2014) The Data Revolution, London: SAGE. • Kitchin, R. and McArdle, G. (2016) ‘What makes Big Data, Big Data? Exploring the ontological characteristics of 26 datasets’, Big Data & Society, January-June 2016, pp. 1-10.
  • 17. Sources (2) • Lebied, M. (2017) ‘5 big data examples in your real life at bars, restaurants and casinos’, Datapine. Available at: http://www.datapine.com/blog/big-data-examples-in-real-life/. Accessed 9th August 2017. • Marr, B. (2016) ‘The most practical big data use cases of 2016’, Forbes. Available at: https://www.forbes.com/sites/bernardmarr/2016/08/25/the-most-practical-big-data-use- cases-of-2016. Accessed 9th August 2017. • System of Ideas. (2012) ‘V’. Available at: http://flic.kr/p/bi2CPn. Accessed 8th August 2017. • Yassan Yukky. (2011) ‘Cooking’. Available at: http://flic.kr/p/9tU7BB. Accessed 9th August 2017.