DATA JOURNALISM TRAINING
Day 1
WHAT IS DATA
Asking a question
Name Gender Age Height Feeling
Mandy F 21 150cm Swamped
Shani F 23 167cm Nervous
Zizo F 25 167cm Curious
Ashleigh F 22 163cm Relaxed
Danyal M 22 156cm Optimistic
Jason M 36 200cm Flustered
Hannah F 35 167cm Very excited
Phumlani M 24 180cm Grumpy
Milena F 29 160cm Excited
Data types
● QUALITATIVE DATA: everything that refers to the
quality of something. A description of the colours, texture
and feel of an object, a description of experiences, and
an interview are all qualitative data.
● QUANTITATIVE DATA: data that refers to a number.
Data types
● DISCRETE DATA: numerical data with values that
are distinct and separate, i.e. they can be counted.
Examples include the number of kittens in a litter and
the number of patients in a doctor's surgery.
● CONTINUOUS DATA: numerical data with a
continuous range. You can count, order and measure
continuous data. Examples include height, weight,
temperature and the amount of sugar in an orange.
● CATEGORICAL DATA: puts the item you are
describing into a category. Examples include
gender, colour and size.
● ORDINAL DATA: data that can be ranked (put in
order) or has a rating scale attached. You can count
and order, but not measure, ordinal data. Example: a
rating on a scale from 1 to 5.
Data types
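As a rough illustration, a few rows of the participant table from the "Asking a question" slide can be labelled with these types. This is only a sketch in Python: the `COLUMN_TYPES` mapping and the `summarise` helper are hypothetical names introduced here, and the labels are one reasonable reading of the definitions above (age, for instance, is treated as discrete only because it is recorded in whole years).

```python
# Three rows taken from the participant table on the "Asking a question" slide.
rows = [
    {"Name": "Mandy", "Gender": "F", "Age": 21, "Height_cm": 150, "Feeling": "Swamped"},
    {"Name": "Jason", "Gender": "M", "Age": 36, "Height_cm": 200, "Feeling": "Flustered"},
    {"Name": "Milena", "Gender": "F", "Age": 29, "Height_cm": 160, "Feeling": "Excited"},
]

# Hypothetical labels: one reasonable reading of the slide definitions.
COLUMN_TYPES = {
    "Name": "qualitative",
    "Gender": "categorical",
    "Age": "discrete",          # recorded in whole years, so it is counted
    "Height_cm": "continuous",  # measured; a value such as 150.4 is possible
    "Feeling": "qualitative",
}


def summarise(rows, column):
    """Summarise a column in a way that suits its data type."""
    values = [row[column] for row in rows]
    if COLUMN_TYPES[column] in ("discrete", "continuous"):
        # Numbers can be averaged.
        return sum(values) / len(values)
    # Categories and descriptions can only be tallied.
    counts = {}
    for value in values:
        counts[value] = counts.get(value, 0) + 1
    return counts


print(summarise(rows, "Height_cm"))
print(summarise(rows, "Gender"))
```

The point of the sketch is that the data type decides which questions make sense: an average height is meaningful, an average gender is not.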
Data types quiz
Role: Drummer
❏ Continuous Data
❏ Categorical Data
❏ Quantitative Data
Year Born: 1963
❏ Qualitative Data
❏ Discrete Data
❏ Continuous Data
❏ Categorical Data
Name: Rick Allen
❏ Quantitative Data
❏ Qualitative Data
❏ Discrete Data
Size: M
❏ Ordinal Data
❏ Categorical Data
❏ Continuous Data
Height: 187cm
❏ Discrete Data
❏ Categorical Data
❏ Continuous Data
❏ Qualitative Data
Date: 5th of March 2014
❏ Discrete Data
❏ Categorical Data
❏ Continuous Data
Jargon busting
Introduction to Data Journalism
Data pipeline
DATA ETHICS &
VERIFICATION
[Jason]
Good practices and basic ethics
● Save an original copy of the data and do not touch it.
● Paper trail: keep a log of every step that you take in the
analysis.
● Do not change the original columns. Duplicate them and make
the changes in the copies.
● Keep several drafts and look at how your analysis
developed.
● Spend time understanding your data. Read the methodology.
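The first three practices above might look like this in code. This is a minimal Python sketch under stated assumptions: the inline `ORIGINAL_CSV` text stands in for a real file on disk, and `add_clean_height` and the `Height_cm` column are hypothetical names chosen for the example.

```python
import copy
import csv
import io

# Stand-in for the original file. In practice it would be read from disk
# and the file itself would never be overwritten.
ORIGINAL_CSV = "Name,Height\nMandy,150cm\nJason,200cm\n"

original = list(csv.DictReader(io.StringIO(ORIGINAL_CSV)))
working = copy.deepcopy(original)  # every change happens on this copy
log = []                           # paper trail of each step taken


def add_clean_height(rows, log):
    """Add a Height_cm column next to Height instead of changing Height."""
    for row in rows:
        row["Height_cm"] = int(row["Height"].replace("cm", "").strip())
    log.append("Added Height_cm: Height without the 'cm' suffix, as an integer")


add_clean_height(working, log)

print(working[0])   # has both the untouched Height and the new Height_cm
print(original[0])  # still exactly as loaded
print(log)
```

Because the cleaned values live in a new column, any mistake in the cleaning step can be spotted by comparing the two columns side by side.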
Good practices and basic ethics
● Do not assume what the data is. Run an integrity check on each
column.
● Clean the data before interviewing it.
● Count the records. Cross-reference with the methodology.
Report any inconsistency and request the missing data or a
recount. Keep the total number of records in mind while analysing the data.
● If a result looks too good to be true, it probably is.
● Make a summary of the end results, as if you were writing a
press release. Look for mistakes.
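Counting the records and running an integrity check on each column could be sketched as follows. The data, the `EXPECTED_RECORDS` figure and the `check_column` helper are all assumptions made for illustration; a real check would use whatever rules the dataset's methodology describes.

```python
import csv
import io

# Hypothetical extract: one height is malformed and one age is missing.
DATA = "Name,Age,Height_cm\nMandy,21,150\nShani,23,one six seven\nZizo,,167\n"
EXPECTED_RECORDS = 3  # assumed figure, as if taken from the methodology

rows = list(csv.DictReader(io.StringIO(DATA)))


def check_column(rows, column):
    """Return (row_number, value) pairs that are empty or not whole numbers."""
    problems = []
    for number, row in enumerate(rows, start=1):
        value = row[column].strip()
        if not value.isdigit():
            problems.append((number, value))
    return problems


record_count_matches = len(rows) == EXPECTED_RECORDS
age_problems = check_column(rows, "Age")
height_problems = check_column(rows, "Height_cm")

print(record_count_matches)
print(age_problems)
print(height_problems)
```

Anything the check flags is a question to take back to the data source rather than something to fix silently.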
Good practices and basic ethics
● Have somebody else verify your work, preferably
somebody who knows nothing about your project.
● Check your biases and look at your data from new
angles
● Look for context that would explain your results to
yourself and to your audience
● e.g. Egypt worst country for women’s rights
● Bounce your results against experts
FINDING DATA
& DATA
SOURCES
Advanced search
● Google Advanced Search
● Wayback Machine – for the dead web (1996 onwards)
http://archive.org/web/
Search operators
● * (asterisk) – substitutes for a word, so your search
covers similar phrases
● cache: – lets you find the copy of a web page stored in Google’s
cache
● filetype: – looks for the specified file type
● link: – helps you find all the sites that link to a particular
page
Search operators
● ' ' or " " (quotation marks) – help you find the exact phrase
● + or AND – narrows your search by returning only results that
contain all of the words
● OR – expands your search by including either of two search
phrases
● - or NOT – tells the search engine to exclude a term
● e.g. Monsanto -"agent orange"
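The operators above can be combined into a single query. The following Python sketch shows one way to assemble such a string; `build_query` and its parameters are hypothetical names invented for this example, and the search URL at the end is an assumption about how the query would be passed to Google.

```python
from urllib.parse import quote_plus


def build_query(terms=(), exact=None, any_of=(), exclude=(), filetype=None):
    """Combine the search operators from the slides into one query string."""
    parts = list(terms)
    if exact:
        parts.append(f'"{exact}"')         # quotation marks: exact phrase
    if any_of:
        parts.append(" OR ".join(any_of))  # OR: either term may appear
    for term in exclude:
        # A leading minus excludes a term; multi-word phrases are quoted.
        parts.append(f'-"{term}"' if " " in term else f"-{term}")
    if filetype:
        parts.append(f"filetype:{filetype}")
    return " ".join(parts)


query = build_query(terms=["Monsanto"], exclude=["agent orange"], filetype="pdf")
print(query)

# Assumed URL form for running the query in a browser.
print("https://www.google.com/search?q=" + quote_plus(query))
```

Building the query in code is mainly useful when the same search has to be repeated for many names or file types.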
WHAT MAKES A
GOOD
VISUALISATION
?
What makes a good visualisation
For each of these visualisations, think about:
● What is the target audience?
● What is the key message?
● How successful are they in communicating the
message?
● What makes them stand out?
● How well are they explained?
● How simple or complex are they?
Source: The Economist
Source: BBC News
Source: The Guardian
Source: New York Times, Amanda Cox
Source: The Functional Art, Alberto Cairo
Source: Lower Saxony State Elections
Source: Population pyramid
Source: Hans Rosling, 200 Countries, 200 Years, 4 Minutes
Source: The Wall Street Journal
Source: Where does my money go, UK
Introduction to Data Journalism
Source: Where does my money go, UK
Source: Spending stories
Source: Driven by Data, Gregor Aisch
Source: The Guardian
Source: Transparency International
Source: The Guardian
Source: Migrations Map

Editor's Notes

  • #4: What do we mean by data when we do data journalism? Whether you began with a question or not, you should always keep your eyes open for unexpected patterns, unusual results, or anything that surprises you. Often, the most interesting stories aren’t the ones you were looking for.
  • #7: Discrete data is counted; continuous data is measured. Discrete data can only take certain values. Example: the number of students in a class (you can't have half a student). Continuous data can take any value within a range. Examples: a person's height, which could be any value within the range of human heights, not just certain fixed heights; time in a race, which you could even measure to fractions of a second; a dog's weight; the length of a leaf.
  • #10: Machine-readable data is in a format that can be easily processed by a computer. Digital ≠ machine-readable. Example: a PDF document containing tables of data is digital but not machine-readable, because a computer would struggle to access the tabular information even though it is very human-readable. The equivalent tables in a format such as a spreadsheet would be machine-readable. In general, HTML and PDF are *not* machine-readable.
  • #23: COMPARE
  • #24: COMPARE AND PUT IN CONTEXT: puts the loss of men and women in the Afghan war in context by comparing it with Vietnam and the Second World War
  • #25: Show trends
  • #26: SHOW TREND OVER TIME
  • #27: Trend over time. Compares different presidencies.
  • #30: Show trend over time. Tell a story. Engage, captivate. Compares countries.
  • #31: Patterns
  • #32: Personal angle
  • #33: Show hierarchy
  • #34: Personal angle: people see where they fit in the bigger picture. Compares, puts things into perspective.
  • #36: Show relations
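
To make the machine-readable point in note #10 concrete, here is a small Python sketch. It assumes the first rows of the participant table have been saved as CSV text (the `CSV_TEXT` string is a stand-in for such a file); the same table locked inside a PDF would need to be extracted before anything like this could run.

```python
import csv
import io

# Assumed CSV version of the first rows of the participant table.
CSV_TEXT = "Name,Gender,Age\nMandy,F,21\nShani,F,23\nDanyal,M,22\n"

# A few lines are enough to turn the text into rows a program can use.
rows = list(csv.DictReader(io.StringIO(CSV_TEXT)))
ages = [int(row["Age"]) for row in rows]
average_age = sum(ages) / len(ages)

print(rows[0]["Name"])
print(average_age)
```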