Data literacy
Chapter 2
Data: raw facts
Information: meaning of the data
Knowledge: insights gained from information
Wisdom: apply knowledge to action
Data :Red, traffic light
Information: South facing traffic light on AABC
street has turned red
Knowledge: Traffic light in my direction has turned
red
Wisdom: I need to stop the car
Data Pyramid
• Data literacy is the ability to understand ,
interpret and communicate with data
• Data security: protecting data from
unauthorized access, corruption or theft.
• Data privacy: determines who gets to see your
personal information.
• Data privacy and data security are often used
interchangeably but they are different from
each other.(t/f)
• ______ is the practice of protecting digital
information from unauthorized access,
corruption or theft throughout its entire
lifecycle.
• A) data security b) data literacy
• C) data privacy d) data acquisition
Q. Classify the following into Qualitative and Quantitative data.
which is a good park nearby?
Cricket score
Restaurant bill
Temperature
Gender
Shoe size
Favorite color
Weight of a person
Q. Classify the following into discrete and continuous data.
Number of students in a class
Height
Weight
temperature
only
Decimal points are allowed
Decimal points are not allowed
Song name year Song length artist
Blinding
lights
2019 3.2 harry
happy 2020 2.49 stephen
7 rings 2018 3 david
Is song length discrete or continuous?
Is song name qualitative or quantitative?
Is year qualitative or quantitative?
Is song length qualitative or quantitative?
• Data also is divided based on the domains of
AI.
• For Computer vision applications, data
required is images or videos.
• For Data Science applications, data required is
numbers.
• For Natural Language Processing applications,
data required is text or voice.
Data acquisition
• Data acquisition :collecting data
• Steps
1. Data discovery
2. Data augmentation
3. Data generation
• Data discovery: searching for data.
– Collecting data
• Data augmentation: adding more data to the
existing data.
• Data generation: generating data if data is not
available or recording data using sensors.
• creating new data.
• _____ refers to the data collection process
that involves gathering data from multiple
databases and data sources, cataloging said
data, and classifying the data for evaluation
and analysis.
• _____ is the process of artificially generating
new data from existing data, primarily to train
new machine learning (ML) models.
• _____ refers to creating or producing new
data.
Data is collected directly
from the source
• Classify into good data or bad data
• Information is well structured
• Information is scattered
• Accurate
• Incorrect
• clearly presented
• Contains relevant information
• Poorly presented
• Not relevant to the requirement
Data preprocessing
1. Structure
– Good structure
– Poor structure
2. Data cleaning
3. Accuracy: closeness to the actual value.
Good structure, Poor structure
• Text
• Spreadsheet
• Table
• video
Data cleaning
• Clean data from duplicates, missing value and
errors
accuracy
• How well data matches real- world values.
Features of data
• Characteristics or properties of the data.
• Student record
– Student name
– Age
– Class
These are the features of student record.
Data features
• Independent features
• Dependent features
Features of data
– Independent: input to the model or information
we provided to make prediction.
– Dependent: output of the model or prediction
• Independent features -> model->dependent
features
• Size of house
• No. of rooms house price
• location
• Previous mark
• Study time mark prediction
• Have extra tuition
• Sleep time
• Employed
• Monthly salary how much loan money?
• Have extra income?
• Have own land?
• gold
data_literacy_ch2_scientificst.pptx (data literacy)
Data processing and data interpretation
• Niki has 7 candies and ruchi has 4 candies.
How many candies do niki and ruchi have in
total?
• Data processing means operating on data to
produce meaningful information.
• Niki has 7 candies and ruchi has 4 candies.
• Who should get more candies so that both
Niki and Ruchi have an equal number of
candies?
• How many candies should they get?
• Data interpretation means analyzing data to
arrive at meaningful decisions.
Q. ____ relates to the manipulation of data to
produce meaningful insights.
a. Data processing
b. Data interpretation
c. Data analysis
d. Data presentation
Data Interpretation
• Quantitative Data Interpretation
• Qualitative Data Interpretation
Qualitative Data Interpretation
• Qualitative Data Interpretation analyses non-
numeric data.
• It analyses the emotions and feelings of
people.
data_literacy_ch2_scientificst.pptx (data literacy)
Examples of Qualitative Data Interprepation
• Trending sports
• Trending movies
• Trending athletes
Quantitative Data Interpretation
• Quantitative Data Interpretation is made on
numerical data.
• It helps us answer questions like “when”, “how
many” and “how often”?
• Examples
– No. of website visit.
– Cumulative grade point
– Height of students in a class
data_literacy_ch2_scientificst.pptx (data literacy)
Q. Classify the following into Quantitative data
interpretation and Qualitative data interpretation.
• Group activities is the best to learn things.
• 75% of students scored above 80% in maths. Higher
percentage indicates that teching method is effective.
• Library environment needs to be modernized.
• Interactive activities are better learning methods than
traditional lectures.
• The school might need to evaluate the homework policy,
as exceeding homework could be contributing to student
stress.
• Sales of sandwiches increased by 20% and that of sugary
drinks decreased by 15%. Students are opting for
healthier food options.
Q. Quantitative data is numerical in nature.(T/F)
Data Interpretation
• Textual DI
• Tabular DI
• Graphical DI
Textual DI
• Data is mentioned in the text form.
Tabular DI
• Data is represented systematically in the form
of rows and columns.
Graphical DI
data_literacy_ch2_scientificst.pptx (data literacy)
Q. Which among these is not a type of data
interpretation?
a. Textual
b. Tabular
c. Graphical
d. Raw data
Q. A bar graph is an example of ?
e. Textual
f. Tabular
g. Graphical

More Related Content

PPTX
Introducition to Data scinece compiled by hu
PPTX
Data science unit1
PDF
CS3352-Foundations of Data Science Notes.pdf
PPTX
Data Science Introduction to Data Science
PPTX
classIX_DS_Teacher_Presentation.pptx
PPTX
Classification & Clustering.pptx
PPTX
Advance Data Mining - Machine Learning -
PPTX
Data science.chapter-1,2,3
Introducition to Data scinece compiled by hu
Data science unit1
CS3352-Foundations of Data Science Notes.pdf
Data Science Introduction to Data Science
classIX_DS_Teacher_Presentation.pptx
Classification & Clustering.pptx
Advance Data Mining - Machine Learning -
Data science.chapter-1,2,3

Similar to data_literacy_ch2_scientificst.pptx (data literacy) (20)

PDF
Getting started in Data Science (April 2017, Los Angeles)
PPTX
Introduction to Data Analytics.pptx
PDF
Machinr Learning and artificial_Lect1.pdf
PDF
Getting Started in Data Science
PPTX
Fundamentals of Data science Introduction Unit 1
PPTX
On Information Quality: Can Your Data Do The Job? (SCECR 2015 Keynote)
PPTX
DS_Teacher_Presentation DS and Education.pptx
PPT
Dataanalysis
PPTX
Module 3 - Improving Current Business with External Data- Online
PDF
Career in Data Science (July 2017, DTLA)
PDF
How data informs decision making 2
PPTX
DataScienceandVisualization_Mod_1_ppt.pptx
PPTX
01-02-Data Use Training (Data Quality).pptx
PPTX
PPTX
Research Data Management
PPTX
Ub d chapter 13
PPTX
[DSC Europe 22] Machine learning algorithms as tools for student success pred...
PDF
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
PPTX
Introduction to data science
PPTX
Data Education project briefing for Royal Society
Getting started in Data Science (April 2017, Los Angeles)
Introduction to Data Analytics.pptx
Machinr Learning and artificial_Lect1.pdf
Getting Started in Data Science
Fundamentals of Data science Introduction Unit 1
On Information Quality: Can Your Data Do The Job? (SCECR 2015 Keynote)
DS_Teacher_Presentation DS and Education.pptx
Dataanalysis
Module 3 - Improving Current Business with External Data- Online
Career in Data Science (July 2017, DTLA)
How data informs decision making 2
DataScienceandVisualization_Mod_1_ppt.pptx
01-02-Data Use Training (Data Quality).pptx
Research Data Management
Ub d chapter 13
[DSC Europe 22] Machine learning algorithms as tools for student success pred...
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Introduction to data science
Data Education project briefing for Royal Society
Ad

Recently uploaded (20)

PDF
Journal of Dental Science - UDMY (2021).pdf
PDF
International_Financial_Reporting_Standa.pdf
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PDF
Myanmar Dental Journal, The Journal of the Myanmar Dental Association (2013).pdf
PDF
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
PDF
LEARNERS WITH ADDITIONAL NEEDS ProfEd Topic
PDF
Environmental Education MCQ BD2EE - Share Source.pdf
PDF
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
PDF
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
PDF
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 2).pdf
PPTX
DRUGS USED FOR HORMONAL DISORDER, SUPPLIMENTATION, CONTRACEPTION, & MEDICAL T...
PPTX
Introduction to pro and eukaryotes and differences.pptx
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PDF
CRP102_SAGALASSOS_Final_Projects_2025.pdf
PDF
Journal of Dental Science - UDMY (2022).pdf
PDF
Complications of Minimal Access-Surgery.pdf
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
Journal of Dental Science - UDMY (2021).pdf
International_Financial_Reporting_Standa.pdf
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
Share_Module_2_Power_conflict_and_negotiation.pptx
Myanmar Dental Journal, The Journal of the Myanmar Dental Association (2013).pdf
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
LEARNERS WITH ADDITIONAL NEEDS ProfEd Topic
Environmental Education MCQ BD2EE - Share Source.pdf
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 2).pdf
DRUGS USED FOR HORMONAL DISORDER, SUPPLIMENTATION, CONTRACEPTION, & MEDICAL T...
Introduction to pro and eukaryotes and differences.pptx
Unit 4 Computer Architecture Multicore Processor.pptx
CRP102_SAGALASSOS_Final_Projects_2025.pdf
Journal of Dental Science - UDMY (2022).pdf
Complications of Minimal Access-Surgery.pdf
FORM 1 BIOLOGY MIND MAPS and their schemes
Ad

data_literacy_ch2_scientificst.pptx (data literacy)

  • 2. Data: raw facts Information: meaning of the data Knowledge: insights gained from information Wisdom: apply knowledge to action Data :Red, traffic light Information: South facing traffic light on AABC street has turned red Knowledge: Traffic light in my direction has turned red Wisdom: I need to stop the car
  • 4. • Data literacy is the ability to understand , interpret and communicate with data
  • 5. • Data security: protecting data from unauthorized access, corruption or theft. • Data privacy: determines who gets to see your personal information.
  • 6. • Data privacy and data security are often used interchangeably but they are different from each other.(t/f) • ______ is the practice of protecting digital information from unauthorized access, corruption or theft throughout its entire lifecycle. • A) data security b) data literacy • C) data privacy d) data acquisition
  • 7. Q. Classify the following into Qualitative and Quantitative data. which is a good park nearby? Cricket score Restaurant bill Temperature Gender Shoe size Favorite color Weight of a person
  • 8. Q. Classify the following into discrete and continuous data. Number of students in a class Height Weight temperature only Decimal points are allowed Decimal points are not allowed
  • 9. Song name year Song length artist Blinding lights 2019 3.2 harry happy 2020 2.49 stephen 7 rings 2018 3 david Is song length discrete or continuous? Is song name qualitative or quantitative? Is year qualitative or quantitative? Is song length qualitative or quantitative?
  • 10. • Data also is divided based on the domains of AI. • For Computer vision applications, data required is images or videos. • For Data Science applications, data required is numbers. • For Natural Language Processing applications, data required is text or voice.
  • 11. Data acquisition • Data acquisition :collecting data • Steps 1. Data discovery 2. Data augmentation 3. Data generation
  • 12. • Data discovery: searching for data. – Collecting data • Data augmentation: adding more data to the existing data.
  • 13. • Data generation: generating data if data is not available or recording data using sensors. • creating new data.
  • 14. • _____ refers to the data collection process that involves gathering data from multiple databases and data sources, cataloging said data, and classifying the data for evaluation and analysis. • _____ is the process of artificially generating new data from existing data, primarily to train new machine learning (ML) models. • _____ refers to creating or producing new data.
  • 15. Data is collected directly from the source
  • 16. • Classify into good data or bad data • Information is well structured • Information is scattered • Accurate • Incorrect • clearly presented • Contains relevant information • Poorly presented • Not relevant to the requirement
  • 17. Data preprocessing 1. Structure – Good structure – Poor structure 2. Data cleaning 3. Accuracy: closeness to the actual value.
  • 18. Good structure, Poor structure • Text • Spreadsheet • Table • video
  • 19. Data cleaning • Clean data from duplicates, missing value and errors
  • 20. accuracy • How well data matches real- world values.
  • 21. Features of data • Characteristics or properties of the data. • Student record – Student name – Age – Class These are the features of student record.
  • 22. Data features • Independent features • Dependent features
  • 23. Features of data – Independent: input to the model or information we provided to make prediction. – Dependent: output of the model or prediction
  • 24. • Independent features -> model->dependent features • Size of house • No. of rooms house price • location
  • 25. • Previous mark • Study time mark prediction • Have extra tuition • Sleep time
  • 26. • Employed • Monthly salary how much loan money? • Have extra income? • Have own land? • gold
  • 28. Data processing and data interpretation • Niki has 7 candies and ruchi has 4 candies. How many candies do niki and ruchi have in total? • Data processing means operating on data to produce meaningful information.
  • 29. • Niki has 7 candies and ruchi has 4 candies. • Who should get more candies so that both Niki and Ruchi have an equal number of candies? • How many candies should they get? • Data interpretation means analyzing data to arrive at meaningful decisions.
  • 30. Q. ____ relates to the manipulation of data to produce meaningful insights. a. Data processing b. Data interpretation c. Data analysis d. Data presentation
  • 31. Data Interpretation • Quantitative Data Interpretation • Qualitative Data Interpretation
  • 32. Qualitative Data Interpretation • Qualitative Data Interpretation analyses non- numeric data. • It analyses the emotions and feelings of people.
  • 34. Examples of Qualitative Data Interprepation • Trending sports • Trending movies • Trending athletes
  • 35. Quantitative Data Interpretation • Quantitative Data Interpretation is made on numerical data. • It helps us answer questions like “when”, “how many” and “how often”? • Examples – No. of website visit. – Cumulative grade point – Height of students in a class
  • 37. Q. Classify the following into Quantitative data interpretation and Qualitative data interpretation. • Group activities is the best to learn things. • 75% of students scored above 80% in maths. Higher percentage indicates that teching method is effective. • Library environment needs to be modernized. • Interactive activities are better learning methods than traditional lectures. • The school might need to evaluate the homework policy, as exceeding homework could be contributing to student stress. • Sales of sandwiches increased by 20% and that of sugary drinks decreased by 15%. Students are opting for healthier food options.
  • 38. Q. Quantitative data is numerical in nature.(T/F)
  • 39. Data Interpretation • Textual DI • Tabular DI • Graphical DI
  • 40. Textual DI • Data is mentioned in the text form.
  • 41. Tabular DI • Data is represented systematically in the form of rows and columns.
  • 44. Q. Which among these is not a type of data interpretation? a. Textual b. Tabular c. Graphical d. Raw data Q. A bar graph is an example of ? e. Textual f. Tabular g. Graphical