data & content design
Frieda Brioschi - frieda.brioschi@gmail.com
Emma Tracanella - emma.tracanella@gmail.com
HOW TO COLLECT AND ORGANIZE DATA
LESSON 2 - 2019/20
A QUICK INTRO
LET’S START
PRESENT YOUR DATA
DATA IS ALL AROUND US
DATA COLLECTION METHODS
WHAT ARE DATA
Data are individual units of information.
A datum describes a single quality or quantity of some object or phenomenon.
Data are measured, collected and reported, and analyzed, whereupon they can
be visualized using graphs, images or other analysis tools.
PRIMARY VS SECONDARY DATA
▸ Primary data is data that is observed or collected from first-hand sources
▸ Secondary data is data gathered from studies, surveys, or experiments that
have been run by other people
QUALITATIVE VS QUANTITATIVE
▸ Quantitative data comes in the form of numbers, quantities and values.
Pro: it’s concrete and easily measurable.
▸ Qualitative data is descriptive, based on attributes.
It helps to explain the “why” behind the information quantitative data reveals.
PRIMARY DATA COLLECTION
▸ Observation
▸ Surveys & Questionnaires
▸ Interviews
▸ Focus Groups
HOW
PRIMARY DATA COLLECTION
▸ In-Person Interviews
Pros: In-depth, with a high degree of confidence in the data
Cons: Time-consuming, expensive, and can be dismissed as anecdotal
▸ Mail Surveys
Pros: Can reach anyone and everyone, with no barrier
Cons: Expensive, data collection errors, lag time
▸ Phone Surveys
Pros: High degree of confidence in the data collected, can reach almost anyone
Cons: Expensive, cannot be self-administered, need to hire an agency
▸ Web/Online Surveys
Pros: Cheap, can be self-administered, very low probability of data errors
Cons: Not all your customers might have an email address or be on the internet; customers may be wary of
divulging information online.
BIAS
Bias in data collection is a distortion that makes the information not truly representative
of the situation you are trying to investigate. Bias occurs, for example, when a systematic error is
introduced into sampling or testing by selecting or encouraging one outcome or answer over others.
It can result from:
▸ survey questions that are constructed with a particular slant
▸ choosing a known group with a particular background to respond to surveys
▸ reporting data in misleading categorical groupings
▸ non-random selections when sampling
▸ systematic measurement errors
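As a rough illustration of the "known group" and "non-random selection" points above, the sketch below compares a hand-picked sample with a random one. The population (hours of sleep for 1,000 people) is invented for the example and is not data from this lesson.

```python
import random
import statistics

random.seed(42)  # fixed seed so the sketch is reproducible

# Hypothetical population: hours of sleep for 1,000 people (made-up numbers).
population = [random.gauss(7.0, 1.0) for _ in range(1000)]

# Biased selection: only ask the 100 people who sleep the most (a "known group").
convenience_sample = sorted(population)[-100:]

# Random selection: every person has the same chance of being picked.
random_sample = random.sample(population, 100)

population_mean = statistics.mean(population)
biased_mean = statistics.mean(convenience_sample)
random_mean = statistics.mean(random_sample)

print(f"population: {population_mean:.2f} h")
print(f"hand-picked group: {biased_mean:.2f} h")
print(f"random sample: {random_mean:.2f} h")
```

The hand-picked group overstates the average because of how it was selected, while the random sample should land much closer to the population value.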
CASE STUDY: TAY.AI
Tay was an artificial intelligence chatter bot that was originally released by
Microsoft via Twitter on March 23, 2016.
It caused subsequent controversy when the bot began to post inflammatory and
offensive tweets through its Twitter account, causing Microsoft to shut down the
service only 16 hours after its launch.
SECONDARY DATA SOURCES
▸ Our data:
▸ Personal information, likes, activities and interests (Facebook, Instagram,
YouTube, …)
▸ Personal data (from mobile phone)
APPLE DATA HEALTH
▸ Heart rate, sleeping habits, workouts,
steps and walking routines
▸ Introduced in September 2014 with iOS
8, the Apple Health app is pre-installed
on all iPhones.
▸ Low-energy sensors constantly
collect information about the user’s
physical activities. With optional extra
hardware (e.g. Apple Watch), Apple
Health can collect significantly more
information.
SECONDARY DATA SOURCES
▸ Other data:
▸ Public data sets
▸ Historical data
FLIGHTRADAR24
▸ Flightradar24 is a global flight tracking
service that provides you with real-time
information about thousands of aircraft
around the world.
▸ Flightradar24 tracks 180,000+ flights, from
1,200+ airlines, flying to or from 4,000+
airports around the world in real time.
▸ https://www.flightradar24.com
HISTORICAL CLIMATE DATA
▸ Many of the historical sources available to
climate historians mention weather in some
way, but these references are buried in a huge
volume of information.
▸ In recent years initiatives have transcribed,
quantified, and digitized:
a) historical observations,
b) historical activities that must have been
strongly influenced by weather.
▸ https://www.historicalclimatology.com/databases.html
ATLAS OF URBAN EXPANSION
▸ As of 2010, the world contained 4,231 cities with
100,000 or more people.
▸ The Atlas of Urban Expansion collects and analyzes
data on the quantity and quality of urban
expansion in a stratified global sample of 200
cities.
▸ The Atlas presents the output of the first two
phases of the Monitoring Global Urban Expansion
Program, an initiative that gathers data and
evidence on cities worldwide.
▸ http://atlasofurbanexpansion.org/cities/view/Milan
THE MOST POPULOUS CITY THROUGH TIME
▸ https://www.youtube.com/watch?v=pMs5xapBewM
DATA COLLECTION MAY BE AFFECTED BY
THEIR USE!
We
PROCESSING
DATA
STRUCTURED DATA
Structured data is usually contained in rows and columns, and its elements can be mapped into a fixed,
predefined model. Examples of sources:
▸ SQL Databases
▸ Spreadsheets such as Excel
▸ OLTP Systems
▸ Online forms
▸ Sensors such as GPS or RFID tags
▸ Network and Web server logs
▸ Medical devices
UNSTRUCTURED DATA
Unstructured data is data that cannot be contained in a row-column format and doesn’t have a data
model. Examples of sources:
▸ Web pages
▸ Images (JPEG, GIF, PNG, etc.)
▸ Videos
▸ Memos
▸ Reports
▸ Word documents and PowerPoint presentations
▸ Surveys
SEMI-STRUCTURED DATA
Semi-structured data is a mix of the previous two: it has some defining or
consistent characteristics but doesn’t conform to a rigid structure. Examples of sources:
▸ E-mails
▸ XML and other markup languages
▸ Binary executables
▸ TCP/IP packets
▸ Zipped files
▸ JSON
▸ Web pages
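To make the difference between semi-structured and structured data more concrete, here is a minimal sketch that reads a small JSON document and maps it onto fixed columns. The records, field names and column list are hypothetical, chosen only for illustration.

```python
import json

# Semi-structured input: every record is a JSON object, but the fields vary.
raw = """
[
  {"name": "Student A", "country": "Italy", "wakeUpTime": "07:30"},
  {"name": "Student B", "country": "Australia"},
  {"name": "Student C", "wakeUpTime": "08:15", "notes": "later on weekends"}
]
"""
records = json.loads(raw)

# Structured target: a fixed, predefined set of columns (an assumed model).
columns = ["name", "country", "wakeUpTime"]

# Missing fields become None; fields outside the model (e.g. "notes") are dropped.
rows = [tuple(record.get(column) for column in columns) for record in records]

for row in rows:
    print(row)
```

The JSON keeps whatever each record happens to contain, while the row-and-column form forces every record into the same shape, at the cost of empty cells and discarded extras.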
DATA CLEANING - TIME
DATA CLEANING
DATA CLEANING - COUNTRY
DATA CLEANING
▸ Italy - 3
▸ Italy (with space) - 2
▸ Italia
▸ Pisa, Italy
▸ Milan
▸ Milan italy
▸ South Korea - 2
▸ South Korea
▸ Egypt
▸ Mexico
▸ Serbia
▸ The Netherlands
▸ Norway
▸ Taiwan
▸ Taiwan
▸ Costa Rica
▸ Macedonia
▸ Turkey
▸ Australia
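One possible way to clean answers like these is sketched below. The lookup table and the decision to map a city such as Milan to its country are assumptions made for this example, not a rule stated in the lesson.

```python
from collections import Counter

# Hypothetical lookup table: lower-cased variants mapped to one canonical name.
CANONICAL = {
    "italy": "Italy",
    "italia": "Italy",
    "milan": "Italy",  # assumption: a city answer is counted as its country
    "south korea": "South Korea",
}


def clean_country(raw: str) -> str:
    """Return a canonical country name for a free-text answer."""
    text = raw.strip()                           # "Italy " -> "Italy"
    key = text.split(",")[-1].strip().lower()    # "Pisa, Italy" -> "italy"
    if key in CANONICAL:
        return CANONICAL[key]
    words = key.split()
    last_word = words[-1] if words else ""       # "milan italy" -> "italy"
    # Unknown answers are returned unchanged (apart from trimmed spaces).
    return CANONICAL.get(last_word, text)


answers = ["Italy", "Italy ", "Italia", "Pisa, Italy", "Milan",
           "Milan italy", "South Korea", "The Netherlands"]
counts = Counter(clean_country(answer) for answer in answers)
print(counts)
```

After cleaning, the six different spellings collapse into a single "Italy" category, which is what makes counting and charting possible.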
DATA CLEANING - NAME
▸ Greta Scuso
▸ Vittoria
▸ Soonji Kwun
▸ Rewan
▸ Aurora
▸ Neithan
▸ Nadja
▸ Andrea
▸ Nadia van 't Klooster
▸ Yeso Lee
▸ Hanne Heimdal
▸ Hsin Yi Chen
▸ Yuri Michieletti
▸ Alessandro Calzoni
▸ Giulia Filippi
▸ Elena Fantini
▸ Stasha
▸ Eugenio Tonoli
▸ Ahmet Karan Oner
▸ Eileen
▸ Matteo
DON’T BE AFRAID OF DATABASES
WHAT IS A DB?
According to Wikipedia “a database is an organized collection of data, generally
stored and accessed electronically from a computer system”.
Ideally it is organized in such a way that it can be easily accessed, managed, and
updated.
DB JARGON: QUERY
When you want to perform an operation on data stored in a db, you should run a
query. This is typically one of SELECT, INSERT, UPDATE, or DELETE.
SELECT wakeUpTime FROM dCDCourse
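The four query types can be tried with Python's built-in sqlite3 module and an in-memory database. This is only a sketch: the table and column names dCDCourse and wakeUpTime come from the query above, while the student column and the sample rows are assumptions.

```python
import sqlite3

# In-memory database: nothing is written to disk.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE dCDCourse (student TEXT, wakeUpTime TEXT)")

# INSERT: add rows (hypothetical students and times).
conn.executemany(
    "INSERT INTO dCDCourse (student, wakeUpTime) VALUES (?, ?)",
    [("Student A", "07:30"), ("Student B", "08:15")],
)

# UPDATE: change an existing row.
conn.execute(
    "UPDATE dCDCourse SET wakeUpTime = ? WHERE student = ?",
    ("06:45", "Student A"),
)

# DELETE: remove a row.
conn.execute("DELETE FROM dCDCourse WHERE student = ?", ("Student B",))

# SELECT: read data back, as in the query on the slide.
wake_up_times = [row[0] for row in conn.execute("SELECT wakeUpTime FROM dCDCourse")]
print(wake_up_times)
```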
DB JARGON: TRANSACTION
When you need to perform a sequence of operations as a single unit of work,
that’s a transaction.
If one of you decides to withdraw from this course, I need to update both the
list of students enrolled in the course and the total count of students. If I don’t
operate inside a transaction, there is a moment when one piece of information (the list of
students or the total count) is wrong.
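The withdrawal example could look roughly like this with Python's sqlite3 module. The table names (enrolled, course_stats) and the sample students are hypothetical; the point is only that both changes happen inside one transaction.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE enrolled (student TEXT PRIMARY KEY)")
conn.execute("CREATE TABLE course_stats (course TEXT PRIMARY KEY, total INTEGER)")
conn.executemany(
    "INSERT INTO enrolled VALUES (?)",
    [("Student A",), ("Student B",), ("Student C",)],
)
conn.execute("INSERT INTO course_stats VALUES ('dCD', 3)")
conn.commit()


def withdraw(connection: sqlite3.Connection, student: str) -> None:
    """Remove a student and lower the total count as one unit of work."""
    # "with connection" commits if both statements succeed
    # and rolls back if either of them raises an error.
    with connection:
        connection.execute("DELETE FROM enrolled WHERE student = ?", (student,))
        connection.execute(
            "UPDATE course_stats SET total = total - 1 WHERE course = 'dCD'"
        )


withdraw(conn, "Student B")
remaining = conn.execute("SELECT COUNT(*) FROM enrolled").fetchone()[0]
total = conn.execute("SELECT total FROM course_stats").fetchone()[0]
print(remaining, total)
```

Both numbers change together, so the list and the count agree once the transaction is committed.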
DB JARGON: ACID
Wikipedia: ACID (Atomicity, Consistency, Isolation, Durability) is a set of properties of database
transactions intended to guarantee validity even in the event of errors, power failures, etc.
▸ Atomicity means that you guarantee that either all of the transaction succeeds or none of
it does.
▸ Consistency means that every transaction takes the database from one valid state to another, so all data respects the defined rules and constraints.
▸ Isolation guarantees that all transactions will occur in isolation. No transaction will be
affected by any other transaction.
▸ Durability means that, once a transaction is committed, it will remain permanently in the
system.
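Atomicity is the easiest of the four properties to observe directly. The sketch below, again using Python's sqlite3 module with a hypothetical enrolled table, simulates a failure halfway through a transaction and checks that the first step was undone.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE enrolled (student TEXT PRIMARY KEY)")
conn.execute("INSERT INTO enrolled VALUES ('Student A')")
conn.commit()

try:
    # The connection used as a context manager rolls back on any exception.
    with conn:
        conn.execute("DELETE FROM enrolled WHERE student = 'Student A'")
        # Simulated failure before the remaining steps of the transaction.
        raise RuntimeError("simulated failure")
except RuntimeError:
    pass

# The DELETE was rolled back, so the student is still enrolled.
still_enrolled = conn.execute("SELECT COUNT(*) FROM enrolled").fetchone()[0]
print(still_enrolled)
```

The example covers atomicity only; isolation and durability depend on how the database engine handles concurrent access and storage, which an in-memory sketch like this does not exercise.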
DEAR DATA
GIORGIA LUPI