SlideShare a Scribd company logo
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 1/48
MUKALELE
Rogers
MULINDA
Saddati
TUSABA Pauline
Joan
MUSANA
Evans
Kwesiga Allan
13/U/21067/EVE 13/U/21076/EV
E
13/U/21363/EVE 13/U/21078/EVE 13/U/20996/EV
E
213024992 213008565 213003883 213004582 213010254
Makerere University
Tuesday, 28 July 2015 - 11:30 AM
GROUP ASSIGNMENT 3:
BIS 1204: Data and Information Management I
by Mr. ATUGONZA Martin
NORMALIZATION
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 2/48
Presentation Objectives
• The purpose of normalization.
• How normalization can be used during database design.
• The update anomalies associated with data redundancy.
• The concept of functional dependencies, which describe the
relationship between attributes.
• How to undertake the process of normalization.
• How to identify the most commonly used normal forms: First Normal
Form(1NF), Second Normal Form (2NF), and Third Normal Form
(3NF).
• Introduction to Advanced Normalisation: The Boyce–Codd Normal
Form (BCNF) and higher normal forms.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 3/48
The purpose of normalization.
• When tasked to design a relational database for an enterprise,
we are given the user and data requirements, and our objective
is to generate a set of relations (tables) that allow us to store
information without unnecessary redundancy, yet also allows us
to retrieve information easily.
• The purpose of normalization is to identify a suitable set of
relations:
– that support the data requirements of an enterprise.
– containing a minimal number of only NECESSARY attributes.
– where attributes with a close logical relationship are found in the
same relation;
– having minimal data redundancy ( with the important exception of
foreign keys essential for the joining of related relations.)
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 4/48
Normalization is a technique for producing a set
of relations with desirable properties, given the
data requirements of an enterprise.
Please Note:
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 5/48
How normalization can be used during
database design.
• Approach 1: Normalization can be used as a bottom up
standalone database design technique to create a set of
relations.
• Approach 2:Normalization can be used as a validation
technique to check the structure of relations, which may
have been created using a top-down approach such as ER
modeling.
• No matter which approach is used, the goal is the same:
that of creating a set of well-designed relations that meet
the data requirements of the enterprise.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 6/48
How normalization can be used during
database design (cont)
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 7/48
Data Redundancy
• Data redundancy is a direct result of storing copies of the
same data in more than one place in a database
(duplication of data), which can lead to inconsistencies if
some of the copies are modified and thus loss of data
integrity.
• Benefits of reducing data redundancy include:
– updates to the data stored in the database are achieved with a
minimal number of operations
– Reduction in the opportunities for data inconsistencies;
– Reduction in the file storage space required by the base relations
thus minimizing costs.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 8/48
Data Redundancy: Case Study
StudentDetail
We illustrate the problems associated with unwanted
data redundancy by comparing the StudentDetail relation
shown below, with the Student, District and Hostel relations on
the next slide.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 9/48
Data Redundancy: Case Study (cont)
Student
District Hostel
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 10/48
Data Redundancy
• StudentDetail relation has redundant data: details of a
hotel are repeated for every student.
• In contrast, hostel information appears only once for
each hostel in Hostel relation and only HostelID is
repeated in Staff relation, to represent where each
student resides.
• The same case applies for districts.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 11/48
Update Anomalies
• Relations that contain redundant information may
potentially suffer from update anomalies.
Types of update anomalies include:
– Modification
– Insertion
– Deletion
(a) Modification anomalies
• If we want to change the value of one of the attributes of a
particular hostel in the StudentDetail relation, for example
the address for hostel number HOST07, we must update
the tuples of all students residing at that hostel.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 12/48
Update Anomalies (cont)
(b) Insertion Anomalies:
– Every time we insert the details of new students into the
StudentDetail relation, we must include the details of the hostel at
which the students are to reside.
– To insert details of a new hostel that currently has no students into
the StudentDetail relation, it is necessary to enter nulls into the
attributes such as staffNo, which violates entity integrity.
(c) Deletion Anomalies
– If we delete a tuple from the StudentDetail relation that
represents the last student residing at a hostel, the details about
that hostel are also lost from the database.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 13/48
Properties associated with decomposition
of relations
• There are two important properties associated with
decomposition of a larger relation into smaller relations:
- Lossless-join property enables us to find any instance of
original relation from corresponding instances in the
smaller relations.
- Dependency preservation property enables us to enforce
a constraint on original relation by enforcing some
constraint on each of the smaller relations.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 14/48
The concept of
Functional Dependencies
• Functional Dependency
– Describes relationship between attributes in a relation.
– If A and B are attributes of relation R, B is functionally
dependent on A (denoted A  B), if each value of A is
associated with exactly one value of B.
• Determinant of a functional dependency refers to
attribute or group of attributes on left-hand side of
the arrow.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 15/48
Functional dependency is One of the main
concepts associated with normalization, which
describes the relationship between attributes in a
relation. For example, if A and B are attributes of
relation R, B is functionally dependent on A
(denoted A → B), if each value of A is associated
with exactly one value of B. (A and B may each
consist of one or more attributes.)
Please Note:
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 16/48
Types of Functional Dependencies
• Full functional dependency: If A and B are attributes of a
relation, B is fully functionally dependent on A if B is
functionally dependent on A, but not on any proper subset
of A.
• Partial Functional Dependency: A functional dependency
A→B is a partial dependency if there is some attribute that
can be removed from A and yet the dependency still holds.
• Transitive Dependency: A condition where A, B, and C are
attributes of a relation such that if A → B and B → C, then C
is transitively dependent on A via B (provided that A is not
functionally dependent on B or C).
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 17/48
Characteristics of functional dependencies
used in normalisation
1. There is a one-to-one relationship between the attribute(s) on the
left-hand side (determinant) and those on the right-hand side of a
functional dependency. (Note that the relationship in the opposite
direction, that is from the right- to the left-hand side attributes, can
be a one-to-one relationship or one-to-many relationship.)
2. They hold for all time. Eg. The Dependancy SName  RegNo does
not hold for all time but RegNo  Sname holds for all time.
3. The determinant has the minimal number of attributes necessary to
maintain the dependency with the attribute(s) on the right-hand
side. In other words, there must be a full functional dependency
between the attribute(s) on the left- and right-hand sides of the
dependency.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 18/48
The process of normalization
• Formal process for analyzing a relation based on its primary
key and functional dependencies between its attributes.
• Often executed as a series of steps. Each step corresponds
to a specific normal form, which has known properties.
• As normalization proceeds, relations become progressively
more restricted (stronger) in format and also less
vulnerable to update anomalies.
• Four most commonly used normal forms are first (1NF),
second (2NF) and third (3NF) normal forms, and Boyce–
Codd normal form (BCNF). Other forms are 4NF, 5NF and
other Higher Normal forms.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 19/48
The
process
of
normalization
cont’d
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 20/48
Unnormalized Form (UNF)
• A table that contains one or more repeating
groups.
– Note: A repeating group is an attribute or group of
attributes within a table that occurs with multiple values
for a single occurrence of the nominated key attributes
for that table. For example a book with multiple authors, A
client with many properties, etc.
• To create an unnormalized table:
– transform data from information source (e.g. form)
into table format with columns and rows.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 21/48
Unnormalized Form (UNF) cont’d
Stage 1: Paper
Forms (Data Source)
Stage 2: Table in UNF
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 22/48
First Normal Form (1NF)
• First Normal Form (1NF) is a relation in which
the intersection of each row and column
contains one and only one value.
• A table is in First Normal Form (1NF) if all its
attributes are atomic, i.e. are considered to be
indivisible units.
• It should have no composite attributes, no
multivalued attributes and no repeating groups.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 23/48
First Normal Form (1NF) is a relation in which
the intersection of each row and column contains
one and only one value.
Please Note:
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 24/48
UNF to 1NF (two approaches)
In case a table is not in 1NF, we do two things:
• First Nominate an attribute or group of attributes
to act as the primary key for the unnormalized
table, then use any of the approaches below:
Approach 1:
• Identify repeating group(s) in unnormalized table
which repeats for the key attribute(s).
• Flatten the table: Place each value of a repeating
group on a tuple with duplicate values of the non-
repeating data.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 25/48
UNF to 1NF (Approach 1) cont’d
Stage 2: Table
in UNF
Stage 3: Table
in 1NF
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 26/48
UNF to 1NF (two approaches)
Approach 2:
• Make a new table to cater for multivalued
attributes.
• Place the repeating group along with copy of the
original primary key attribute(s) into a separate
relation
• The new primary key is usually a combination of the
(multivalued) attribute and the primary key of the
parent table.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 27/48
UNF to 1NF (two approaches)
Approach 2:
Stage 2: Table
in UNF
Stage 3: Two Tables in 1NF
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 28/48
Second Normal Form (2NF)
• Second Normal Form (2NF) is based on the concept
of full functional dependency. Second Normal Form
applies to relations with composite keys, that is,
relations with a primary key composed of two or
more attributes.
• The normalization of 1NF relations to 2NF involves
the removal of partial dependencies.
• If a partial dependency exists, we remove the
partially dependent attribute(s) from the relation by
placing them in a new relation along with a copy of
their determinant.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 29/48
Second Normal Form (2NF) is a relation that is
in First Normal Form and every non-primary-key
attribute is fully functionally dependent on the
primary key.
Full functional dependency indicates that if A
and B are attributes of a relation, B is fully
functionally dependent on A if B is functionally
dependent on A but not on any proper subset of A.
Please Note:
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 30/48
Second Normal Form (2NF) cont’d
• To change the 1NF ClientRental relation to 2NF, we
begin by identifying the presence of any partial
dependencies on the primary key.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 31/48
Second Normal Form (2NF) cont’d
The ClientRental relation has the following functional dependancies:
• fd1 clientNo, propertyNo → rentStart, rentFinish (Primary key)
• fd2 clientNo → cName (Partial dependency)
NB: cName is partially dependent on the primary key, because it
depends on only the clientNo attribute and not the whole primary key
(clientNo & propertyNo combination).
• fd3 propertyNo → pAddress, rent, ownerNo, oName (Partial
dependency)
• fd4 ownerNo → oName (Transitive dependency)
• fd5 clientNo, rentStart → propertyNo, pAddress,
• rentFinish, rent, ownerNo, oName (Candidate key)
• fd6 propertyNo, rentStart → clientNo, cName, rentFinish (Candidate
key)
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 32/48
Second Normal Form (2NF) cont’d
• The identification of partial dependencies within the ClientRental
relation indicates that the relation is not in 2NF.
• To transform the ClientRental relation into 2NF requires the creation
of new relations so that the non-primary-key attributes are removed
along with a copy of the part of the primary key on which they are
fully functionally dependent.
• This results in the creation of three new relations called Client,
Rental, and PropertyOwner having the following form:
• Client (clientNo, cName)
• Rental (clientNo, propertyNo, rentStart, rentFinish)
• PropertyOwner (propertyNo, pAddress, rent, ownerNo, oName)
• These three relations are now in Second Normal Form as every
nonprimary- key attribute is fully functionally dependent on the
primary key of the relation. See the next slide for illustration.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 33/48
1NF to 2NF
illustration
Stage 3: Two Tables in 1NF
Stage 4: Three
Tables in 2NF
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 34/48
Third Normal Form (3NF)
• 2NF relations may still suffer from update anomalies.
For Example, if we want to update the name of an owner,
such as Tony Shaw (ownerNo CO93), we have to update
two tuples in the PropertyOwner relation.
• This update anomaly is caused by a transitive dependency.
• The normalization of 2NF relations to 3NF involves the
removal of transitive dependencies.
• If a transitive dependency exists, we remove the
transitively dependent attribute(s) from the relation by
placing the attribute(s) in a new relation along with a copy
of the determinant.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 35/48
Third Normal Form (3NF) is a relation that is in
First and Second Normal Form, and in which no
non-primary key attribute is transitively dependent
on the primary key.
Transitive dependency is a condition where A,
B, and C are attributes of a relation such that if
A → B and B → C, then C is transitively dependent
on A via B.
Please Note:
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 36/48
Third Normal Form (3NF) cont’d
• In our previous example, the Client and Rental relations have no
transitive dependencies and are therefore already in 3NF.
• However, in the PropertyOwner relation, all the non-primary-key
attributes within the PropertyOwner relation are functionally
dependent on the primary key, with the exception of oName, which is
transitively dependent on ownerNo (represented as fd4).
• To transform the PropertyOwner relation into 3NF we must first
remove this transitive dependency by creating two new relations
called PropertyForRent and Owner in the form:
• PropertyForRent (propertyNo, pAddress, rent, ownerNo)
• Owner (ownerNo, oName)
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 37/48
2NF to 3NF
illustration
Stage 5: Four
Tables in 3NF
Stage 4: Three
Tables in 2NF
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 38/48
Lossless-join decomposition
• ClientRental 1NF relation has been decomposed
into four 3NF relations.
• The normalization process has decomposed the original ClientRental
relation using a series of relational algebra projections. This results in
a lossless-join decomposition, which is reversible using the natural
join operation.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 39/48
General Definitions of 2NF and 3NF
• Earlier 2NF and 3NF definitions do not take into account
other candidate keys of a relation, if any exist.
• Here below, we present more general definitions that take
into account candidate keys of a relation.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 40/48
Higher Normal Forms
• We have described the three most commonly used
normal forms: 1NF, 2NF, and 3NF.
• However, R. Boyce and E.F. Codd identified a
weakness with 3NF and introduced a stronger
definition of 3NF called Boyce–Codd Normal Form
(BCNF).
• Higher normal forms that go beyond BCNF were
introduced later, such as Fourth (4NF) and Fifth (5NF)
Normal Forms (Fagin, 1977, 1979).
• However, these later normal forms deal with situations
that are very rare and so we briefly look at only BCNF.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 41/48
Boyce–Codd Normal Form (BCNF)
• We have seen that 2NF and 3NF disallow partial and
transitive dependencies on the primary key of a relation,
respectively.
• However, the 2NF and 3NF do not consider whether such
dependencies remain on other candidate keys of a relation,
if any exist.
• Additional redundancy caused by dependencies that violate
one or more candidate keys is a weakness in 3NF relations.
This weakness resulted in the presentation of a stronger
normal form called Boyce–Codd Normal Form (Codd,
1974).
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 42/48
Boyce–Codd Normal Form (BCNF) is a relation
in which every determinant is a candidate key.
Please Note:
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 43/48
PRACTICE EXERCISES
1. Given the following data source, use normalization as a
bottom-up technique to create a suitable set of relations
in 3NF.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 44/48
Solution - Exercise One
• 0NF
– ORDER(order#, customer#, name, street, city, country, orderdate, (product#,
description, quantity, unitprice))
• 1NF
– ORDER(order#, customer#, name, street,city,country, orderdate)
– PRODUCT_ORDER(order#, product#, description, quantity, unitprice)
• 2NF
– ORDER(order#, customer#, name, street, city, country, orderdate)
– PRODUCT_ORDER(order#, product#, quantity)
– PRODUCT(product#, description, unitprice)
• 3NF
– ORDER(order#, customer#, orderdate)
– CUSTOMER(customer#, name, street, city, country)
– PRODUCT_ORDER(order#, product#, quantity)
– PRODUCT(product#, description, unitprice)
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 45/48
PRACTICE EXERCISES cont’d
2. Using the given functional dependencies, normalize the
EMPLOYEE_CONTRACT relation fully.
• EMPLOYEE_CONTRACT (staffNo, contractNo, hours,
staffName, branchNo, branchName)
• fd1: staffNo, contractNo -> hours, staffName,
branchNo, branchName
• fd2: staffNo -> staffName
• fd3: contractNo -> branchNo, branchName
• Fd4: branchNo -> branchName
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 46/48
PRACTICE EXERCISES cont’d
3 (a) The following table is susceptible to update anomalies. Provide
examples of insertion, deletion, and modification anomalies.
(b) Using the primary key and functional dependency concepts,
normalize the following tables below to the 3NF
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 47/48
Review Questions
1. Describe the purpose of normalizing data.
2. Discuss the alternative ways that normalization can be used to support database design.
3. Describe the types of update anomaly that may occur on a relation that has redundant data.
4. Describe the concept of functional dependency.
5. What are the main characteristics of functional dependencies that are used for normalization?
6. Describe how a database designer typically identifies the set of functional dependencies
associated with a relation.
7. Describe the characteristics of a table in Unnormalized Form (UNF) and describe how such a
table is converted to a First Normal Form (1NF) relation.
8. What is the minimal normal form that a relation must satisfy? Provide a definition for this
normal form.
9. Describe the two approaches to converting an Unnormalized Form (UNF) table to First
Normal Form (1NF) relation(s).
10. Describe the concept of full functional dependency and describe how this concept relates to
2NF. Provide an example to illustrate your answer.
11. Describe the concept of transitive dependency and describe how this concept relates to 3NF.
12. Discuss how the definitions of 2NF and 3NF based on primary keys differ from the general
definitions of 2NF and 3NF.
BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 48/48

More Related Content

PPT
Introduction to data structure
PDF
Avl tree algorithm
PPTX
Slide 3 data abstraction & 3 schema
PPTX
pl/sql Procedure
PPTX
DBMS Integrity rule
PPTX
Integrity Constraints
PPT
Fundamentals of Database ppt ch03
PPTX
Presentation on TCP/IP Model
Introduction to data structure
Avl tree algorithm
Slide 3 data abstraction & 3 schema
pl/sql Procedure
DBMS Integrity rule
Integrity Constraints
Fundamentals of Database ppt ch03
Presentation on TCP/IP Model

What's hot (20)

PPT
Database Normalization 1NF, 2NF, 3NF, BCNF, 4NF, 5NF
PPT
Transaction management and concurrency control
PPTX
Object Oriented Programming with C++
PPT
Database Chapter 3
PPTX
Internetworking
PPT
Fundamentals of Database system
PDF
Dbms viva questions
PPTX
5. stored procedure and functions
PPTX
Transaction processing ppt
PPTX
Logical database design and the relational model(database)
PPTX
Introduction to database
PPTX
Instruction pipeline: Computer Architecture
PPTX
PPTX
Normalization
PPTX
process and thread.pptx
PPTX
Tcp/ip server sockets
PPTX
Student Tracking System
PDF
Database Normalization
DOC
Chapter-13-solutions
Database Normalization 1NF, 2NF, 3NF, BCNF, 4NF, 5NF
Transaction management and concurrency control
Object Oriented Programming with C++
Database Chapter 3
Internetworking
Fundamentals of Database system
Dbms viva questions
5. stored procedure and functions
Transaction processing ppt
Logical database design and the relational model(database)
Introduction to database
Instruction pipeline: Computer Architecture
Normalization
process and thread.pptx
Tcp/ip server sockets
Student Tracking System
Database Normalization
Chapter-13-solutions
Ad

Viewers also liked (20)

PDF
Database design & Normalization (1NF, 2NF, 3NF)
PPT
Web designing and publishing computer studies theory lesson
PPT
D B M S Animate
PPTX
PPT
Over view of internet computer studies lesson
DOCX
ICT FINAL EXAM
PPTX
Normalization
PPTX
Patterns of organization of speech, and how to lead discussions and seminars
PDF
Answer guide to uneb sample question paper 2013 print by WAFUBA SJ BUKOYO EC ...
PPT
102 Evolution of computers
PPT
Input computer hardware notes for UCEand UACE ICT
PPT
DBMS - Normalization
PDF
A Monitor System in Data Redundancy in Information System
PPTX
Kittys network power point
PPT
Databaseconcepts
PPT
Mysql Statments
 
PPT
DOC
Assignment 4,5,6 technological matrixing and types of computers
PDF
BitNation Client-Server Manual
PPTX
Chapter 6 database normalisation
Database design & Normalization (1NF, 2NF, 3NF)
Web designing and publishing computer studies theory lesson
D B M S Animate
Over view of internet computer studies lesson
ICT FINAL EXAM
Normalization
Patterns of organization of speech, and how to lead discussions and seminars
Answer guide to uneb sample question paper 2013 print by WAFUBA SJ BUKOYO EC ...
102 Evolution of computers
Input computer hardware notes for UCEand UACE ICT
DBMS - Normalization
A Monitor System in Data Redundancy in Information System
Kittys network power point
Databaseconcepts
Mysql Statments
 
Assignment 4,5,6 technological matrixing and types of computers
BitNation Client-Server Manual
Chapter 6 database normalisation
Ad

Similar to NORMALIZATION - BIS 1204: Data and Information Management I (20)

PPT
Presentation for the Database Management
PPT
Chapter 1 - Introduction to Database Systems.ppt
PPSX
Corporate data handling
PDF
chapter-13-database-concepts.pdf
PPTX
04 CHAPTER FOUR - INTEGRITY CONSTRAINTS AND NORMALIZATION.pptx
PPTX
Normalization
DOCX
ICS Part 2 Computer Science Short Notes
PDF
Cal Essay
PDF
ITB - UNIT 3.pdf
PPTX
the normalization of database from APSI Course
PPTX
Chapter 3 ( PART 2 ).pptx
PDF
Ibps it officer exam capsule by affairs cloud
PDF
Ibps it officer exam capsule by affairs cloud
DOCX
Advanced Database Systems CS352Unit 4 Individual Project.docx
DOCX
Database Normalization.docx
PPTX
01 CHAPTER ONE PART I - OVERVIEW OF THE DATABASE SYSTEM.pptx
PPT
DBMS e evevevevevevevbebrbbrbrbrbrbrbrb 4.ppt
PDF
Unit 1 basic concepts of DBMS
PPTX
DBMS DATA MANAGEMENT SYSTEM ppt Cs403 rtc
PDF
Asif nosql
Presentation for the Database Management
Chapter 1 - Introduction to Database Systems.ppt
Corporate data handling
chapter-13-database-concepts.pdf
04 CHAPTER FOUR - INTEGRITY CONSTRAINTS AND NORMALIZATION.pptx
Normalization
ICS Part 2 Computer Science Short Notes
Cal Essay
ITB - UNIT 3.pdf
the normalization of database from APSI Course
Chapter 3 ( PART 2 ).pptx
Ibps it officer exam capsule by affairs cloud
Ibps it officer exam capsule by affairs cloud
Advanced Database Systems CS352Unit 4 Individual Project.docx
Database Normalization.docx
01 CHAPTER ONE PART I - OVERVIEW OF THE DATABASE SYSTEM.pptx
DBMS e evevevevevevevbebrbbrbrbrbrbrbrb 4.ppt
Unit 1 basic concepts of DBMS
DBMS DATA MANAGEMENT SYSTEM ppt Cs403 rtc
Asif nosql

More from Mukalele Rogers (7)

DOC
Assignments on adopting information technology in traditional organisations
PPTX
IT Security and Risk Mitigation
DOCX
How Internet Search Engines Work
PPTX
Technology based distribution channels / networks in financial industry
PPT
Piascy group discussion about abstainance
PPT
P I A S C Y G R O U P D I S C U S S I O N A B O U T A B S T A I N A N C E...
PPT
Factors which affect the speed of internet computer studies lesson
Assignments on adopting information technology in traditional organisations
IT Security and Risk Mitigation
How Internet Search Engines Work
Technology based distribution channels / networks in financial industry
Piascy group discussion about abstainance
P I A S C Y G R O U P D I S C U S S I O N A B O U T A B S T A I N A N C E...
Factors which affect the speed of internet computer studies lesson

Recently uploaded (20)

PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
01-Introduction-to-Information-Management.pdf
PDF
RMMM.pdf make it easy to upload and study
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
Pharma ospi slides which help in ospi learning
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
GDM (1) (1).pptx small presentation for students
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
01-Introduction-to-Information-Management.pdf
RMMM.pdf make it easy to upload and study
Microbial disease of the cardiovascular and lymphatic systems
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Pharma ospi slides which help in ospi learning
O5-L3 Freight Transport Ops (International) V1.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Renaissance Architecture: A Journey from Faith to Humanism
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
TR - Agricultural Crops Production NC III.pdf
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
O7-L3 Supply Chain Operations - ICLT Program
FourierSeries-QuestionsWithAnswers(Part-A).pdf
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
GDM (1) (1).pptx small presentation for students
Anesthesia in Laparoscopic Surgery in India
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx

NORMALIZATION - BIS 1204: Data and Information Management I

  • 1. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 1/48 MUKALELE Rogers MULINDA Saddati TUSABA Pauline Joan MUSANA Evans Kwesiga Allan 13/U/21067/EVE 13/U/21076/EV E 13/U/21363/EVE 13/U/21078/EVE 13/U/20996/EV E 213024992 213008565 213003883 213004582 213010254 Makerere University Tuesday, 28 July 2015 - 11:30 AM GROUP ASSIGNMENT 3: BIS 1204: Data and Information Management I by Mr. ATUGONZA Martin NORMALIZATION
  • 2. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 2/48 Presentation Objectives • The purpose of normalization. • How normalization can be used during database design. • The update anomalies associated with data redundancy. • The concept of functional dependencies, which describe the relationship between attributes. • How to undertake the process of normalization. • How to identify the most commonly used normal forms: First Normal Form(1NF), Second Normal Form (2NF), and Third Normal Form (3NF). • Introduction to Advanced Normalisation: The Boyce–Codd Normal Form (BCNF) and higher normal forms.
  • 3. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 3/48 The purpose of normalization. • When tasked to design a relational database for an enterprise, we are given the user and data requirements, and our objective is to generate a set of relations (tables) that allow us to store information without unnecessary redundancy, yet also allows us to retrieve information easily. • The purpose of normalization is to identify a suitable set of relations: – that support the data requirements of an enterprise. – containing a minimal number of only NECESSARY attributes. – where attributes with a close logical relationship are found in the same relation; – having minimal data redundancy ( with the important exception of foreign keys essential for the joining of related relations.)
  • 4. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 4/48 Normalization is a technique for producing a set of relations with desirable properties, given the data requirements of an enterprise. Please Note:
  • 5. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 5/48 How normalization can be used during database design. • Approach 1: Normalization can be used as a bottom up standalone database design technique to create a set of relations. • Approach 2:Normalization can be used as a validation technique to check the structure of relations, which may have been created using a top-down approach such as ER modeling. • No matter which approach is used, the goal is the same: that of creating a set of well-designed relations that meet the data requirements of the enterprise.
  • 6. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 6/48 How normalization can be used during database design (cont)
  • 7. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 7/48 Data Redundancy • Data redundancy is a direct result of storing copies of the same data in more than one place in a database (duplication of data), which can lead to inconsistencies if some of the copies are modified and thus loss of data integrity. • Benefits of reducing data redundancy include: – updates to the data stored in the database are achieved with a minimal number of operations – Reduction in the opportunities for data inconsistencies; – Reduction in the file storage space required by the base relations thus minimizing costs.
  • 8. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 8/48 Data Redundancy: Case Study StudentDetail We illustrate the problems associated with unwanted data redundancy by comparing the StudentDetail relation shown below, with the Student, District and Hostel relations on the next slide.
  • 9. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 9/48 Data Redundancy: Case Study (cont) Student District Hostel
  • 10. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 10/48 Data Redundancy • StudentDetail relation has redundant data: details of a hotel are repeated for every student. • In contrast, hostel information appears only once for each hostel in Hostel relation and only HostelID is repeated in Staff relation, to represent where each student resides. • The same case applies for districts.
  • 11. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 11/48 Update Anomalies • Relations that contain redundant information may potentially suffer from update anomalies. Types of update anomalies include: – Modification – Insertion – Deletion (a) Modification anomalies • If we want to change the value of one of the attributes of a particular hostel in the StudentDetail relation, for example the address for hostel number HOST07, we must update the tuples of all students residing at that hostel.
  • 12. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 12/48 Update Anomalies (cont) (b) Insertion Anomalies: – Every time we insert the details of new students into the StudentDetail relation, we must include the details of the hostel at which the students are to reside. – To insert details of a new hostel that currently has no students into the StudentDetail relation, it is necessary to enter nulls into the attributes such as staffNo, which violates entity integrity. (c) Deletion Anomalies – If we delete a tuple from the StudentDetail relation that represents the last student residing at a hostel, the details about that hostel are also lost from the database.
  • 13. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 13/48 Properties associated with decomposition of relations • There are two important properties associated with decomposition of a larger relation into smaller relations: - Lossless-join property enables us to find any instance of original relation from corresponding instances in the smaller relations. - Dependency preservation property enables us to enforce a constraint on original relation by enforcing some constraint on each of the smaller relations.
  • 14. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 14/48 The concept of Functional Dependencies • Functional Dependency – Describes relationship between attributes in a relation. – If A and B are attributes of relation R, B is functionally dependent on A (denoted A  B), if each value of A is associated with exactly one value of B. • Determinant of a functional dependency refers to attribute or group of attributes on left-hand side of the arrow.
  • 15. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 15/48 Functional dependency is One of the main concepts associated with normalization, which describes the relationship between attributes in a relation. For example, if A and B are attributes of relation R, B is functionally dependent on A (denoted A → B), if each value of A is associated with exactly one value of B. (A and B may each consist of one or more attributes.) Please Note:
  • 16. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 16/48 Types of Functional Dependencies • Full functional dependency: If A and B are attributes of a relation, B is fully functionally dependent on A if B is functionally dependent on A, but not on any proper subset of A. • Partial Functional Dependency: A functional dependency A→B is a partial dependency if there is some attribute that can be removed from A and yet the dependency still holds. • Transitive Dependency: A condition where A, B, and C are attributes of a relation such that if A → B and B → C, then C is transitively dependent on A via B (provided that A is not functionally dependent on B or C).
  • 17. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 17/48 Characteristics of functional dependencies used in normalisation 1. There is a one-to-one relationship between the attribute(s) on the left-hand side (determinant) and those on the right-hand side of a functional dependency. (Note that the relationship in the opposite direction, that is from the right- to the left-hand side attributes, can be a one-to-one relationship or one-to-many relationship.) 2. They hold for all time. Eg. The Dependancy SName  RegNo does not hold for all time but RegNo  Sname holds for all time. 3. The determinant has the minimal number of attributes necessary to maintain the dependency with the attribute(s) on the right-hand side. In other words, there must be a full functional dependency between the attribute(s) on the left- and right-hand sides of the dependency.
  • 18. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 18/48 The process of normalization • Formal process for analyzing a relation based on its primary key and functional dependencies between its attributes. • Often executed as a series of steps. Each step corresponds to a specific normal form, which has known properties. • As normalization proceeds, relations become progressively more restricted (stronger) in format and also less vulnerable to update anomalies. • Four most commonly used normal forms are first (1NF), second (2NF) and third (3NF) normal forms, and Boyce– Codd normal form (BCNF). Other forms are 4NF, 5NF and other Higher Normal forms.
  • 19. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 19/48 The process of normalization cont’d
  • 20. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 20/48 Unnormalized Form (UNF) • A table that contains one or more repeating groups. – Note: A repeating group is an attribute or group of attributes within a table that occurs with multiple values for a single occurrence of the nominated key attributes for that table. For example a book with multiple authors, A client with many properties, etc. • To create an unnormalized table: – transform data from information source (e.g. form) into table format with columns and rows.
  • 21. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 21/48 Unnormalized Form (UNF) cont’d Stage 1: Paper Forms (Data Source) Stage 2: Table in UNF
  • 22. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 22/48 First Normal Form (1NF) • First Normal Form (1NF) is a relation in which the intersection of each row and column contains one and only one value. • A table is in First Normal Form (1NF) if all its attributes are atomic, i.e. are considered to be indivisible units. • It should have no composite attributes, no multivalued attributes and no repeating groups.
  • 23. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 23/48 First Normal Form (1NF) is a relation in which the intersection of each row and column contains one and only one value. Please Note:
  • 24. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 24/48 UNF to 1NF (two approaches) In case a table is not in 1NF, we do two things: • First Nominate an attribute or group of attributes to act as the primary key for the unnormalized table, then use any of the approaches below: Approach 1: • Identify repeating group(s) in unnormalized table which repeats for the key attribute(s). • Flatten the table: Place each value of a repeating group on a tuple with duplicate values of the non- repeating data.
  • 25. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 25/48 UNF to 1NF (Approach 1) cont’d Stage 2: Table in UNF Stage 3: Table in 1NF
  • 26. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 26/48 UNF to 1NF (two approaches) Approach 2: • Make a new table to cater for multivalued attributes. • Place the repeating group along with copy of the original primary key attribute(s) into a separate relation • The new primary key is usually a combination of the (multivalued) attribute and the primary key of the parent table.
  • 27. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 27/48 UNF to 1NF (two approaches) Approach 2: Stage 2: Table in UNF Stage 3: Two Tables in 1NF
  • 28. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 28/48 Second Normal Form (2NF) • Second Normal Form (2NF) is based on the concept of full functional dependency. Second Normal Form applies to relations with composite keys, that is, relations with a primary key composed of two or more attributes. • The normalization of 1NF relations to 2NF involves the removal of partial dependencies. • If a partial dependency exists, we remove the partially dependent attribute(s) from the relation by placing them in a new relation along with a copy of their determinant.
  • 29. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 29/48 Second Normal Form (2NF) is a relation that is in First Normal Form and every non-primary-key attribute is fully functionally dependent on the primary key. Full functional dependency indicates that if A and B are attributes of a relation, B is fully functionally dependent on A if B is functionally dependent on A but not on any proper subset of A. Please Note:
  • 30. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 30/48 Second Normal Form (2NF) cont’d • To change the 1NF ClientRental relation to 2NF, we begin by identifying the presence of any partial dependencies on the primary key.
  • 31. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 31/48 Second Normal Form (2NF) cont’d The ClientRental relation has the following functional dependancies: • fd1 clientNo, propertyNo → rentStart, rentFinish (Primary key) • fd2 clientNo → cName (Partial dependency) NB: cName is partially dependent on the primary key, because it depends on only the clientNo attribute and not the whole primary key (clientNo & propertyNo combination). • fd3 propertyNo → pAddress, rent, ownerNo, oName (Partial dependency) • fd4 ownerNo → oName (Transitive dependency) • fd5 clientNo, rentStart → propertyNo, pAddress, • rentFinish, rent, ownerNo, oName (Candidate key) • fd6 propertyNo, rentStart → clientNo, cName, rentFinish (Candidate key)
  • 32. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 32/48 Second Normal Form (2NF) cont’d • The identification of partial dependencies within the ClientRental relation indicates that the relation is not in 2NF. • To transform the ClientRental relation into 2NF requires the creation of new relations so that the non-primary-key attributes are removed along with a copy of the part of the primary key on which they are fully functionally dependent. • This results in the creation of three new relations called Client, Rental, and PropertyOwner having the following form: • Client (clientNo, cName) • Rental (clientNo, propertyNo, rentStart, rentFinish) • PropertyOwner (propertyNo, pAddress, rent, ownerNo, oName) • These three relations are now in Second Normal Form as every nonprimary- key attribute is fully functionally dependent on the primary key of the relation. See the next slide for illustration.
  • 33. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 33/48 1NF to 2NF illustration Stage 3: Two Tables in 1NF Stage 4: Three Tables in 2NF
  • 34. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 34/48 Third Normal Form (3NF) • 2NF relations may still suffer from update anomalies. For Example, if we want to update the name of an owner, such as Tony Shaw (ownerNo CO93), we have to update two tuples in the PropertyOwner relation. • This update anomaly is caused by a transitive dependency. • The normalization of 2NF relations to 3NF involves the removal of transitive dependencies. • If a transitive dependency exists, we remove the transitively dependent attribute(s) from the relation by placing the attribute(s) in a new relation along with a copy of the determinant.
  • 35. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 35/48 Third Normal Form (3NF) is a relation that is in First and Second Normal Form, and in which no non-primary key attribute is transitively dependent on the primary key. Transitive dependency is a condition where A, B, and C are attributes of a relation such that if A → B and B → C, then C is transitively dependent on A via B. Please Note:
  • 36. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 36/48 Third Normal Form (3NF) cont’d • In our previous example, the Client and Rental relations have no transitive dependencies and are therefore already in 3NF. • However, in the PropertyOwner relation, all the non-primary-key attributes within the PropertyOwner relation are functionally dependent on the primary key, with the exception of oName, which is transitively dependent on ownerNo (represented as fd4). • To transform the PropertyOwner relation into 3NF we must first remove this transitive dependency by creating two new relations called PropertyForRent and Owner in the form: • PropertyForRent (propertyNo, pAddress, rent, ownerNo) • Owner (ownerNo, oName)
  • 37. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 37/48 2NF to 3NF illustration Stage 5: Four Tables in 3NF Stage 4: Three Tables in 2NF
  • 38. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 38/48 Lossless-join decomposition • ClientRental 1NF relation has been decomposed into four 3NF relations. • The normalization process has decomposed the original ClientRental relation using a series of relational algebra projections. This results in a lossless-join decomposition, which is reversible using the natural join operation.
  • 39. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 39/48 General Definitions of 2NF and 3NF • Earlier 2NF and 3NF definitions do not take into account other candidate keys of a relation, if any exist. • Here below, we present more general definitions that take into account candidate keys of a relation.
  • 40. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 40/48 Higher Normal Forms • We have described the three most commonly used normal forms: 1NF, 2NF, and 3NF. • However, R. Boyce and E.F. Codd identified a weakness with 3NF and introduced a stronger definition of 3NF called Boyce–Codd Normal Form (BCNF). • Higher normal forms that go beyond BCNF were introduced later, such as Fourth (4NF) and Fifth (5NF) Normal Forms (Fagin, 1977, 1979). • However, these later normal forms deal with situations that are very rare and so we briefly look at only BCNF.
  • 41. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 41/48 Boyce–Codd Normal Form (BCNF) • We have seen that 2NF and 3NF disallow partial and transitive dependencies on the primary key of a relation, respectively. • However, the 2NF and 3NF do not consider whether such dependencies remain on other candidate keys of a relation, if any exist. • Additional redundancy caused by dependencies that violate one or more candidate keys is a weakness in 3NF relations. This weakness resulted in the presentation of a stronger normal form called Boyce–Codd Normal Form (Codd, 1974).
  • 42. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 42/48 Boyce–Codd Normal Form (BCNF) is a relation in which every determinant is a candidate key. Please Note:
  • 43. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 43/48 PRACTICE EXERCISES 1. Given the following data source, use normalization as a bottom-up technique to create a suitable set of relations in 3NF.
  • 44. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 44/48 Solution - Exercise One • 0NF – ORDER(order#, customer#, name, street, city, country, orderdate, (product#, description, quantity, unitprice)) • 1NF – ORDER(order#, customer#, name, street,city,country, orderdate) – PRODUCT_ORDER(order#, product#, description, quantity, unitprice) • 2NF – ORDER(order#, customer#, name, street, city, country, orderdate) – PRODUCT_ORDER(order#, product#, quantity) – PRODUCT(product#, description, unitprice) • 3NF – ORDER(order#, customer#, orderdate) – CUSTOMER(customer#, name, street, city, country) – PRODUCT_ORDER(order#, product#, quantity) – PRODUCT(product#, description, unitprice)
  • 45. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 45/48 PRACTICE EXERCISES cont’d 2. Using the given functional dependencies, normalize the EMPLOYEE_CONTRACT relation fully. • EMPLOYEE_CONTRACT (staffNo, contractNo, hours, staffName, branchNo, branchName) • fd1: staffNo, contractNo -> hours, staffName, branchNo, branchName • fd2: staffNo -> staffName • fd3: contractNo -> branchNo, branchName • Fd4: branchNo -> branchName
  • 46. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 46/48 PRACTICE EXERCISES cont’d 3 (a) The following table is susceptible to update anomalies. Provide examples of insertion, deletion, and modification anomalies. (b) Using the primary key and functional dependency concepts, normalize the following tables below to the 3NF
  • 47. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 47/48 Review Questions 1. Describe the purpose of normalizing data. 2. Discuss the alternative ways that normalization can be used to support database design. 3. Describe the types of update anomaly that may occur on a relation that has redundant data. 4. Describe the concept of functional dependency. 5. What are the main characteristics of functional dependencies that are used for normalization? 6. Describe how a database designer typically identifies the set of functional dependencies associated with a relation. 7. Describe the characteristics of a table in Unnormalized Form (UNF) and describe how such a table is converted to a First Normal Form (1NF) relation. 8. What is the minimal normal form that a relation must satisfy? Provide a definition for this normal form. 9. Describe the two approaches to converting an Unnormalized Form (UNF) table to First Normal Form (1NF) relation(s). 10. Describe the concept of full functional dependency and describe how this concept relates to 2NF. Provide an example to illustrate your answer. 11. Describe the concept of transitive dependency and describe how this concept relates to 3NF. 12. Discuss how the definitions of 2NF and 3NF based on primary keys differ from the general definitions of 2NF and 3NF.
  • 48. BIS 1204: Data and Information Management IEvening ‘Group 2’ Presentation: Slide 48/48