Systems Analysis & Design
Eleventh Edition
Chapter # 13
Designing Databases
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Learning Objectives (1 of 2)
13.1 Appreciate what a database is.
13.2 Understand basic data concepts.
13.3 Learn not to efficiently store data using normalization.
13.4 Know the guidelines for designing main files and
database relations.
13.5 Grasp the notion of normalization and why it is useful.
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Learning Objectives (2 of 2)
13.6 Understand the concept of data warehouses.
13.7 Comprehend the relationship of business intelligence
to data warehouses.
13.8 Understand data analytics in helping systems and
people make decisions.
13.9 Learn about data lakes and how they differ from data
warehouses.
13.10 Gain basic knowledge about blockchains and their
usefulness.
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Major Topics
• Databases
• Normalization
• Key design
• Using the database
• Data warehouses
• Data lakes
• Data mining
• Business intelligence
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data Storage (1 of 2)
• The data must be available when the user wants to use
them
• The data must be accurate and consistent
• Efficient storage of data as well as efficient updating
and retrieval
• It is necessary that information retrieval be
purposeful
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data Storage (2 of 2)
• There are two approaches to the storage of data in a
computer-based system:
– Store the data in individual files, each unique to a
particular application
– Store data in a database
▪ A database is a formally defined and centrally
controlled store of data intended for use in many
different applications
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Databases (1 of 2)
• Effectiveness objectives of the database:
– Ensuring that data can be shared among users for a
variety of applications
– Maintaining data that are both accurate and consistent
– Ensuring data required for current and future applications
will be readily available
– Allowing the database to evolve as the needs of the users
grow
– Allowing users to construct their personal view of the data
without concern for the way the data are physically stored
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Databases (2 of 2)
• Databases are highly useful for organizations to store,
access, and retrieve their data
• Security issues will arise
• Risk assessment is an essential part of database
planning
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Database Security (1 of 2)
• Cyber attacks, including malware and denial of service
attacks, are on the increase
• Getting users involved in business continuity planning
and disaster recovery approaches will help make
databases and the organization resilient if security is
breached
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Database Security (2 of 2)
• Companies must consider these questions:
– What is the likelihood of an attack?
– What is the value of the data that are being
protected?
– What are the implications of a security breach for the
company’s customers and, in the aftermath, for the
company’s image?
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Reality, Data, and Metadata
• Reality
– The real world
• Data
– Collected about people, places, or events in reality
and eventually stored in a file or database
• Metadata
– Information that describes data
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.1 Reality, Data, and Metadata
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Entities
• Any object or event about which someone chooses to
collect data
• May be a person, place, or thing
• May be an event or unit of time
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Entity Subtype
• An entity subtype is a special one-to-one relationship
used to represent additional attributes, which may not
be present on every record of the first entity
• This eliminates null fields stored on database tables
• For example, students who have internships: the
STUDENT MASTER should not have to contain
information about internships for each student
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Relationships
• Relationships
– One-to-one
– One-to-many
– Many-to-many
• A single vertical line represents one
• A crow’s foot represents many
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.2 Entity-Relationship Diagrams
Associations
Entity-relationship (E-R) diagrams
can show one-to-one, one-to-
many, or many-to-many
associations.
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.3 Entity-Relationship Symbols
and Their Meanings
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.4 The Entity-Relationship
Diagram for Patient Treatment
Attributes can be
listed alongside
the entities. The
key is underlined.
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Attributes, Records, and Keys
• Attributes represent some characteristic of an entity
• Records are a collection of data items that have
something in common with the entity described
• Keys are data items in a record used to identify the
record
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.6 Attributes and Keys
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Key Types
• Key types are:
– Primary key—unique attribute for the record
– Candidate key—an attribute or collection of attributes,
that can serve as a primary key
– Secondary key—a key which may not be unique,
used to select a group of records
– Composite key—a combination of two or more
attributes representing the key
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Metadata
• Data about the data in the file or database
• Describe the name given and the length assigned
each data item
• Also describe the length and composition of each of
the records
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.7 Metadata
Metadata
includes a
description of
what the value of
each data item
looks like.
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Files
• A file contains groups of records used to provide
information for operations, planning, management, and
decision making
• Files can be used for storing data for an indefinite period
of time, or they can be used to store data temporarily for
a specific purpose
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
File Types
• Master file
• Table file
• Transaction file
• Report file
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Master and Table Files
• Master files:
– Contain records for a group of entities
– Contain all information about a data entity
• Table files:
– Contain data used to calculate more data or
performance measures
– Usually read-only by a program
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Transaction and Report Files
• Transaction records:
– Used to enter changes that update the master file and
produce reports
• Report files:
– Used when it is necessary to print a report when no
printer is available
– Useful because users can take files to other computer
systems and output to specialty devices
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Relational Databases
• A database is intended to be shared by many users
• There are three structures for storing database files:
– Relational database structures
– Hierarchical database structures
– Network database structures
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.8 Database Design
Database design includes
synthesizing user reports,
user views, and logical
and physical designs
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.9 Relational Data Structure
In a relational data structure, data are stored in many tables.
Item Price Item-# Title Price
B235 Guys and Dolls 8.99
B521 My Fair Lady 6.99
B894 42nd Street 10.99
B992 A Chorus Line 10.99
Order
Order-# Last
Name
I Street Address City St Charge
Acct
10784 MacRae G 2314 Curly
Circle
Lincoln NE 45-4654-76
10796 Jones S 34 Dream Lane Oklahoma
City
OK 44-9876-74
11821 Preston R 1008 Madison
Ave.
River City IA 34-7642-64
11845 Channing C 454 Harmonia
St.
New York NY 34-0876-87
11872 Kiley R 765 Dulcinea
Drive
La
Mancha
CA 65-8798-87
Item Status
Item-# Order-# Status
B235 10784 Shipped 5/12
B235 19796 Shipped 5/14
B235 11872 In Process
B521 11821 In Process
B894 11845 Backordered
B894 11872 Shipped 5/12
B992 10784 Shipped 5/12
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Normalization
• Normalization is the transformation of complex user
views and data stores to a set of smaller, stable, and
easily maintainable data structures
• The main objective of the normalization process is to
simplify all the complex data items that are often found in
user views
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.10 Normalization of a Relation
is Accomplished in Three Major Steps
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data Model Diagrams
• Shows data associations of data elements
• Each entity is enclosed in an ellipse
• Arrows are used to show the relationships
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.13 Drawing Data Model
Drawing data model diagrams
for data associations
sometimes helps analysts
appreciate the complexity of
data storage.
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
First Normal Form (1NF)
• Remove repeating groups
• The primary key with repeating group attributes are
moved into a new table
• When a relation contains no repeating groups, it is in first
normal form
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.16 The Original Unnormalized
Relation
The original
unnormalized relation
SALES-REPORT is
separated into two
relations, SALESPERSON
(3NF) and
SALESPERSON-
CUSTOMER (1NF).
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Second Normal Form (2NF)
• Remove any partially dependent attributes and place
them in another relation
• A partial dependency is when the data are dependent on
a part of a primary key
• A relation is created for the data that are only dependent
on part of the key and another for data that are
dependent on both parts
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.18 Second Normal Form
The relation SALESPERSON -
CUSTOMER is separated into
a relation called CUSTOMER
- WAREHOUSE (2NF) and a
relation called SALES (1NF).
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Third Normal Form (3NF)
• Must be in 2NF
• Remove any transitive dependencies
• A transitive dependency is when nonkey attributes are
dependent not only on the primary key, but also on a
nonkey attribute
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.20 Third Normal Form
The relation CUSTOMER-WAREHOUSE is separated into two relations called
CUSTOMER (1NF) and WAREHOUSE (1NF).
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.22 AlS. Well Hydraulic
Company E-R Diagram
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Using the Entity-Relationship Diagram to
Determine Record Keys
• When the relationship is one-to-many, the primary key
of the file at the one end of the relationship should be
contained as a foreign key on the file at the many end
of the relationship
• A many-to-many relationship should be divided into
two one-to-many relationships with an associative
entity in the middle
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Guidelines for Master File/Database
Relation Design
• Each separate data entity should create a master
database table
• A specific data field should exist on one master table
• Each master table or database relation should have
programs to create, read, update, and delete the
records
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Integrity Constraints
• Entity integrity
• Referential integrity
• Domain integrity
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Entity Integrity
• The primary key cannot have a null value
• If the primary key is a composite key, none of the fields in
the key can contain a null value
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Referential Integrity (1 of 3)
• Referential integrity governs the nature of records in a
one-to-many relationship
• Referential integrity means that all foreign keys in the
many table (the child table) must have a matching record
in the parent table
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Referential Integrity (2 of 3)
• Referential integrity implications:
– You cannot add a record in the child (many) table
without a matching record in the parent table
– You cannot change a primary key that has matching
child table records
– You cannot delete a record that has child records
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Referential Integrity (3 of 3)
• Implemented in two ways:
– A restricted database updates or deletes a key only if
there are no matching child records
– A cascaded database will delete or update all child
records when a parent record is deleted or changed
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Domain Integrity
• Domain integrity rules are used to validate the data
• Domain integrity has two forms:
– Check constraints, which are defined at the table level
– Rules, which are defined as separate objects and can
be used within a number of fields
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Anomalies
• Data redundancy
• Insert anomaly
• Deletion anomaly
• Update anomaly
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data Redundancy
• When the same data is stored in more than one place in
the database
• Solved by creating tables that are in third normal form
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Insert Anomaly
• Occurs when the entire primary key is not known and the
database cannot insert a new record, which would violate
entity integrity
• Can be avoided by using a sequence number for the
primary key
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Deletion Anomaly
• Happens when a record is deleted that results in the loss
of other related data
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Update Anomaly
• When a change to one attribute value causes the
database to either contain inconsistent data or causes
multiple records to need changing
• May be prevented by making sure tables are in third
normal form
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Retrieving and Presenting Database Data
• Choose a relation from the database
• Join two relations together
• Project columns from the relation
• Select rows from the relation
• Derive new attributes
• Index or sort rows
• Calculate totals and performance measures
• Present data
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Denormalization
• Denormalization is the process of taking the logical data
model and transforming it into an efficient physical model
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data Warehouses and Database
Differences (1 of 2)
• Data warehouses are used to organize information for
quick and effective queries
• In the data warehouse, data are organized around major
subjects
• Data in the warehouse are stored as summarized rather
than detailed raw data
• Data in the data warehouse cover a much longer time
frame than in a traditional transaction-oriented database
• Data warehouses are organized for fast queries
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data Warehouses and Database
Differences (2 of 2)
• Data warehouses are usually optimized for answering
complex queries, known as OLAP
• Data warehouses allow for easy access via data-
mining software
• Data warehouses include multiple databases that
have been processed so that data are uniformly
defined
• Data warehouses usually include data from outside
sources
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Online Analytic Processing
• Online analytic processing (OLAP) is meant to answer
decision makers’ complex questions by defining a
multidimensional database
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data-Mining Decision Aids
• Software
– Statistical analysis
– Decision trees
– Neural networks
– Intelligent agents
– Fuzzy logic
– Data visualization
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data-Mining Patterns
• Associations—patterns that occur together
• Sequences—patterns of actions that take place over a
period of time
• Clustering—patterns that develop among groups of
people
• Trends—the patterns that are noticed over a period of
time
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.27 Data Mining
Data mining collects
personal
information about
customers in an
effort to be more
specific in
interpreting and
anticipating their
preferences
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data-Mining Problems
• Costs may be too high to justify
• Has to be coordinated
• Ethical aspects
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Business Intelligence (BI)
• Business intelligence is a decision support system (DSS)
for organizational decision makers
• It is composed of features that gather and store data
• It uses knowledge management approaches combined
with analysis
• This becomes input to decision makers’ decision-making
processes
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Business Intelligence
• Business intelligence is built around processing large
volumes of data
• Big data is when data sets become too large or too
complex to be handled with traditional tools or within
traditional databases or data warehouses
• Big data is a strategy that allows organizations to cope
with ever-increasing numbers of data from a myriad of
sources
– Human generated
– Generated via sensors of some type
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Analyzing Business Intelligence
• Five prominent methods are used for analyzing business
intelligence
– Slice-and-dice drilldown
– Ad hoc queries
– Real-time analysis
– Forecasting
– Scenarios
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data Analytics
• Data analytics use algorithms to analyze vast amounts
of data in databases or to analyze huge quantities of
unstructured data
• Analytics can refer to:
– Solving business problems and the decision-
making process
– The analytical approaches that help organizations
create value
– A broader term is business intelligence (BI)
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Eight Aspects of Business Analysis (1 of 3)
1. Ensure the quality of the data
2. Facilitate communication between users and data
analytics professionals
3. Educate users and decision makers about how to use
analytical tools and how to interpret reports based on
use of those tools
4. Create reports that can be user generated
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Eight Aspects of Business Analysis (2 of 3)
5. Work with users who prefer to have a data analytics
professional create reports
6. Develop a collaborative business intelligence platform
by collaborating on the creation of reports and making it
possible for decision makers to share insights
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Eight Aspects of Business Analysis (3 of 3)
7. Ensure that the process of data analytics has the
desired impact on a business
8. Provide deep insights and understanding that support
decision makers and data analysts in making sense of
their individual and organizational experience
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Data Lakes
• Repositories for raw data and normalized and
unnormalized data
• Data deposited needs not be structured or predefined
• Can be queried
• Are intended to make use of the large amount of data an
organization collects
• Can provide insights into customer behavior
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Figure 13.28 Differences Between a Data
Warehouse and a Data Lake
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Blockchains (1 of 5)
• A blockchain network is a data structure built on an
Internet platform that permits sharing and creation of a
digital ledger of data
• Can be shared with others on a network that is either
public or private
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Blockchains (2 of 5)
• Useful whenever a business or an individual wants to
possess a verifiable electronic record to track any kind of
business asset
• Blockchains came into the public consciousness with the
introduction of bitcoins
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Blockchains (3 of 5)
• Blockchain is especially useful in food supply chains for
tracking perishable products
• A blockchain is an open, immutable record of
transactions
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Blockchains (4 of 5)
• Are meant to:
– Improve security
– Cut down on risk
– Improve reliability and efficiency when conducting
transactions of all sorts
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Blockchains (5 of 5)
• Blockchains can be thought of as distributed databases
that feature controls on information that also store
information and share it
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Permissioned Blockchains
• Businesses use special permissioned blockchains which
offer
– Better privacy
– Upgraded auditability
– Increased efficiency for operations
• A permissioned blockchain means that transactions can
be trusted
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Web 3.0
• Just being built
• About decentralization of data and data ownership
• Users own their own data
• Blockchain based
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Summary (1 of 3)
• Storing data
– Individual files
– Database
• Reality, data, metadata
• Conventional files
– Type
– Organization
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Summary (2 of 3)
• Database
– Relational
– Hierarchical
– Network
• E-R diagrams
• Normalization
– First normal form
– Second normal form
– Third normal form
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Summary (3 of 3)
• Denormalization
• Data warehouse
• Data mining
• Business Intelligence (BI)
• Data lakes
• Blockchains
• Web 3.0
Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
Copyright
This work is protected by United States copyright laws and is
provided solely for the use of instructors in teaching their
courses and assessing student learning. Dissemination or sale of
any part of this work (including on the World Wide Web) will
destroy the integrity of the work and is not permitted. The work
and materials from it should never be made available to students
except by instructors using the accompanying text in their
classes. All recipients of this work are expected to abide by these
restrictions and to honor the intended pedagogical purposes and
the needs of other instructors who rely on these materials.

More Related Content

PPT
PDF
Dbms intro
PPTX
7 - Enterprise IT in Action
PPTX
Databases and its representation
PPT
Ch # 09 database management system
PPTX
RDBMS to NoSQL. An overview.
PPTX
Data concepts
PPTX
Data base management system(DBMS), sourav mathur
Dbms intro
7 - Enterprise IT in Action
Databases and its representation
Ch # 09 database management system
RDBMS to NoSQL. An overview.
Data concepts
Data base management system(DBMS), sourav mathur

Similar to Database design- system analysis and design (20)

PDF
Module03
PPT
D B M S Animate
PPTX
PPT
Mc leod9e ch06 database management systems
DOCX
Database DESIGN CONCEPTSDr. Dexter Francis2Data Design
PPTX
Module 3 Database systems for DIPLOMA.pptx
PPT
PPTX
Business Intelligence.pptx
PDF
INT 1010 07-3.pdf
PDF
dbms Unit 1.pdf arey bhai teri maa chodunga
PPT
Unit 1
PPTX
DBMS for beginners with examples.pptx
PDF
Ch 6.pdf
PDF
CBSE XII Database Concepts And MySQL Presentation
PPTX
SESSION 1 DATABASE MANAGEMENT SYSTEM.pptx
PPT
Dbms Lec Uog 02
PPTX
Chapter 09
DOCX
Database
PPTX
Information Technology 104
Module03
D B M S Animate
Mc leod9e ch06 database management systems
Database DESIGN CONCEPTSDr. Dexter Francis2Data Design
Module 3 Database systems for DIPLOMA.pptx
Business Intelligence.pptx
INT 1010 07-3.pdf
dbms Unit 1.pdf arey bhai teri maa chodunga
Unit 1
DBMS for beginners with examples.pptx
Ch 6.pdf
CBSE XII Database Concepts And MySQL Presentation
SESSION 1 DATABASE MANAGEMENT SYSTEM.pptx
Dbms Lec Uog 02
Chapter 09
Database
Information Technology 104
Ad

Recently uploaded (20)

PDF
Getting Started with Data Integration: FME Form 101
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
Five Habits of High-Impact Board Members
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PDF
Developing a website for English-speaking practice to English as a foreign la...
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
Unlock new opportunities with location data.pdf
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
August Patch Tuesday
PPTX
Benefits of Physical activity for teenagers.pptx
Getting Started with Data Integration: FME Form 101
Zenith AI: Advanced Artificial Intelligence
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Getting started with AI Agents and Multi-Agent Systems
A contest of sentiment analysis: k-nearest neighbor versus neural network
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
O2C Customer Invoices to Receipt V15A.pptx
A comparative study of natural language inference in Swahili using monolingua...
Five Habits of High-Impact Board Members
1 - Historical Antecedents, Social Consideration.pdf
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
sustainability-14-14877-v2.pddhzftheheeeee
Developing a website for English-speaking practice to English as a foreign la...
Final SEM Unit 1 for mit wpu at pune .pptx
Unlock new opportunities with location data.pdf
observCloud-Native Containerability and monitoring.pptx
Enhancing emotion recognition model for a student engagement use case through...
DP Operators-handbook-extract for the Mautical Institute
August Patch Tuesday
Benefits of Physical activity for teenagers.pptx
Ad

Database design- system analysis and design

  • 1. Systems Analysis & Design Eleventh Edition Chapter # 13 Designing Databases Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved
  • 2. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Learning Objectives (1 of 2) 13.1 Appreciate what a database is. 13.2 Understand basic data concepts. 13.3 Learn not to efficiently store data using normalization. 13.4 Know the guidelines for designing main files and database relations. 13.5 Grasp the notion of normalization and why it is useful.
  • 3. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Learning Objectives (2 of 2) 13.6 Understand the concept of data warehouses. 13.7 Comprehend the relationship of business intelligence to data warehouses. 13.8 Understand data analytics in helping systems and people make decisions. 13.9 Learn about data lakes and how they differ from data warehouses. 13.10 Gain basic knowledge about blockchains and their usefulness.
  • 4. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Major Topics • Databases • Normalization • Key design • Using the database • Data warehouses • Data lakes • Data mining • Business intelligence
  • 5. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data Storage (1 of 2) • The data must be available when the user wants to use them • The data must be accurate and consistent • Efficient storage of data as well as efficient updating and retrieval • It is necessary that information retrieval be purposeful
  • 6. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data Storage (2 of 2) • There are two approaches to the storage of data in a computer-based system: – Store the data in individual files, each unique to a particular application – Store data in a database ▪ A database is a formally defined and centrally controlled store of data intended for use in many different applications
  • 7. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Databases (1 of 2) • Effectiveness objectives of the database: – Ensuring that data can be shared among users for a variety of applications – Maintaining data that are both accurate and consistent – Ensuring data required for current and future applications will be readily available – Allowing the database to evolve as the needs of the users grow – Allowing users to construct their personal view of the data without concern for the way the data are physically stored
  • 8. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Databases (2 of 2) • Databases are highly useful for organizations to store, access, and retrieve their data • Security issues will arise • Risk assessment is an essential part of database planning
  • 9. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Database Security (1 of 2) • Cyber attacks, including malware and denial of service attacks, are on the increase • Getting users involved in business continuity planning and disaster recovery approaches will help make databases and the organization resilient if security is breached
  • 10. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Database Security (2 of 2) • Companies must consider these questions: – What is the likelihood of an attack? – What is the value of the data that are being protected? – What are the implications of a security breach for the company’s customers and, in the aftermath, for the company’s image?
  • 11. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Reality, Data, and Metadata • Reality – The real world • Data – Collected about people, places, or events in reality and eventually stored in a file or database • Metadata – Information that describes data
  • 12. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.1 Reality, Data, and Metadata
  • 13. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Entities • Any object or event about which someone chooses to collect data • May be a person, place, or thing • May be an event or unit of time
  • 14. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Entity Subtype • An entity subtype is a special one-to-one relationship used to represent additional attributes, which may not be present on every record of the first entity • This eliminates null fields stored on database tables • For example, students who have internships: the STUDENT MASTER should not have to contain information about internships for each student
  • 15. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Relationships • Relationships – One-to-one – One-to-many – Many-to-many • A single vertical line represents one • A crow’s foot represents many
  • 16. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.2 Entity-Relationship Diagrams Associations Entity-relationship (E-R) diagrams can show one-to-one, one-to- many, or many-to-many associations.
  • 17. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.3 Entity-Relationship Symbols and Their Meanings
  • 18. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.4 The Entity-Relationship Diagram for Patient Treatment Attributes can be listed alongside the entities. The key is underlined.
  • 19. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Attributes, Records, and Keys • Attributes represent some characteristic of an entity • Records are a collection of data items that have something in common with the entity described • Keys are data items in a record used to identify the record
  • 20. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.6 Attributes and Keys
  • 21. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Key Types • Key types are: – Primary key—unique attribute for the record – Candidate key—an attribute or collection of attributes, that can serve as a primary key – Secondary key—a key which may not be unique, used to select a group of records – Composite key—a combination of two or more attributes representing the key
  • 22. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Metadata • Data about the data in the file or database • Describe the name given and the length assigned each data item • Also describe the length and composition of each of the records
  • 23. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.7 Metadata Metadata includes a description of what the value of each data item looks like.
  • 24. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Files • A file contains groups of records used to provide information for operations, planning, management, and decision making • Files can be used for storing data for an indefinite period of time, or they can be used to store data temporarily for a specific purpose
  • 25. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved File Types • Master file • Table file • Transaction file • Report file
  • 26. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Master and Table Files • Master files: – Contain records for a group of entities – Contain all information about a data entity • Table files: – Contain data used to calculate more data or performance measures – Usually read-only by a program
  • 27. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Transaction and Report Files • Transaction records: – Used to enter changes that update the master file and produce reports • Report files: – Used when it is necessary to print a report when no printer is available – Useful because users can take files to other computer systems and output to specialty devices
  • 28. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Relational Databases • A database is intended to be shared by many users • There are three structures for storing database files: – Relational database structures – Hierarchical database structures – Network database structures
  • 29. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.8 Database Design Database design includes synthesizing user reports, user views, and logical and physical designs
  • 30. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.9 Relational Data Structure In a relational data structure, data are stored in many tables. Item Price Item-# Title Price B235 Guys and Dolls 8.99 B521 My Fair Lady 6.99 B894 42nd Street 10.99 B992 A Chorus Line 10.99 Order Order-# Last Name I Street Address City St Charge Acct 10784 MacRae G 2314 Curly Circle Lincoln NE 45-4654-76 10796 Jones S 34 Dream Lane Oklahoma City OK 44-9876-74 11821 Preston R 1008 Madison Ave. River City IA 34-7642-64 11845 Channing C 454 Harmonia St. New York NY 34-0876-87 11872 Kiley R 765 Dulcinea Drive La Mancha CA 65-8798-87 Item Status Item-# Order-# Status B235 10784 Shipped 5/12 B235 19796 Shipped 5/14 B235 11872 In Process B521 11821 In Process B894 11845 Backordered B894 11872 Shipped 5/12 B992 10784 Shipped 5/12
  • 31. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Normalization • Normalization is the transformation of complex user views and data stores to a set of smaller, stable, and easily maintainable data structures • The main objective of the normalization process is to simplify all the complex data items that are often found in user views
  • 32. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.10 Normalization of a Relation is Accomplished in Three Major Steps
  • 33. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data Model Diagrams • Shows data associations of data elements • Each entity is enclosed in an ellipse • Arrows are used to show the relationships
  • 34. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.13 Drawing Data Model Drawing data model diagrams for data associations sometimes helps analysts appreciate the complexity of data storage.
  • 35. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved First Normal Form (1NF) • Remove repeating groups • The primary key with repeating group attributes are moved into a new table • When a relation contains no repeating groups, it is in first normal form
  • 36. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.16 The Original Unnormalized Relation The original unnormalized relation SALES-REPORT is separated into two relations, SALESPERSON (3NF) and SALESPERSON- CUSTOMER (1NF).
  • 37. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Second Normal Form (2NF) • Remove any partially dependent attributes and place them in another relation • A partial dependency is when the data are dependent on a part of a primary key • A relation is created for the data that are only dependent on part of the key and another for data that are dependent on both parts
  • 38. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.18 Second Normal Form The relation SALESPERSON - CUSTOMER is separated into a relation called CUSTOMER - WAREHOUSE (2NF) and a relation called SALES (1NF).
  • 39. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Third Normal Form (3NF) • Must be in 2NF • Remove any transitive dependencies • A transitive dependency is when nonkey attributes are dependent not only on the primary key, but also on a nonkey attribute
  • 40. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.20 Third Normal Form The relation CUSTOMER-WAREHOUSE is separated into two relations called CUSTOMER (1NF) and WAREHOUSE (1NF).
  • 41. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.22 AlS. Well Hydraulic Company E-R Diagram
  • 42. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Using the Entity-Relationship Diagram to Determine Record Keys • When the relationship is one-to-many, the primary key of the file at the one end of the relationship should be contained as a foreign key on the file at the many end of the relationship • A many-to-many relationship should be divided into two one-to-many relationships with an associative entity in the middle
  • 43. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Guidelines for Master File/Database Relation Design • Each separate data entity should create a master database table • A specific data field should exist on one master table • Each master table or database relation should have programs to create, read, update, and delete the records
  • 44. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Integrity Constraints • Entity integrity • Referential integrity • Domain integrity
  • 45. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Entity Integrity • The primary key cannot have a null value • If the primary key is a composite key, none of the fields in the key can contain a null value
  • 46. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Referential Integrity (1 of 3) • Referential integrity governs the nature of records in a one-to-many relationship • Referential integrity means that all foreign keys in the many table (the child table) must have a matching record in the parent table
  • 47. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Referential Integrity (2 of 3) • Referential integrity implications: – You cannot add a record in the child (many) table without a matching record in the parent table – You cannot change a primary key that has matching child table records – You cannot delete a record that has child records
  • 48. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Referential Integrity (3 of 3) • Implemented in two ways: – A restricted database updates or deletes a key only if there are no matching child records – A cascaded database will delete or update all child records when a parent record is deleted or changed
  • 49. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Domain Integrity • Domain integrity rules are used to validate the data • Domain integrity has two forms: – Check constraints, which are defined at the table level – Rules, which are defined as separate objects and can be used within a number of fields
  • 50. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Anomalies • Data redundancy • Insert anomaly • Deletion anomaly • Update anomaly
  • 51. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data Redundancy • When the same data is stored in more than one place in the database • Solved by creating tables that are in third normal form
  • 52. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Insert Anomaly • Occurs when the entire primary key is not known and the database cannot insert a new record, which would violate entity integrity • Can be avoided by using a sequence number for the primary key
  • 53. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Deletion Anomaly • Happens when a record is deleted that results in the loss of other related data
  • 54. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Update Anomaly • When a change to one attribute value causes the database to either contain inconsistent data or causes multiple records to need changing • May be prevented by making sure tables are in third normal form
  • 55. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Retrieving and Presenting Database Data • Choose a relation from the database • Join two relations together • Project columns from the relation • Select rows from the relation • Derive new attributes • Index or sort rows • Calculate totals and performance measures • Present data
  • 56. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Denormalization • Denormalization is the process of taking the logical data model and transforming it into an efficient physical model
  • 57. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data Warehouses and Database Differences (1 of 2) • Data warehouses are used to organize information for quick and effective queries • In the data warehouse, data are organized around major subjects • Data in the warehouse are stored as summarized rather than detailed raw data • Data in the data warehouse cover a much longer time frame than in a traditional transaction-oriented database • Data warehouses are organized for fast queries
  • 58. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data Warehouses and Database Differences (2 of 2) • Data warehouses are usually optimized for answering complex queries, known as OLAP • Data warehouses allow for easy access via data- mining software • Data warehouses include multiple databases that have been processed so that data are uniformly defined • Data warehouses usually include data from outside sources
  • 59. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Online Analytic Processing • Online analytic processing (OLAP) is meant to answer decision makers’ complex questions by defining a multidimensional database
  • 60. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data-Mining Decision Aids • Software – Statistical analysis – Decision trees – Neural networks – Intelligent agents – Fuzzy logic – Data visualization
  • 61. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data-Mining Patterns • Associations—patterns that occur together • Sequences—patterns of actions that take place over a period of time • Clustering—patterns that develop among groups of people • Trends—the patterns that are noticed over a period of time
  • 62. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.27 Data Mining Data mining collects personal information about customers in an effort to be more specific in interpreting and anticipating their preferences
  • 63. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data-Mining Problems • Costs may be too high to justify • Has to be coordinated • Ethical aspects
  • 64. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Business Intelligence (BI) • Business intelligence is a decision support system (DSS) for organizational decision makers • It is composed of features that gather and store data • It uses knowledge management approaches combined with analysis • This becomes input to decision makers’ decision-making processes
  • 65. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Business Intelligence • Business intelligence is built around processing large volumes of data • Big data is when data sets become too large or too complex to be handled with traditional tools or within traditional databases or data warehouses • Big data is a strategy that allows organizations to cope with ever-increasing numbers of data from a myriad of sources – Human generated – Generated via sensors of some type
  • 66. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Analyzing Business Intelligence • Five prominent methods are used for analyzing business intelligence – Slice-and-dice drilldown – Ad hoc queries – Real-time analysis – Forecasting – Scenarios
  • 67. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data Analytics • Data analytics use algorithms to analyze vast amounts of data in databases or to analyze huge quantities of unstructured data • Analytics can refer to: – Solving business problems and the decision- making process – The analytical approaches that help organizations create value – A broader term is business intelligence (BI)
  • 68. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Eight Aspects of Business Analysis (1 of 3) 1. Ensure the quality of the data 2. Facilitate communication between users and data analytics professionals 3. Educate users and decision makers about how to use analytical tools and how to interpret reports based on use of those tools 4. Create reports that can be user generated
  • 69. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Eight Aspects of Business Analysis (2 of 3) 5. Work with users who prefer to have a data analytics professional create reports 6. Develop a collaborative business intelligence platform by collaborating on the creation of reports and making it possible for decision makers to share insights
  • 70. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Eight Aspects of Business Analysis (3 of 3) 7. Ensure that the process of data analytics has the desired impact on a business 8. Provide deep insights and understanding that support decision makers and data analysts in making sense of their individual and organizational experience
  • 71. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Data Lakes • Repositories for raw data and normalized and unnormalized data • Data deposited needs not be structured or predefined • Can be queried • Are intended to make use of the large amount of data an organization collects • Can provide insights into customer behavior
  • 72. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Figure 13.28 Differences Between a Data Warehouse and a Data Lake
  • 73. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Blockchains (1 of 5) • A blockchain network is a data structure built on an Internet platform that permits sharing and creation of a digital ledger of data • Can be shared with others on a network that is either public or private
  • 74. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Blockchains (2 of 5) • Useful whenever a business or an individual wants to possess a verifiable electronic record to track any kind of business asset • Blockchains came into the public consciousness with the introduction of bitcoins
  • 75. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Blockchains (3 of 5) • Blockchain is especially useful in food supply chains for tracking perishable products • A blockchain is an open, immutable record of transactions
  • 76. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Blockchains (4 of 5) • Are meant to: – Improve security – Cut down on risk – Improve reliability and efficiency when conducting transactions of all sorts
  • 77. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Blockchains (5 of 5) • Blockchains can be thought of as distributed databases that feature controls on information that also store information and share it
  • 78. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Permissioned Blockchains • Businesses use special permissioned blockchains which offer – Better privacy – Upgraded auditability – Increased efficiency for operations • A permissioned blockchain means that transactions can be trusted
  • 79. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Web 3.0 • Just being built • About decentralization of data and data ownership • Users own their own data • Blockchain based
  • 80. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Summary (1 of 3) • Storing data – Individual files – Database • Reality, data, metadata • Conventional files – Type – Organization
  • 81. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Summary (2 of 3) • Database – Relational – Hierarchical – Network • E-R diagrams • Normalization – First normal form – Second normal form – Third normal form
  • 82. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Summary (3 of 3) • Denormalization • Data warehouse • Data mining • Business Intelligence (BI) • Data lakes • Blockchains • Web 3.0
  • 83. Copyright © 2024, 2019, 2014 Pearson Education, Inc. All Rights Reserved Copyright This work is protected by United States copyright laws and is provided solely for the use of instructors in teaching their courses and assessing student learning. Dissemination or sale of any part of this work (including on the World Wide Web) will destroy the integrity of the work and is not permitted. The work and materials from it should never be made available to students except by instructors using the accompanying text in their classes. All recipients of this work are expected to abide by these restrictions and to honor the intended pedagogical purposes and the needs of other instructors who rely on these materials.

Editor's Notes

  • #1: If this PowerPoint presentation contains mathematical equations, you may need to check that your computer has the following installed: 1) MathType Plugin 2) Math Player (free versions available) 3) NVDA Reader (free versions available)
  • #5: Data storage is considered to be the heart of an information system. The information obtained from the stored data must be in a form useful for managing, planning, controlling, or decision making.
  • #6: File system advantages: can be designed and built rapidly any concerns about data availability and security can be minimized File system disadvantages: often designed only with immediate needs in mind expensive programming time for file and program development and maintenance stored data will be redundant updating files is more time consuming data integrity is an issue
  • #7: A database is a central source of data meant to be shared by many users for a variety of applications.
  • #12: Within the realm of reality are entities and attributes; within the realm of actual data are record occurrences and data item occurrences; within the real of metadata are record definitions and data item definitions. Long Description: The image shows reality, data and metadata connected to each other. Within the realm of reality, it shows entities and attributes, both are connected to each other. Within the realm of Data, it shows record occurrences and data item occurrences are connected to each other. Within the realm of metadata, it shows record definitions and data item definitions connected to each other.
  • #13: Examples: a salesperson a city a product machine breakdown a sale a month or year
  • #15: Relationships are associations between entities. Self-join relationship—an entity with a relationship connecting to itself.
  • #16: Long Description: An E-R diagram shows relationship and examples of E-R diagrams. The information is depicted as follows. 1, One to One. One product has one product package and is depicted by a solid line with double stroke at each entity end. One Employee is assigned to one office and is depicted by a solid line with double stroke at each entity end. 2, One to Many, or Many to One. One physician treats many patients and is depicted by a solid line with double stroke at physician entity to patient entity with an upwards arrow to bar. Many Employees belong to One Department, as depicted by a solid line with an upward arrow to bar from Employee entity to Department entity with double strokes. 3, Many to Many. Many students take multiple courses, as depicted by a solid line with an upward arrow to bar from both the entities. Many salespersons serve multiple cities, as depicted by a solid line with an upward arrow to bar from both the entities.
  • #17: Associative entity—used to join two entities. Attributive entity—used for repeating groups. Long Description: The image shows three columns, Symbol, Official Explanation and What it really means. The information depicted as Symbol, Official Explanation, What it really means. A rectangle: Entity: A class of persons, places, or things. A diamond shape inside a rectangle touching all the four sides: Associative entity: Used to join two entities A circle inside a rectangle touching all the four sides: Attributive entity: Used for repeating groups. A horizontal solid line with double strokes at the right end: To 1 relationship: Exactly one. A horizontal solid line with leftwards arrow to bar at right end: To many relationship: One or more. A horizontal solid line with a circle and bar at the right end: To 0 or 1 relationship: Only zero or one. A horizontal solid line with leftwards arrow to circle: To 0 or more relationship: can be zero, one, or more. A horizontal solid line with leftwards arrow: To more than 1 relationship: Greater than one.
  • #18: The attributes are listed next to each of the entities, and the key is underlined. Long Description: An E R diagram depicts the relationship between various entities and each entity have attributes with a key. The information is depicted as Attributes with key: relationship. 1, Physician. Physician name (key), Physician address, Physician phone, Specialty: One physician treats many patients. 2, Patient. Patient name (key), Patient address, Patient phone, Date-First-Visit: Many patients are taken care by one physician, many patients subscribes to one Insurance carrier and one patient experiences many treatments. 3, Insurance Carrier. Carrier name (key), Carrier address, Plan description: One Insurance carrier insures many patients. 4, Treatment. Product name (key), Patient name (key), Description, Date, Symptom: Many treatments are given to one patient and many treatments include one prescription. 5, Prescription Product name (key), Dosage, Manufacturer, Amount: One prescription belongs to many treatments.
  • #19: Data item is used interchangeably with attribute.
  • #20: Long Description: The image shows seven column heads, order number, last name, initial, street address, city, state and credit card depicted as a record. Order number is depicted as key and the other heads are Attributes. All the heads together are record.
  • #21: A primary key should be minimal and contain no extra attributes than are necessary to identify a record.
  • #23: Long Description: Fields are depicted as follows. N: Numeric. A: Alphanumeric. D: Date M M slash D D slash Y Y Y Y. Dollar sign: Currency. M: Memo. The image shows a table with two columns, Data Item and Value. The third column shows length of data item. The information depicted is as follows. (Data Item: Value with length). Salesperson Number: N with length as 5. Salesperson Name: A with length as 20. Company Name: A with length 26. Address: A with length 36. Sales: N with length 9.2. Width: N with length as 2. Height: N with length as 2. Length: N with length as 2. Weight: N with length as 2. Mailing Address: A with length as 36. Return Address: A with length as 36. Product(s): A with length as 4. Description(s): A with length as 30. Quantity Ordered: N with length as 2. Last Name of Customer: A with length as 24. First Initial: A with length as 1. Street Address: A with length as 28. City: A with length as 12. State: A with length as 2. Zip Code: N with length as 9. Credit Card Number: N with length as 10. Date Order Was Placed: D with length as 8 M M slash D D slash Y Y Y Y (special format specified for date field). Amount: dollar sign with length as 7.2 (7.2 means the field takes up 7 digits, two of which are right of the decimal). Status: A with length as 22.
  • #26: Each record of a Master file generally contains a primary key and several secondary keys. Examples: patient records customer records a personnel file a parts inventory.
  • #28: An analyst today would typically design a relational database.
  • #29: Long Description: The image shows that User reports (tabular outputs, Graphs, etc.) leads to User Views or User Schema (User's Description of data needed), which is transformed into Conceptual Schema (Logical design model of the database) and further into Internal schema (Physical design model of the database).
  • #31: For relational tables to be useful and manageable, the relational tables must first be normalized.
  • #32: Step 1: remove all repeating groups and identify the primary key—the relation needs to be broken up into two or more relations (1NF) Step 2: remove partial dependencies—ensures that all nonkey attributes are fully dependent on the primary key. All partial dependencies are removed and placed in another relation. Step 3: remove any transitive dependencies—one in which nonkey attributes are dependent on other nonkey attributes. Long Description: The image shows transformation from un-normalized relationship to third normal form. User Views shows un-normalized relationship. Step 1: Remove repeating groups. It leads to Normalized relations (1 N F). Step 2: Remove partial dependencies. It leads to Second Normal form relations (2 N F). Step 3: Remove transitive dependencies. It leads to Third Normal form (3 N F).
  • #33: Although it is possible to draw these relationships with an E-R diagram, it is sometimes easier to use the simpler bubble diagram to model the data.
  • #34: Long Description: The image shows a data model diagram depicting data as: One Salesperson name is associated with one Sales person number and one Sales area could be covered by many Salesperson numbers. One Salesperson number can be associated with many customer numbers. One Customer number is associated with one Customer name. One Warehouse number is associated with one warehouse location. One Warehouse number and one Warehouse location can service many customer numbers. Sales amount can be determined only by determining the total of Salesperson numbers and customer numbers.
  • #36: Long Description: The image shows that Sales report relation is split into Salesperson and Salesperson customer relation. Sales report originally consisted of eight columns, Salesperson number, Salesperson name, Sales area, Customer number, Customer name, Warehouse number, Warehouse location and Sales amount. A new relation, Salesperson is formed with three columns, Salesperson number, Salesperson name and Sales area. Another new relation, Salesperson customer is formed with six columns, Salesperson number, Customer number, Customer name, Warehouse number, Warehouse location and Sales amount.
  • #38: Long Description: The image shows the Salesperson customer relation split into Customer warehouse and Sales relation. Salesperson customer originally consisted of six columns, Salesperson number, Customer number, Customer name, Warehouse number, Warehouse location and Sales amount. A new relation, Customer warehouse is formed with four columns, Customer number, Customer name, Warehouse number and Warehouse location. Another new relation, Sales is formed with three columns, Salesperson number, Customer number and Sales amount.
  • #40: Long Description: The image shows Customer warehouse relation split into Customer and Warehouse relation. Customer warehouse originally consists of four columns, Customer number, Customer name, Warehouse number and Warehouse location. A new relation between Customer number and Customer is formed with three columns, Customer number, Customer name and Warehouse number. Another new relation, Warehouse is formed with two columns, Warehouse number and Warehouse location.
  • #41: Long Description: The E-R diagram shows the relationship between Salesperson, Sales, Customer and Warehouse relations. Salesperson have attributes Salesperson number (key), Salesperson name and Sales area. Sales have attributes Salesperson number (key), Customer number (key) and Sales amount. Customer have attributes as Customer number (key), Customer name and Warehouse number. Warehouse have attributes Warehouse number (key) and Warehouse location. Salesperson and Customer are associated with each other through Sales relation. One salesperson serves many customers. Many customers receive their items from one warehouse.
  • #55: The first and last steps are mandatory, but the six steps in between are optional, depending on how data are to be used.
  • #62: Long Description: The image shows data is maintained internally which is collected from various sources, as follows. External data like customer demographics from municipality. External data like Customer purchase history from credit card. External data like prospects or mailing lists from other companies. Warranty card customer sent in. Information from survey that customer filled out. Customer profile obtained when customer made purchase from the web. An arrow from data maintained internally pointing to a gift pack is labeled, special promotional offer for customer.
  • #72: Long Description: The information is as follows. Data lake consists of raw data, large size, and unstructured. A box for Data Lake shows texts, data placed randomly. Data lake supports machine learning, data analytics, and business intelligence. Data warehouse consists of refined data, medium size, and relational. Data warehouse supports data mining and O L A P.