MIS 301- Database

MIS 301
Information Systems in Organizations
Dave Salisbury
[email protected] (email)
http://www.davesalisbury.com/ (web site)
Databases & Data Modeling




Data as a resource
Samples of how databases are set up in
organizations
Basic logical modeling of data
Basic physical modeling of data


Data Integrity
Normalization
Logical Data Elements
Personnel
Database
Employee
Record 1
Payroll
File
Benefits
File
Employee
Record 2
Employee
Record 3
Employee
Record 4
Name SS Salary Name SS Salary Name SS Salary Name SS Salary
Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
Physical Data Access

(Primary) key fields


Sequential Access (Tape)



Unique identifier for a record
Stored in order by key field
Can only be retrieved/stored in that order
Direct Access (Disk)

Hashed File Organization


Key transformed into physical address using algorithm
Indexed Sequential Access Method (ISAM)


Direct access using indexes
Sequential when doing batch runs
Disadvantages of File Processing

Program-Data Dependence


Data Redundancy (Duplication of data)


Different systems/programs have separate copies
of the same data
Limited Data Sharing


All programs maintain metadata for each file they
use
No centralized control of data
Excessive Program Maintenance

80% of of information systems budget
Duplicate (Redundant) Data
Problems with Data Redundancy



Waste of space to have duplicate data
Causes more maintenance headaches
The biggest Problem:




When data changes in one file, could cause
inconsistencies
Compromises data integrity
Lack of coordination and central control
Non-standard file formats
Database




Central repository of shared data
Data is managed by a controlling agent
Stored in a standardized, convenient
form
Requires a database management
system (DBMS)
Advantages of Database Approach

Program-Data Independence


Minimal Data Redundancy


Different users get different views of the data
Enforcement of Standards


Leads to increased data integrity/consistency
Improved Data Sharing


Metadata stored in DBMS, so applications don’t need to
worry about data formats
All data access is done in the same way
Improved Data Quality

Constraints, data validation rules
Modeling Reality


A database must mirror the real world if it is
to answer questions about the real world
Data Modeling is a design technique for
capturing reality
STUDENT
Social_Security_No
Name
Major
Entity-Relationship Modeling


One type of data modeling
Represents data in terms of:


Entities
Relationships
Entity-Relationship Modeling
Entity-something (real or abstract) that
can be identified in a user’s work
environment that can be distinguished
from other things.


Entity Class-(i.e. CUSTOMER)
Entity Instance-(e.g. FRED FLINSTONE)
Entity-Relationship Modeling
Attributes (a.k.a. properties). Describe
the characteristics of entities.


EMPLOYEE_NAME
EMPLOYMENT_DATE
Entity-Relationship Modeling
Primary keys - identify instances of
entities, a.k.a. key attributes

SOCIAL_SECURITY_NUMBER
Entity-Relationship Modeling


Relationships imply constraints on how
many entities may occur on one side (or
the other) of a given relationship.
Types of Relationships



one-to-one 1:1
one-to-many 1:N
many-to-many N:M
Notation in an ER diagram
ENTITY
Something about which
you want to keep data.
RELATIONSHIP
How things you want to
keep data about are tied
together.
•Attribute – Something you want to store about an entity.
•Primary Key – Uniquely identifies an entity (e.g. SID).
Entity-Relationship Modeling
Places/
Placed by
Customer
Order
Contains/ is
Contained in
Product
Relational Databases

Relations (a.k.a. tables)




Each row is unique (entity instance)
Order is unimportant
Each column represents one thing
(attribute)
Entries are from the same domain (e.g.
student)
Relational Databases

Keys



Primary key (uniquely identifies a record)
Composite key, a.k.a. Concatenated key
(two elements combined are unique)
Foreign key (links tables/relations)
Data Integrity


Rules to make sure your data is valid
Entity integrity constraint


Referential integrity




Primary key cannot be null
Ensures valid relationships between data
Cannot add a row with no parent
Cannot delete a parent without deleting child
(cascading)
Domain Integrity
Data Interrelationships


Dependencies and determinants
Anomalies



Insertion
Deletion
Update
Normalization

First normal form


Second normal form


No repeating groups
No partial dependencies
Third normal form

No transitive dependencies
M:N Relationships in Relational Model
Places/
Placed by
Contains/ is
Contained in
Order
Order Line
Customer
Product
Contains/ is
Contained in
How this looks in Access
1
OrderID
OrderDate
CustomerID
1
ORDER LINE
OrderID
ProductID
Quantity
PRODUCT
ProductID
Description
Price
Weight
Supplier
8
ORDER
8
CustomerID
FirstName
LastName
Address
City
State
Zip
Telephone
1
8
CUSTOMER