MIS 301 Information Systems in Organizations Dave Salisbury [email protected] (email) http://www.davesalisbury.com/ (web site) Databases & Data Modeling Data as a resource Samples of how databases are set up in organizations Basic logical modeling of data Basic physical modeling of data Data Integrity Normalization Logical Data Elements Personnel Database Employee Record 1 Payroll File Benefits File Employee Record 2 Employee Record 3 Employee Record 4 Name SS Salary Name SS Salary Name SS Salary Name SS Salary Data Data Data Data Data Data Data Data Data Data Data Data Physical Data Access (Primary) key fields Sequential Access (Tape) Unique identifier for a record Stored in order by key field Can only be retrieved/stored in that order Direct Access (Disk) Hashed File Organization Key transformed into physical address using algorithm Indexed Sequential Access Method (ISAM) Direct access using indexes Sequential when doing batch runs Disadvantages of File Processing Program-Data Dependence Data Redundancy (Duplication of data) Different systems/programs have separate copies of the same data Limited Data Sharing All programs maintain metadata for each file they use No centralized control of data Excessive Program Maintenance 80% of of information systems budget Duplicate (Redundant) Data Problems with Data Redundancy Waste of space to have duplicate data Causes more maintenance headaches The biggest Problem: When data changes in one file, could cause inconsistencies Compromises data integrity Lack of coordination and central control Non-standard file formats Database Central repository of shared data Data is managed by a controlling agent Stored in a standardized, convenient form Requires a database management system (DBMS) Advantages of Database Approach Program-Data Independence Minimal Data Redundancy Different users get different views of the data Enforcement of Standards Leads to increased data integrity/consistency Improved Data Sharing Metadata stored in DBMS, so applications don’t need to worry about data formats All data access is done in the same way Improved Data Quality Constraints, data validation rules Modeling Reality A database must mirror the real world if it is to answer questions about the real world Data Modeling is a design technique for capturing reality STUDENT Social_Security_No Name Major Entity-Relationship Modeling One type of data modeling Represents data in terms of: Entities Relationships Entity-Relationship Modeling Entity-something (real or abstract) that can be identified in a user’s work environment that can be distinguished from other things. Entity Class-(i.e. CUSTOMER) Entity Instance-(e.g. FRED FLINSTONE) Entity-Relationship Modeling Attributes (a.k.a. properties). Describe the characteristics of entities. EMPLOYEE_NAME EMPLOYMENT_DATE Entity-Relationship Modeling Primary keys - identify instances of entities, a.k.a. key attributes SOCIAL_SECURITY_NUMBER Entity-Relationship Modeling Relationships imply constraints on how many entities may occur on one side (or the other) of a given relationship. Types of Relationships one-to-one 1:1 one-to-many 1:N many-to-many N:M Notation in an ER diagram ENTITY Something about which you want to keep data. RELATIONSHIP How things you want to keep data about are tied together. •Attribute – Something you want to store about an entity. •Primary Key – Uniquely identifies an entity (e.g. SID). Entity-Relationship Modeling Places/ Placed by Customer Order Contains/ is Contained in Product Relational Databases Relations (a.k.a. tables) Each row is unique (entity instance) Order is unimportant Each column represents one thing (attribute) Entries are from the same domain (e.g. student) Relational Databases Keys Primary key (uniquely identifies a record) Composite key, a.k.a. Concatenated key (two elements combined are unique) Foreign key (links tables/relations) Data Integrity Rules to make sure your data is valid Entity integrity constraint Referential integrity Primary key cannot be null Ensures valid relationships between data Cannot add a row with no parent Cannot delete a parent without deleting child (cascading) Domain Integrity Data Interrelationships Dependencies and determinants Anomalies Insertion Deletion Update Normalization First normal form Second normal form No repeating groups No partial dependencies Third normal form No transitive dependencies M:N Relationships in Relational Model Places/ Placed by Contains/ is Contained in Order Order Line Customer Product Contains/ is Contained in How this looks in Access 1 OrderID OrderDate CustomerID 1 ORDER LINE OrderID ProductID Quantity PRODUCT ProductID Description Price Weight Supplier 8 ORDER 8 CustomerID FirstName LastName Address City State Zip Telephone 1 8 CUSTOMER
© Copyright 2026 Paperzz