exam question 2 show Normalization in this database

Lakukan tugas rumah & ujian kamu dengan baik sekarang menggunakan Quizwiz!

Entity (table)

- Can be thought of as a simple 2D grid of data consisting of: --Attributes/fields/columns --Instances/records/tuples/rows - Entities can handle multi-dimensional data --It is easiest to think of them as flat 2D objects consisting of columns and rows

Functional dependence Keys

2 Crucial Concept

Fourth Normal Form (4NF)

A table is in fourth normal form if it is in third normal form and there are no multivalued dependencies

Third Normal Form (3NF)

A table is in third normal form if it is in second normal form and the only determinants it contains are candidate keys

Nonkey Column

An attribute (column) that is not part of the primary key

Transitive Dependency

An attribute functionally depends on another nonkey attribute

Transitive dependency

An attribute functionally depends on another nonkey attribute.

Atomic Attribute

An attribute that cannot be further subdivided to produce meaningful components. EX: A person's last name attribute cannot be meaningfully subdivided.

atomic attribute

An attribute that cannot be further subdivided to produce meaningful components. For example, a person's last name attribute cannot be meaningfully subdivided

nonprime attribute

An attribute that is not part of a key

Non-key Attribute (SEE: NON-PRIME ATTRIBUTE)

An attribute that is not part of a key.

34) Denormalization is the process of joining previously normalized tables back together.

Answer: TRUE Diff: 1 Page Ref: 166-167

This process progressively breaks table into new set of relations based on identified dependencies

Normalization

Atomicity

Not being able to be divided into smaller units

Requirements for good normalized set of tables

Evaluate PK assignments and naming conversations. Identify new attributes and relationships, refine primary keys as required for data granularity, maintain historical accuracy and evaluate using derived attributes.

Boyce Codd Normal Form

Every determinant in table is a candidate key. - Considered special case of 3NF

Boyce-Codd Normal Form (BCNF)

Every determinant in the table is a candidate key

Boyce-Codd normal form

Every determinant in the table should be a candidate key. Equivalent to 3NF for one candidate key. Violated when table contains more than one candidate key.

Boyce-Codd normal form (BCNF)

Every determinant is a candidate key

Boyce-cod normal form

Every determinant is a candidate key

Normalization can rely on itself to make good designs

False

Not all relational tables satisfy 1NF requirements

False

Repeating Group

Group of multiple entries can exist for any single key.

Repeating Group

Group of multiple entries of same type can exist for any single key attribute occurence

Repeating group

Group of multiple entries of same type can exist for any single key attribute occurrence. Proves presence of data redundancies. Enable reducing data redundancies.

Partial Dependency

Functional dependence in which the determinant is only part of the primary key

Partial dependency

Functional dependence in which the determinant is only part of the primary key Assumption: one candidate key.

Levels of normalization

If a data model has passed the first level, we say that it is in first normal form If it has passed the second level, second normal form Etc.

yes, instructor-teacher

Is an associative entity needed, if so between what entities must it be introduced?

no, salary

Is the ERD in 3NF if not, What attribute is the problem?

Which of the following would be the best name for a column representing the charges per hour in a table named JOB?

JOB_CHG_HOUR

Assume you have the following table definition: 1NF (PROJ_NUM, EMP_NUM, PROJ_NAME, EMP_NAME, JOB_CLASS, CHG_HOURS, HOURS). Identify a transitive dependency.

JOB_CLASS --> CHG_HOURS

Join (bowtie) ex. Characters ⋈ (name) Appearances At most |R| * |S| tuples in result

Step 1 of conversion to 2NF:

Make new tables to eliminate partial dependencies

Conversion to second normal form

Make new tables to eliminate partial dependencies. Reassign corresponding dependent attributes. The table is in 1NF, includes no partial dependencies.

Step 1 of conversion to 3NF:

Make new tables to eliminate transitive dependencies (for every transitive dependency, write its determinant as PK for new table)

Conversion to third normal form

Makes new tables to eliminate transitive dependencies. Reassign corresponding dependent attributes. Table is in 2NF, contains no transitive dependencies.

Context ERD

Meaning that the entities (rectangles) and relationships are showing

Repeating Group

More than one entry at a single location in a table

Normalization

a process that enables a database designer to eliminate design problems by decomposing (or reorganizing) the existing table structures.

DBMS (Database Management System)

a program or collection of programs whose function is to manage a database on behalf of the people who use it.

Attribute

a property of an entity (For the entity "Person," attributes could include eye color and height).

In a real-world environment, we must strike a balance between design integrity and ________.

flexibility

Note

The primary key is often called the key in other studies on database management and the relational model. This text will continue to use the term primary key to distinguish between the different definitions of a key that you will encounter throughout this text.

database

a set of interrelated centrally coordinated files

In a normalized database, what is represented by each table?

a single subject

Database

a structure that can store information about multiple types of entities, the attributes of these entities, and the relationships among the entities.

____ keys are usually numeric, they are often automatically generated by the DBMS, they are free of semantic content (they have no special meaning), and they are usually hidden from the end users.

surrogate

Potential problems in the design of a relational database are known as ____. a. update anomalies b. select anomalies c. modification anomalies d. relational anomalies

a. update anomalies

1NF

table contains no repeating groups.

Modality

the Minimum number of times than an instance of an entity can be associated with instances of another entity

primary key

the database attribute or combination of attributes that uniquely identifies a specific row in each table

database system

the database, the dbms and the application programs that access the database through the dbms

All the attributes in an entity should be completely determined by...

the entity instance's primary key

data model

abstract representation of database contents

To put an ERD into 3NF ... (2)

add another entity and a 1:M relationship, if dependent on another secondary attribute (which is not a FK) Or move attribute to other entity, if dependent on a FK

2nd Normal Form (2NF)

for every instance of an entity have a PK with 2 or more components, each attribute of the instance is completely determined by the totality of the instance's PK, not just by some of its components. - When Concatenated PK, each instance attribute is completely dependent on all PK's

second normal form

for every instance of an entity having a pk with 2 or more components, each attribute of the instance is completely determined by the totality of the instance's pk, not just some of its component

If all attributes of a relation are prime....

in 2NF and 3NF

Denormalization

increasing data redundancies to yield less I/O in order to increase processing speed.

external level schema

individual user views of portions of the database

Note, a secondary attribute is an attribute that ...

is not in the PK

Cardinality

is the maximum number of times that an instance of an entity (table) can be associated with instances of another entity (table)

Modality

is the minimum number of times that an instance of an entity can be associated with instances of another entity

Normalization Second Normal Form (2NF) A model is in 2NF if

it is in 1NF, and for every instance of an entity having a PK with 2 or more components, each attribute of the instance is completely determined by the totality of the instance's PK, not just by some of its components

Normalization Third Normal Form (3NF) A model is in 3NF if

it is in 2NF, and for every instance of an entity, each secondary attribute is dependent only on the PK, not on any other secondary attribute

A table is in fourth normal form (4NF) if ______.

it is in 3NF and has no independent multivalued dependencies

Another name for a prime attribute is a(n) ________ attribute.

key

Unnormalized relation or table

one that is not in 1NF

Dependencies based on only a part of a composite primary key are called ________ dependencies.

partial

A column is a nonkey column if it is not a part of the ____________________.

primary key

Database Modeling

The process of creating an ERD is called Database Modeling

Normalization Process

The process of removing repeating groups to produce a first normal form table. Sometimes refers to the process of creating a third normal form table

In an ideal (database design) world, the level of desired granularity is determined at the ________ design or at the requirements gathering phase.

conceptual

data warehouse

contains both detailed and summarized data for a number of years and is used for analysis rather than in real time

data dictionary

contains information about the structure of a database

data query language (DQL)

contains powerful commands that enable users to retrieve, sort, order, and display data

record layout

example customer number, customer name, address, credit limit, balance (alphanumeric or numeric)

Normalization is a ... step procedure

Three

The database relational model does not offer direct support ...

to many-to-many relationships, even though such relationships happen frequently in the realm that is being modeled

A 2NF table can exhibit ________ dependency.

transitive

Assume you have the following table: |JOB_CODE | JOB_DESCRIPTION | JOB_CHG_HOUR | | 511 | Programmer | $35.75 | | 512 | Programmer | $35.75 | How might you eliminate the possibility of duplicate entries for JOB_DESCRIPTION?

use a unique index for JOB_DESCRIPTION

business intelligence

using a data warehouse for strategic decision making

online analytical processing (OLAP)

using queries to guide investigation of hypothesized relationships in data

data mining

using sophisticated statistical analysis, including artificial intelligence techniques

Improving ________ leads to more flexible queries.

atomicity

Candidate key

attribute or composite attribute in a table that can uniquely identify a table.

The normalization process involves assigning ________ to tables based on the concept of determination.

attributes

A(n) ____ is a column or collection of columns on which all columns in the table are functionally dependent. a. index key b. candidate key c. major key d. special key

b. candidate key

Data redundancy produces ________.

data integrity problems

update anomaly

data value updates are not correctly recorded

Lower normalization forms occur (and are even required) in specialized databases known as _______.

data warehouses

data heirarchy

database files record fields

delete anomaly

deleting a row has unintended consequences

________ yields increase performance in a database system.

denormalization

A ________ is any attribute whose value determines other values within a row.

determinant

In BCNF, every ________ in a table is a candidate key.

determinant

Dependencies can be depicted with the help of a dependency ________.

diagram

tuple

each row in a table & contains data about a specific occurence of the type of entity represented by that table

Repeating group steps

Eliminate repeating groups, identify primary key, identify all dependencies.

Step 1 of conversion to 1NF:

Eliminate the repeating groups (nulls)

Types of Relationships

One-to-one, One-to-many, Many-to-many

Unnormalized relation

- a relation (table) that contains a repeating group (or multiple entries for a single record)

A table that has all key attributes defined, no repeating groups, and an identified primary key is said to be in ________.

1NF

All relational tables satisfy the ______ requirements.

1NF

From a structural point of view, 2NF is better than ________.

1NF

Second Normal Form

1NF + Has no partial dependencies

2NF

1NF and has no partial dependencies

Update Anomalies

- identify the existence of potential problems

Alternate keys

- the candidate keys that are not chosen as the primary key.

Goal of Normalization

- to take a table or collection of tables and produce a new collection of tables that represents the same information but that is free of update anomalies.

problems to storing all information in one table

-update anomally -insert anomaly -deletion anomaly

26) The standard sales order line item pattern is a classic example of unneeded normalization.

/Answer: FALSE Diff: 2 Page Ref: 164

Disadvantages of Normalization:

1. More tables (typically) 2. Non-intuitive table design (possibly) 3. Query complexity (multi-table joins) 4. Slower queries (occasionally)

A table that is in 1NF and includes no partial dependencies is said to be in ________.

2NF

A table whose primary key consists of only a single attribute is automatically in ________ when it is in 1NF.

2NF

transitive dependency

A condition in which an attribute is dependent on another attribute that is not part of the primary key

Transitive Dependency

A condition in which an attribute is dependent on another attribute that is not part of the primary key.

Partial Dependency

A dependency of a column on only a portion of the primary key

Dependency Diagram

A diagram that indicates the dependencies among the columns in a table

prime attribute

A key attribute, that is, an attribute that is part of a key or is the whole key. See also key attribute.

Primary key

A non-null (non-blank) attribute or set of non-null attributes that uniquely identifies each instance/row of an entity - Can be simple or composite 1. Best practices: every entity should have a primary key --Can never be null 2. Common practice: go simple when possible --Use a simple PK unless a composite PK offers an additional modeling advantage that can't easily be implemented in another way

Boyce-Codd Normal Form (BCNF)

A relation is in Boyce-Codd normal form if it is is second normal form and the only determinants it contains are canidate keys; also called third normal form in the text

Second Normal Form (2NF)

A relation is in second normal form if it is in first normal form and no nonkey attribute is dependent on only a portion of the primary key

dependency diagram

A representation of all data dependencies (primary key, partial, or transitive) within a table

Dependency Diagram

A representation of all data dependencies (primary key, partial, or transitive) within a table.

dependency diagram

A representation of all data dependencies (primary key, partial, or transitive) within a table.

Fourth normal form (4NF)

A table is in 4NF if it is in 3NF and contains no multiple independent sets of multivalued dependencies.

Boyce-Codd normal form (BCNF)

A special form of third normal form (3NF) in which every determinant is a candidate key. A table that is in BCNF must be in 3NF. See also determinant.

Boyce-Codd normal form (BCNF)

A special type of third normal form (3NF) in which every determinant is a candidate key. A table in BCNF must be in 3NF. See also determinant.

BCNF

A special type of third normal form (3NF) in which every determinant is a candidate key. A table in _____ must be in 3NF

Boyce-Codd Normal Form (BCNF) (SEE: DETERMINANT)

A special type of third normal form (3NF) in which every determinant is candidate key. A table in BCNF must be 3NF.

Database

A usually large collection of data organized especially for rapid search and retrieval

Density of M:N relationship

Actual/possible ratio - This traditional associative entity technique for representing M:N relationships works well for low ratios --Alternative techniques are available for high density scenarios

Associative entity

Added to the ERD to resolve a many to many relationship between two entities.

Conversion to first normal form 1NF

All key attributes are defined, there are no repeating groups in the table, all attributes are dependent on the primary key. Some tables contain partial dependencies.

Data modeling

All names should be limited in length. Should be short and meaningful nouns. Abbreviate names of entities linked through the composite key. Attribute names can be prefixes or suffixes.

Second normal form

All non-key attributes are dependent on the entire key

Normalization process ensures:

All tables are 3NF, higher forms are not likely to be encountered in business environment, works one relation at a time.

ERD - Entity Relationship Diagram

An ERD shows the relationships between the data

Data model

An abstract representation (often graphical) of the data (entities and attributes) and the associations (relationships) between them - Many styles/formats of graphical data models have been proposed over the years --The entity-relationship (ER) model is among the most popular

9) When examining data values as a part of assessing table structure, you should try to determine functional dependencies.

Answer: TRUE Diff: 1 Page Ref: 161-162 Fig 4-1

57) When you are given a set of tables and asked to create a database to store their data, the first step is to ________. A) assess the existing tables' structure and content B) design the database structure C) create one or more new tables D) move the data into the new database E) design the applications that will use the database

Answer: A Diff: 1 Page Ref: 161-162 Fig 4-1

49) A null value in a column may indicate that there is no appropriate value for that attribute.

Answer: TRUE Diff: 1 Page Ref: 171

92) When accessing table structure, how do you determine how many rows are in a table?

Answer: To determine how many rows are in a table, use a SELECT query that contains the SQL COUNT(*) function. This function will return the number of rows in the table. Diff: 1 Page Ref: 161

93) When accessing table structure, how do you determine the number and types of columns in a table?

Answer: To determine the number and types of how many rows are in a table, use a SELECT query that contains the SQL * expression. This expression will return all the columns in the table, and the data types can be determined by examining the values in each column. Diff: 1 Page Ref: 161

determinant

Any attribute in a specific row whose value directly determines other values in that row

Determinant

Any attribute in a specific row whose value directly determines other values in that row.

Clicker Questions (SEE SLIDE 32) Is the ERD in 3NF if not what attribute is the problem? A. Yes B. No Salary C. No, Completiondate D. No, rank

C. No, Completiondate

Clicker Question (SEE SLIDE 13) Is an associative entity needed, if so between what entities must it be introduced A.Yes, course-section B. Yes, section-instructor C. Yes, Instructor-teaching assistant D. No, section-student

C. Yes, Instructor-teaching assistant

What is the trade-off of denormalization?

Can increase performance but results in greater data redundancy

Atomic Attribute

Cannot be further subdivided

Atomic attribute

Cannot be further subdivided

What does cardinality quantify?

Cardinality quantifies an Entity Relationship

Atomicity

Characteristic of an atomic attribute

Key

Column A (or a collection of columns) is the primary key for a relation (table) R, if: Property 1. All columns in R are functionally dependent on A. Property 2. No subcollection of the columns in A (assuming A is a collection of columns and not just a single column) also has Property 1.

Functionally Determines

Column A functionally determines column B if B is functionally dependent on A

Functional Dependence

Column B is functionally dependent on column A (or on a collection of columns) if a value for A determines a single value for B at any one time.

Functionally Dependent

Column B is functionally dependent on column A (or on a collection of columns) if a value for A determines a single value for B at any one time.

|X|

Count of tuples in relation X

N-ary relationships

Create a new associate entity (super bridge) with N FKs to point to the N "sides" of the relationship

Join two tables without using the join operator

Cross Product + Selection

X

Cross product (Cartesian Product) |R| * |S| tuples in R x S

Clicker Question (SEE SLIDE 15) Which of these would be the primary key for the registration table? A. SectionID B. StudentID C. CouseID D. SectionID and StudentID

D. SectionID and StudentID

Clicker Question (SEE SLIDE 24) Which attribute can have multiple values? A. fname B. Iname C. dept D. course ID

D. course ID

3. Many-to-many (M:N)

Each record in X can be associated with many records in Y, and vice versa - Typical implementation: --Create a new associative entity (aka, composite entity, bridge table) consisting of two FKs to the associated record in X and Y

A table that is in first normal form is better than a table in second normal form.

False

Converting to third normal form always avoids the problems related to dependencies.

False

Functional dependencies can be determined by looking at sample data

False

If B is functionally dependent on A, you can also say that B functionally determines A.

False

If the primary key of a table contains only a single column, the table is automatically in first normal form

False

In a dependency diagram, the arrows below the boxes indicate the normal dependencies.

False

Second normal form represents an improvement over ____________________ normal form

First

3 Stages of Normalization

First normal form (1NF) Second normal form (2NF) Third normal form (3NF)

Question: On which columns is QuotedPrice functionally dependent?

For any combination of an order number and a part number, there can be only one row in the OrderLine table. Thus, any combination of an order number and part number in the OrderLine table is associated with exactly one quoted price. Consequently, QuotedPrice is functionally dependent on the combination (formally called the concatenation) of OrderNum and PartNum.

Normalization

Helps eliminate redundancy by optimizing database design. Breaks things down in a systematic way to eliminate data anomalies. One table at a time. 1:1 relationship

3NF

If a table is already in 2NF, it is in 3NF if there are no determinant relationships between non-key attributes except for candidate keys.

4NF

If a table is already in 3NF, it is in 4NF if there are no potential multi-valued attributes.

partial dependency

In normalization, a condition in which an attribute is dependent on only a portion (subset) of the primary key.

Third normal form

In second normal form and has no transitional dependencies

Granularity

Level of detail represented by the values stored in a table's row

Granularity

Level of detail represented by the values stored in a table's row.

Normalization process

Objective is to ensure table conforms to the concept of well-formed relations. Necessary data is stored in one table. All nonprime attributes in a table are dependent on the primary key. Table void of insertion, update, and deletion anomalies.

Multivalued dependencies

Occurs when one key determines multiple values of two other attributes and those attributes are independent of each other.

In 1NF some tables contain:

Partial dependencies which should be used with caution

Normalization

Process for evaluating and correcting table structures to minimize data redundancies

Normalization

Process for evaluating and correcting table structures to minimize data redundancies, reducing data anomalies

Denormalization

Produces a lower normal form

Denormalization

Produces a lower normal form. Results in increased performance and greater data redundancy.

Data-modeling checklist: Business rules

Properly document and verify business rules with end users. Ensure rules are written precisely; identify attributes, relationships, and constraints. Identify source of business rules; justified, dated, and signed off by an authority.

record

Row of an entity- also called an instance of the entity

nonkey attribute

See nonprime attribute.

Normalization and database design

Should be part of the design process. Entities must meet required the normal form before creating structures.

ERD- Entity Relationship Diagram

Shows the relationships between the data

Fourth normal form

Table is in 3NF, has no multivalued dependencies. All attributes must be dependent on primary key, but independent of each other. No row may contain two or more multivalued facts about an entity.

key attribute

The attribute(s) that form(s) a primary key. See also prime attribute.

Degree of a relationship

The number of entities involved in the relationship 1. Unary - one entity 2. Binary - two entities 3. N-ary - N entities

Cardinality of a relationship

The number of records (min and max) in entity Y that can be associated with each record in entity X, and vice versa

What is the objective of normalization?

To ensure that all tables are in at least 3NF

A table that is in first normal form may contain problems that will require you to restructure it.

True

Data-modeling: ER model

Validated against expected processes: inserts, updates, and deletions. Evaluate where, when, and how to maintain a history. No redundant relationships. All that is needed is here, and all that is there is needed.

white hat hacker

Who is a hacker that works to help society?

Foreign Key

a field in a table that links to a primary key field in another table to form a relationship

An Associative entity is also called ...

an Intersection Entity or a Linking Table

Relationship

an association between entities

foreign key

an attribute that is a primary key in another table and is used to LINK tables

An alternate key is a ____. a. foreign key b. primary key c. column that could be a primary key but was not chosen d. row that could be a primary key but was not chosen

c. column that could be a primary key but was not chosen

1st Normal Form (1NF)

look at instance value for the Primary Key and it should drive every instance for the secondary instances. - for every instance of an entity, each attribute of the instance is completely determined by the instances PK - each instance attribute is completely dependent upon the PK

An attribute that is part of a key is known as a(n) ________ attribute.

prime

normalization

process that enables a database design to eliminate design problems by decomposing (or reorganizing) the existing table structure

A selection removes...

rows

minimal data redundancy and data inconsistencies

usually only stored once, so this is minimized

Unnormalized tables yield no simple strategies for creating virtual tables, which are also known as ________.

views

what you know

what authentication method does tippie college of business use for students

2nd Normal Form

•All attributes (non-key columns) are dependent on the key •Make junction tables (with primary keys from different tables) •No partial key dependencies * No non-key column is dependent on only a portion of the primary key

3rd Normal Form

•All fields (columns) can be determined only by the key in the table and no other column (attribute)

Normalization process

- Sequential application of increasingly higher standards / conditions / rules - A database is said to be in Kth normal form if all tables satisfy the first K rules: 1. *1NF: first normal form* - PK for every table - One value per field (no compound fields or lists) - One field per value (no repeating values) 2. *2NF: second normal form* - One subject per table 3. *3NF: third normal form* - No "computed" values (math, logical, or lookup) 4. *Higher normal forms* - Database theoreticians have proposed additional rules of normalization beyond 3NF to deal with specialized circumstances - In practice, few database practitioners normalize past 3NF

De-normalization

- Sometimes, a database practitioner intentionally violates a rule in an attempt to improve performance (query speed) --Pre-computed values --Intentional redundancy - This should only be done if the improved performance is critical

First normal form

- Tables without repeating groups.

Functional Dependence

- a column (attribute) B is functionally dependent on another column A (or possibly a collection of columns) when each value for A in the database is associated with exactly one value of B.

Candidate key

- a column or a collection of columns on which all columns in the table are functionally dependent. - the definition for primary key also defines a candidate key.

Functional Dependence

- a formal name for what is basically a simple idea.

Entity

- a person, place, thing, or event

Normalization

- a progression in which a table that is in first normal form is better (freer from problems) than a table that is not in first normal form, a table that is in second normal form is better than one that is in first normal form, and so on.

To correct update anomalies

- convert tables to various types of normal forms.

Normalization Process

- enables you to identify the existence of potential problems, called update anomalies, in the design of a relational database.

Relational Integrity

- if a foreign key in one table matches the primary key of another table, then values entered in the foreign key must match the value of the primary key or be null.

A table is in 1NF when (3 things):

-All key attributes are defined -No repeating groups in table -All attributes are dependent on PK

A table is in 2NF when (2 things):

-It is in 1NF and -It includes no partial dependencies

A table is in 3NF when (two things):

-It is in 2NF and -It contains no transitive dependencies

A table is in the Fourth Normal Form (4NF) when (2 things):

-It is in 3NF and -No multiple sets of multivalued dependencies 37

When can a table be in 3NF and fail to meet BCNF (2 reasons):

-No partial dependencies, nor does it contain transitive dependencies -A nonkey attribute is the determinant of a key attribute

Entity

-Something about which data needs to be kept -Can be a thing or event -Entities end up as a table in a database

Advantages of a database system

-data integration -data sharing -minimal data redundancy and data inconsistencies -data independence -cross functional analysis

66) You have been given two tables, CUSTOMER and SALE. You want to check the referential integrity constraint: SALE.CustomerNumber must exist in CUSTOMER.CustomerNumber You run the following SQL query: SELECT CustomerNumber FROM SALE WHERE CustomerNumber NOT IN (SELECT CustomerNumber FROM SALE, CUSTOMER WHERE SALE.CustomerNumber = CUSTOMER.CustomerNumber) What is shown in the results of this query? A) All values of CustomerNumber that match the constraint. B) All values of CustomerNumber that violate the constraint. C) All values of CustomerNumber where SALE.CustomerNumber = CUSTOMER.CustomerNumber. D) A and C E) B and C

/Answer: B

Things to avoid:

1. *Multi-value fields* - Divide multi-item fields/attributes into separate fields --EX. Name becomes FirstName, LastName - Simplifies user queries --EX. Find customers who live in Austin, Find customers who live in apt. - Simplifies some data validation rules --EX. Street is required but Apt. is optional 2. *Repeating fields* - Pull fields/attributes out of tables that are shared by multiple records --Reduces physical size of database and helps control data inconsistencies 3. Computed fields 4. Multi-level dependencies

First Normal Form

1. All key attributes are defined 2. There are no repeating groups in the table 3. All attributes are dependent on primary key

Dependency Diagram

1. Depicts all dependencies found within a given table structure

Advantages of Normalization:

1. Duplicate data is minimized - Reducing storage requirements - Simplifying maintenance 2. Pre-calculated values are eliminated - Reducing data requirements - Improving quality/accuracy 3. Fewer secondary indices per table (since each table is for a single "subject" - Making insert, update, and delete operations faster

What is the normalization process (4 things)?

1. Each table represents a single subject 2. No data item will be unnecessarily store d in more than one table 3. All nonprime attributes in a table are dependent on the PK 4. Each table is void of insertion, update, deletion anomalies

Normalization process

1. Each table represents a single subject 2. No data item will be unnecessarily stored in more than one table 3. All attributes in a table are dependent on the primary key 4. Each table void of insertion, update, deletion anomalies

8 issues to address, in order, to produce a good normalized set of tables:

1. Evaluate PK assignments 2. Evaluate naming conventions 3. Refine attribute atomicity 4. Identify new attributes 5. Identify new relationships 6. Refine PKs as required for data granularity 7. Maintain historical accuracy 8. Evaluate using derived attributes

4 characteristics of the Boyce-Codd Normal Form (BCNF)

1. Every determinant in table is a candidate key (which has the same characteristics of a PK but for some reason was not chosen as such) 2. When table contains only one candidate key, the 3NF and the BCNF are equivalent 3. BCNF can be violated only when table contains more than one candidate key 4. Most designers consider BCNF as a special case of 3NF

Conversion to 2NF (2 Steps)

1. Identify all key components 2. Identify the dependent attributes

Conversion to 3NF (3 Steps)

1. Identify each new determinant 2. Identify the dependent attributes 3. Remove the dependent attributes from transitive dependencies

Techniques in business intelligence

1. OLAP- online analytical processing 2. data mining

Three basic types of relationships

1. One-to-many (1:N) 2. One-to-one (1:1) 3. Many-to-many (M:N)

Advantages of databases:

1. Organized (structured) collection of data 2. Enables more effective decision making 3. Supports multiple users 4. Improved data quality 5. Well maintained: full backups and security 6. Has GUI and/or programming language for ease of use

Steps to Design a Relational Database

1. Store all data in one uniform table 2. Vary the number of columns 3. SOLUTION- a set of tables aka: relational database

3 things about improving table design:

1. Table structures should be cleaned up to eliminate initial partial and transitive dependencies 2. Normalization cannot, by itself, be relied on to make good designs 3. Valuable because it helps eliminate data redundancies

three levels of schemas

1. conceptual schema 2. external level schema 3. internal level

Basic Requirements of a Relational Database

1. every column in a row must be single valued: one value per cell 2. Primary keys cannot be null - ensures that every row represents something and can be identified aka: entity integrity rule 3. Foreign keys if not null, must have values that correspond to the value of a primary key in another table- aka: referential integrity rule 4. all nonkey attributes must describe a characteristic of the object identified by the primary key-

two approaches to database design

1. normalization 2. semantic data modeling

Summary: 1NF

1NF - each instance attribute is completely dependent upon PK

Second normal form (2NF

1NF and No partial dependencies

Assume the following table definition and dependencies. How many tables would be produced by converting to BCNF? TABLE(STU_ID, STAFF_ID, CLASS_CODE, ENROLL_GRADE) STU_ID + STAFF_ID --> CLASS_CODE, ENROLL_GRADE CLASS_CODE --> STAFF_ID

2

Third Normal Form

2NF + Has no transitive dependencies

Summary: 2NF

2NF - when concatenated PK, each instance attribute is completely dependent upon all PKs

3NF

2NF and contains no transitive dependencies

Third Normal Form

2NF and no transitive dependencies

Given the following table definition, how many tables would be produced by the conversion to 2NF? 1NF (PROJ_NUM, EMP_NUM, PROJ_NAME, EMP_NAME, JOB_CLASS, CHG_HOURS, ASSIGN_HOURS) with PROJ_NUM & EMP_NUM AS PRIMARY KEY IDENTIFIERS

3

How many steps does the normalization procedure have?

3

A table in ________ contains no transitive dependencies.

3NF

A table that is in 2NF and contains no transitive dependencies is said to be in ________.

3NF

A table where all attributes are dependent on the primary key and are independent of each other, and no row contains two or more multivalued facts about an entity, is said to be in ________.

3NF

Assume the following table definition and dependencies. In which normal form is this table? TABLE(A, B, C, D) A + B --> C, D C --> B

3NF

Best form of normalization

3NF

For most business database design purposes, ________ is as high as you need to go in the normalization process.

3NF

Most designers consider Boyce-Codd normal form (BCNF) as a special case of ________.

3NF

Objective of normalization is to ensure all tables in at least...

3NF

Tables in ________ will perform suitably in business transactional databases.

3NF

Which level of normalization is needed for business database design purposes?

3NF

Which normalization form is best-suited for business transactional databases?

3NF

Fourth Normal Form

3NF + Has no multivalued dependencies

Summary: 3NF

3NF - each instance attribute is dependent only on PK and not a secondary attribute

4NF

3NF and contains no multivalued dependencies.

Candidate Key

A Candidate Key is a field (attribute) that could have been selected as the primary key but was not.

Primary Key

A Primary Key is the field (attribute) or combination of fields (attributes) that uniquely identifies a row in a relational table.

Alternate Key

A candidate key that was not chosen to be the primary key

Determinant

A column in a table that determines at least one other column

transitive dependency

A condition in which an attribute is dependent on another attribute that is not part of the primary key.

partial dependency

A condition in which an attribute is dependent on only a portion (subset) of the primary key

Partial Dependency

A condition in which an attribute is dependent on only a portion (subset) of the primary key.

Interrelation Constraint

A constraint that involves more than one relation

Prime Attribute (SEE: KEY ATTRIBUTES)

A key attribute; that is, an attribute that is apart of a key or is the whole key.

prime attribute

A key attribute; that is, an attribute that is part of a key or is the whole key

Prime Attribute

A key attribute; that is, an attribute that is part of a key or is the whole key. See also key attributes.

Primary Key

A minimal collection of columns (attributes) in a table on which all columns are functionally dependent and that is chosen as the main direct access vehicle to individual rows. Also see candidate key

Candidate Key

A minimal collection of columns (attributes) in a table on which all columns are functionally dependent but that has not necessarily been chosen as the primary key

Normalization First Normal Form (1NF)

A model is in 1NF if, for every instance of an entity, each attribute of the instance is completely determined by the instance's PK In other words, 1NF holds when there are no instance attributes that have multiple values

Partial Dependency

A non-key characterisitc is dependent on only part of the key, not the whole thing. To eliminate, pull out the elements that are dependent and put them in their own table.

Transitive dependency

A nonkey attribute functionally depends on another nonkey attribute

denormalization

A process by which a table is changed from a higher level normal form to a lower level normal form. Usually done to increase processing speed. Potentially yields data anomalies.

Denormalization

A process by which a table is changed from a higher-level normal form to a lower-level normal form, usually to increase processing speed. Denormalization potentially yields data anomalies.

denormalization

A process by which a table is changed from a higher-level normal form to a lower-level normal form, usually to increase processing speed. This process potentially yields data anomalies

Denormalization

A process by which a table is charged from a higher-level normal form to a lower-level normal form, usually to increase processing speed. Denormalization potentially yields data anomalies.

normalization

A process that assigns attributes to entities in such a way that data redundancies are reduced or eliminated.

normalization

A process that assigns attributes to entities so that data redundancies are reduced or eliminated

Normalization

A process that assigns attributes to entities so that data redundancies are reduced or eliminated.

atomicity

A property of transactions that states that all parts of a transaction must be treated as a single logical unit of work in which all operations must be completed (committed) to produce a consistent database.

ER diagrams (ERDs)

A standardized set of notation for representing entities (attributes, PKs, and FKs) and the relationships between them

surrogate key

A system-assigned primary key, generally numeric and auto-incremented.

Note:

A table (relation) is in first normal form (1NF) when it does not contain repeating groups.

table

A table in a relational database stores information about only one entity.

3NF

A table is in 3NF when it is in 2NF and no nonkey attribute is functionally dependent on another nonkey attribute; that is, it cannot include transitive dependencies

Third Normal Form (3NF)

A table is in 3NF when it is in 2NF and no nonkey attribute is functionally dependent on another nonkey attribute; that is, it cannot include transitive dependencies.

third normal form (3NF)

A table is in 3NF when it is in 2NF and no nonkey attribute is functionally dependent on another nonkey attribute; that is, it cannot include transitive dependencies.

4NF

A table is in 4NF if it is in 3NF and contains no multiple independent sets of multivalued dependencies

Fourth Normal Form (4NF)

A table is in 4NF if it is in 3NF and contains no multiple independent sets of multivalued dependencies.

First Normal Form (1NF)

A table is in first normal form if it does not contain a repeating group

1NF

A table is in first normal form if it does not contain any potential multi-valued attributes. Business rules determine if an attribute is potentially multi-valued e.g. if a table stores three addresses for a single customer, the attribute is potentially multi-valued

Associative Entities

An associative entity is a relationship turned into an entity

Nonkey Attribute

An attribute (column) that is not part of the primary key

Foreign key

An attribute or set of attributes for a "child" instance/row of an entity that identifies the unique "parent" instance/row 1. The parent can either be in the same of different entity than the child 2. FKs can be null under certain conditions (acceptable orphans) 3. Typically the foreign key of the child identifies the primary key of the parent --But exceptions are possible

atomic attribute

An attribute that cannot be further subdivided to produce meaningful components. For example, a person's last name attribute cannot be meaningfully subdivided into other name components; therefore, the last name attribute is atomic.

Atomic Attribute

An attribute that cannot be further subdivided to produce meaningful components. For example, a person's last name attribute cannot be meaningfully subdivided.

Non-prime Attribute

An attribute that is not part of a key.

Nonprime attribute

An attribute that is not part of a key.

nonprime attribute

An attribute that is not part of a key.

78) A table designed to store PhoneNumber01, PhoneNumber02 and PhoneNumber03 contains ________. A) the multivalued, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: A Diff: 1 Page Ref: 168-170

72) A classic example of unneeded normalization is when we are dealing with ________. A) ZIP codes B) sales orders and line items C) association patterns D) multivalued dependencies E) general purpose remarks columns

Answer: A Diff: 2 Page Ref: 164-165

79) A form of multivalued dependency is found in ________. A) the multivalued, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: A Diff: 2 Page Ref: 168-170

67) The advantages of normalization include ________. A) the elimination of modification anomalies B) the elimination of duplicated data C) more complex SQL for multitable subqueries and joins D) A and B E) A, B, and C

Answer: A Diff: 3 Page Ref: 162 Fig 4-2

99) What are null values, and why are they a problem?

Answer: A null value is a missing value, a value that has never been input into a cell in a table. Null values are a problem because they are ambiguous. A null value can represent three different conditions: (1) the value is inappropriate, (2) the value is appropriate but unknown, or (3) the value is appropriate and known but not entered into the database. Diff: 1 Page Ref: 171-172

63) The SQL function COUNT ________. A) counts the number of columns in a table B) counts the number of rows in a table C) counts the number of tables in a database D) A and C E) B and C

Answer: B Diff: 1 Page Ref: 161

65) The SQL keyword TOP ________. A) limits the number of columns retrieved from a table B) limits the number of rows retrieved from a table C) limits the number of tables retrieved from a database D) A and C E) B and C

Answer: B Diff: 1 Page Ref: 161

75) For a number of reasons, ________ is not often an advantage for a read-only database. A) updating B) normalization C) denormalization D) A and B E) B and C

Answer: B Diff: 1 Page Ref: 165-166

81) You are creating a PRODUCT table using existing data from multiple sources. Examining the data, you find that you have "large red hat", "large hat, red", "red hat large" and "hat, large, red." This is an example of ________. A) the multivalue, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: B Diff: 1 Page Ref: 168-170

82) The problem of misspelled data entries is really the same as ________. A) the multivalue, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: B Diff: 1 Page Ref: 170-172

86) A missing value is called a(n) ________. A) empty value B) null value C) missing value D) Any of A, B, or C can be used. E) None of the above is correct.

Answer: B Diff: 1 Page Ref: 171

64) To limit the number of rows retrieved from a table, use the SQL construct ________. A) SELECT * B) SELECT TOP n * C) SELECT COUNT(TOP n) D) SELECT COUNT(*) E) SELECT COUNT *

Answer: B Diff: 2 Page Ref: 161

80) When a table is created using existing data from multiple sources, you are likely to find that the different sources code data in slightly different ways. This is an example of ________. A) the multivalued, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: B Diff: 2 Page Ref: 168-170

What is shown in the results of this query? A) All values of CustomerNumber that match the constraint. B) All values of CustomerNumber that violate the constraint. C) All values of CustomerNumber where SALE.CustomerNumber = CUSTOMER.CustomerNumber. D) A and C E) B and C

Answer: B Diff: 3 Page Ref: 161

69) The presence of one or more foreign keys in a relation prevents ________. A) the elimination of modification anomalies B) the elimination of duplicated data C) more complex SQL for multitable subqueries and joins D) A and B E) A, B, and C

Answer: B Diff: 3 Page Ref: 162

68) The disadvantages of normalization include ________. A) the elimination of modification anomalies B) the elimination of duplicated data C) more complex SQL for multitable subqueries and joins D) A and B E) A, B, and C

Answer: C Diff: 1 Page Ref: 162 Fig 4-2

59) The second step in assessing table structure includes ________. A) counting rows B) examining columns C) examining data values D) A and B E) B and C

Answer: C Diff: 2 Page Ref: 161-162 Fig 4-1

76) ________ is the process of joining two or more tables and storing the result as a single table. A) Querying B) Normalization C) Denormalization D) A and B E) B and C

Answer: C Diff: 2 Page Ref: 166

83) When a table is created using existing data from multiple sources, you are likely to find that some data values have never been provided. This is an example of ________. A) the multivalued, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: C Diff: 2 Page Ref: 171-172

84) When a table is created using existing data from multiple sources, you are likely to find that some data values have never been provided. This is an example of ________. A) the multivalued, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: C Diff: 2 Page Ref: 171-172

85) Reviewing the work done on a table that was created using existing data from multiple sources, you are likely to find that some data values that were provided were never entered into the table. This is an example of ________. A) the multivalued, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: C Diff: 2 Page Ref: 171-172

88) To check for null values in a column in a table, use the SQL phrase ________. A) IS B) IS NOT C) IS NULL D) COUNT(IS NOT) E) COUNT(IS NULL)

Answer: C Diff: 2 Page Ref: 171-172

89) The SQL keyword IS NULL can be used to ________. A) count the number of columns in a table B) count the number of rows in a table C) count the number of null values in a column D) A and C E) B and C

Answer: C Diff: 2 Page Ref: 171-172

58) The first step in assessing table structure includes ________. A) counting rows B) examining columns C) examining data values D) A and B E) B and C

Answer: D Diff: 2 Page Ref: 161-162 Fig 4-1

70) Anomalies caused by functional dependencies can be eliminated by putting tables into ________. A) 1NF B) 2NF C) 3NF D) BCNF E) 4NF

Answer: D Diff: 2 Page Ref: 163-164

71) The defining characteristic of BCNF is that a table is in BCNF if ________. A) all rows are unique B) all columns are consistent C) the primary key is a candidate key D) all determinants are candidate keys E) all candidate keys are determinants

Answer: D Diff: 2 Page Ref: 163-164

73) Unlike the anomalies from functional dependencies, the anomalies from ________ are so serious that they should always be eliminated. A) ZIP codes B) sales orders and line items C) association patterns D) multivalued dependencies E) general purpose remarks columns

Answer: D Diff: 2 Page Ref: 165

87) A null value can indicate which of the following conditions? A) The value is inappropriate. B) The value is appropriate but unknown. C) The value is appropriate and known, but not entered into the database. D) A, B, and C E) None of the above is correct.

Answer: D Diff: 2 Page Ref: 171

90) Reviewing the work done on a table that was created using existing data from multiple sources, you find that a column name Remarks has been included, and it is populated with inconsistent and verbose verbal data. This is an example of ________. A) the multivalued, multicolumn problem B) the inconsistent values problem C) the missing values problem D) the general-purpose remarks column problem E) None of the above is correct.

Answer: D Diff: 2 Page Ref: 172-173

62) To count the number of rows in a table, use the SQL construct ________. A) SELECT * B) SELECT TOP n * C) SELECT COUNT(TOP n) D) SELECT COUNT(*) E) SELECT COUNT *

Answer: D Diff: 3 Page Ref: 161

96) Explain denormalization, and why it may be appropriate to denormalize a set of tables.

Answer: Denormalization is the process of taking a normalized set of relations and converting them into a smaller set of relations that is in a lower normal form and thus vulnerable to a greater number of anomalies. While the fully normalized set of relations is more desirable from a theoretical perspective, it does have some disadvantages. The greater the number of relations in the database, the more work that is required by the database whenever data must be retrieved from multiple relations to reconstruct the users' view of the data. In some circumstances, the anomalies introduced by denormalizing a data model may be considered an acceptable cost for the performance improvement from reducing the overhead work performed by the database. Diff: 2 Page Ref: 164-165

74) Read-only databases are used for ________. A) updating B) querying C) reporting D) A and B E) B and C

Answer: E Diff: 1 Page Ref: 165

60) During the second step of assessing table structure, you are trying to determine ________. A) multivalued dependencies B) functional dependencies C) foreign keys D) A and B E) A, B, and C

Answer: E Diff: 2 Page Ref: 161-162 Fig 4-1

61) During the second step of assessing table structure, you are trying to determine ________. A) primary keys B) candidate keys C) foreign keys D) A and B E) A, B, and C

Answer: E Diff: 2 Page Ref: 161-162 Fig 4-1

77) An advantage of denormalization is ________. A) faster updating B) faster querying C) less complex SQL in application code D) A and B E) B and C

Answer: E Diff: 3 Page Ref: 166-167

4) To determine the number and type of columns in a table, use the SQL construct COUNT(*).

Answer: FALSE Diff: 1 Page Ref: 161

12) When examining data values as a part of assessing table structure, there is no need to try to determine candidate keys other than the table's primary key.

Answer: FALSE Diff: 1 Page Ref: 161-162 Fig 4-1

13) When examining data values as a part of assessing table structure, there is no need to try to determine foreign keys.

Answer: FALSE Diff: 1 Page Ref: 161-162 Fig 4-1

32) Design guidelines and priorities are the same whether you're working with an updatable database or a read-only database.

Answer: FALSE Diff: 1 Page Ref: 165-166

43) When you are creating a database from existing data, you will have only minor problems with inconsistent values.

Answer: FALSE Diff: 1 Page Ref: 170-171

54) General-purpose remarks columns rarely contain important data.

Answer: FALSE Diff: 1 Page Ref: 172-173

3) To count the number of rows in a table, use the SQL construct COUNT(ROWS).

Answer: FALSE Diff: 2 Page Ref: 161

24) The SQL DELETE TABLE statement can be used to remove unneeded tables after the normalized tables are created and populated.

Answer: FALSE Diff: 2 Page Ref: 164

31) Read-only databases are often updated.

Answer: FALSE Diff: 2 Page Ref: 165-166

33) Normalization is an advantage for a read-only database.

Answer: FALSE Diff: 2 Page Ref: 165-166

37) Read-only databases seldom use more than one copy of a set of same data.

Answer: FALSE Diff: 2 Page Ref: 167

45) The problem of misspelled data entries is an entirely different problem than the inconsistent values problem.

Answer: FALSE Diff: 2 Page Ref: 170-171

52) Most DBMS products will let you define a primary key on a column that contains null values.

Answer: FALSE Diff: 2 Page Ref: 171-172

17) Normalization eliminates modification anomalies and data duplication.

Answer: FALSE Diff: 3 Page Ref: 162 Fig 4-2

29) To eliminate multivalued dependencies, normalize your tables so that they are all in BCNF.

Answer: FALSE Diff: 3 Page Ref: 165

91) Describe the steps in assessing table structure when you are given a set of tables and asked to construct a new database to store the existing data.

Answer: First, you should count the number of rows in the tables and determine the type of data stored in each column. Second, you should analyze the existing data values and work with users to determine (1) multivalued dependencies, (2) functional dependencies, (3) candidate keys, (4) primary keys and (5) foreign keys. Finally, you should check the validity of the possible referential integrity constraints. Diff: 1 Page Ref: 161-162 Fig 4-1

100) What are general-purpose remarks columns, and why are they a problem?

Answer: General-purpose remarks columns are columns in a table with labels such as "Remarks", "Comments" or "Notes." They are a problem because these columns often contain important data that is stored in an inconsistent, verbal and verbose way. Further, they may contain multiple data items. Diff: 1 Page Ref: 172-173

98) What are inconsistent values, and why are they a problem?

Answer: Inconsistent values are different forms of the same data value. For example, in a database of dog owners and their dogs, we might find a border collie listed as "b. collie", "collie, border", and "border collie". This will create query problems. For example, a query for "border collie" will not return the dogs listed in the other formats. Or, alternatively, we would have to write a query that includes all possible (or at least known) variations of "border collie," and that would be a real pain. Diff: 1 Page Ref: 170-171

1) When you are given a set of tables and asked to create a database to store their data, the first step is to assess the tables' structure and content.

Answer: TRUE Diff: 1 Page Ref: 161

5) To limit the number of rows retrieved from a table, use the SQL TOP keyword.

Answer: TRUE Diff: 1 Page Ref: 161

10) When examining data values as a part of assessing table structure, you should try to determine multivalued dependencies.

Answer: TRUE Diff: 1 Page Ref: 161-162 Fig 4-1

11) When examining data values as a part of assessing table structure, you should try to determine the table's primary key.

Answer: TRUE Diff: 1 Page Ref: 161-162 Fig 4-1

27) Multivalued dependencies create anomalies so serious that multivalued dependencies must always be eliminated.

Answer: TRUE Diff: 1 Page Ref: 165

30) Creating a read-only database is a job often given to beginning database professionals.

Answer: TRUE Diff: 1 Page Ref: 165-166

36) Denormalization is simple—join the data together and store it in a table.

Answer: TRUE Diff: 1 Page Ref: 166-167

39) Multivalued dependencies show up under a different name as the multivalued, multicolumn problem.

Answer: TRUE Diff: 1 Page Ref: 168-170

40) The multivalued, multicolumn problem occurs when a set of columns are used to store data that should actually be in one column.

Answer: TRUE Diff: 1 Page Ref: 168-170

41) One situation created by the multivalued, multicolumn problem is that the maximum number of data values for an attribute is limited.

Answer: TRUE Diff: 1 Page Ref: 168-170

42) If you have a table with a set of columns named "Child01", "Child02" and "Child03", the table has the multivalued, multicolumn problem.

Answer: TRUE Diff: 1 Page Ref: 168-170

46) You are creating a BOAT table using existing data from multiple sources, and you find that you have "power boat blue", "boat, power, blue" and "blue power boat" as data values for the same column. This is an example of the inconsistent values problem.

Answer: TRUE Diff: 1 Page Ref: 170-171

47) A missing value is called a null value.

Answer: TRUE Diff: 1 Page Ref: 171

48) Null values are a problem because they are ambiguous.

Answer: TRUE Diff: 1 Page Ref: 171

50) A null value in a column may indicate that there is an appropriate value for that attribute, but it is unknown.

Answer: TRUE Diff: 1 Page Ref: 171

53) The SQL IS NULL keyword can be used to count the number of nulls in a column.

Answer: TRUE Diff: 1 Page Ref: 171-172

2) The first step in assessing table structure is to count rows and examine columns.

Answer: TRUE Diff: 2 Page Ref: 161 Fig 4-1

14) The third step in assessing table structure is to check the validity of presumed referential integrity constraints.

Answer: TRUE Diff: 2 Page Ref: 161-162 Fig 4-1

6) When examining data values as a part of assessing table structure, you should try to determine two types of dependencies: functional dependencies and functional dependencies.

Answer: TRUE Diff: 2 Page Ref: 161-162 Fig 4-1

7) When examining data values as a part of assessing table structure, you should try to determine three types of keys: the primary key, any candidate keys and any foreign keys.

Answer: TRUE Diff: 2 Page Ref: 161-162 Fig 4-1

8) The second step in assessing table structure is to examine data values and determine dependencies and keys.

Answer: TRUE Diff: 2 Page Ref: 161-162 Fig 4-1

16) Database design varies depending on whether you're building an updatable database or a read-only database.

Answer: TRUE Diff: 2 Page Ref: 162

18) The presence of one or more foreign keys in a relation means that we cannot eliminate duplicated data in that table.

Answer: TRUE Diff: 2 Page Ref: 162

19) Normalization requires applications to use more complex SQL since they will need to write subqueries and joins to recombine data stored in separate relations.

Answer: TRUE Diff: 2 Page Ref: 162 Fig 4-2

22) A defining requirement for BCNF is that every determinant must be a candidate key.

Answer: TRUE Diff: 2 Page Ref: 163

23) The SQL INSERT statement can be used to populate normalized tables.

Answer: TRUE Diff: 2 Page Ref: 163

25) SQL statements that can be used to create referential integrity statements for normalized tables are created during the normalization process.

Answer: TRUE Diff: 2 Page Ref: 164

28) Writing SQL subqueries and joins against normalized tables is simple compared to the code that must be written to handle anomalies from multivalued dependencies.

Answer: TRUE Diff: 2 Page Ref: 165

35) Denormalization reduces the complexity of the SQL statements needed in an application to read required data.

Answer: TRUE Diff: 2 Page Ref: 166-167

38) Read-only databases often use several copies of a set of the same data, where each copy is modified for a specific use.

Answer: TRUE Diff: 2 Page Ref: 167

44) An inconsistent values problem is created when different users have coded the same data entries differently.

Answer: TRUE Diff: 2 Page Ref: 170-171

51) A null value in a column may indicate that there is an appropriate value for that attribute, and although the value is known no one has entered the value into the database.

Answer: TRUE Diff: 2 Page Ref: 171

55) The problem with a general-purpose remarks column is that the data it contains are likely to be verbal, inconsistent and stored in a verbose manner.

Answer: TRUE Diff: 2 Page Ref: 172-173

56) If you see a column name Notes, it is likely that this is a general-purpose remarks column.

Answer: TRUE Diff: 2 Page Ref: 172-173

15) The elimination of modification anomalies and the reduction of duplicated data are advantages of normalization.

Answer: TRUE Diff: 3 Page Ref: 162 Fig 4-2

20) Relations are sometimes left unnormalized to improve performance.

Answer: TRUE Diff: 3 Page Ref: 164-165

21) Relations in BCNF have no modification anomalies in regard to functional dependencies.

Answer: TRUE Diff: 3 Page Ref: 165

95) What are the advantages and disadvantages of normalization?

Answer: The advantages of normalization are that it (1) eliminates modification anomalies and (2) reduces duplicated data. Reducing duplicated data will eliminate data integrity problems and save file space. The disadvantages of normalization are that it (1) will require application developers to write more complicated SQL statements for multitable subqueries and joins, and (2) may slow down the applications running against the database. Diff: 2 Page Ref: 162 Fig 4-2

97) What is the multivalue, multicolumn problem, and why is it a problem?

Answer: The multivalue, multicolumn problem occurs when a table used multiple columns to store multiple values of an attribute. For example, a database of dog owners might have a table with the structure: DOG_OWNER (OwnerID, LastName, FirstName, DogName01, DogName02, DogName03) This is a problem because (1) the number of DogNames is fixed, and (2) this structure creates query problems. In fact, this structure is another form of the multivalued dependency, and requires a similar solution-a DOG table should be created to store the DogNames and with a foreign key to link the dogs to their owner. Diff: 1 Page Ref: 168-170

94) When accessing table structure, and determining the number and types of columns in a table, how can you control how many rows are returned?

Answer: When determining the number and types of how many rows are in a table, use a SELECT query that contains the SQL SELECT TOP ## expression, where ## is the number of rows that you want returned. This function will return only the number of rows specified by the TOP expression, which can make a table with a large number of rows manageable. Diff: 1 Page Ref: 161

determinant

Any attribute in a specific row whose value directly determines other values in that row. See also Boyce-Codd normal form (BCNF).

Determinant

Any attribute that you can use to determine the values assigned to other attributes.

Determinant

Any attribute whose value determines other values within a row

First normal form

Any table that is a relation Single values, all elements in a column must be of the same type,each column has a unique name, no identical rows

Step 2 of conversion to 2NF:

Assign corresponding dependent attributes

Generalized functional dependence

Attribute A determines attribute B if all of the rows in the table that agree in value for A also agree for B.

Functional dependence

Attribute B is fully functionally dependent on the attribute A if each value of A determines one and only one value of B.

Atomic Attribute

Attribute cannot be further subdivided

A table where every determinant is a candidate key is said to be in ________.

BCNF

BCNF

BCNF is used to control anomalies.

Boyce-Codd normal form (BCNF)

Boyce-Codd正常形式(BCNF)

Which of the following normal forms is mostly of theoretical interest?

DKNF

DBA

Database Administrator or Database Administration

Dependency diagam

Depicts all dependencies found within a given table structure and is helpful to get a bird's-eye view of all the relationships among table attributes. Makes it less likely that you will overlook an important dependency.

Dependency diagram

Depicts all dependencies found within given table structure. Gets an overview of all relationships among table's attributes. Makes it less likely that an important dependency will be overlooked.

Repeating group

Derives its name from the fact that a group of multiple entries of the same type can exist for any single key attribute occurrence.

Principles and normalization procedures

ERD is created through an iterative process. Normalization focuses on the characteristics of specific entities.

1. One-to-many (1:N)

Each record in X can be associated with many records in Y, but each record in Y can only be associated with one record in X - X is the "parent" and Y is the "child" - Typical implementation: --Put a FK in Y to identify the associated record in X -->Add a FK to the child-side of the relationship to point to the parent-side

2. One-to-one (1:1)

Each record in X can only be associated with one record in Y, and vice versa - Typical implementation: 1. Put a FK in Y to identify the associated record in X - Add a FK to one side of the relationship to point to the other side 2. And define a unique index on the FK to eliminate duplicates - The unique index transforms a standard 1:N relationship into 1:1 relationship

Normalization

Evaluate and correct table structures to minimize data redundancies. Reduce data anomalies. Assigns attributes to tables based on determination.

Tables that are in second normal form do not contain problems

False

3rd normal Form (3NF)

For every instance on an entity, each secondary attribute is dependent online on the PK, not on any other secondary attribute. - Each instance attribute is dependent only on PK and not a secondary attribute.

Partial dependency

Functional dependence in which the determinant is only part of the primary key. Assumption-one candidate key, straight forward, easy to identify.

Step 3 of conversion to 1NF:

Identify all dependencies (depicted with a diagram)

Data-modeling checklist: relationships

Identify relationship participants. Define participation, connectivity, and document cardinality.

Step 2 of conversion to 1NF:

Identify the primary key (must unique ID attribute value)

Normalization starts by:

Identifying the dependencies of a relation/table. Progressively breaking the relation into new set of relations.

2NF

If a table is in 1NF and has a single attribute as its primary key then it is automatically in 2NF - or - If a table is already in 1NF and its primary key consists of more than one attribute, it is in 2NF if all non-key attributes are dependent on the entire primary key

Fully functional dependence: composite key

If attribute B is functionally dependent on a composite key A but not on any subset of that composite key, the attribute B is fully functionally dependent on A.

Prime

If part of any key

Repeating Group

In a relation, a characteristic describing a group of multiple entries of the same type for a single key attribute occurrence. EX: a car can have multiple colors for its top, interior, bottom, trim and so on.

repeating group

In a relation, a characteristic describing a group of multiple entries of the same type for a single key attribute occurrence. For example, a car can have multiple colors for its top, interior, bottom, trim, and so on

Repeating Group

In a relation, a characteristic describing a group of multiple entries of the same type for a single key attribute occurrence. For example, a car can have multiple colors for its top, interior, bottom, trim, and so on.

repeating group

In a relation, a characteristic describing a group of multiple entries of the same type that exist for a single key attribute occurrence. For example, a car can have multiple colors (top, interior, bottom, trim, and so on).

Multidetermine

In a table with columns A, B, and C, A multidetermines B if each value for A is associated with a specific collection of values for B and, further, this collection is independent of any values for C

Multidependent

In a table with columns A, B, and C, B is multi-dependent on A if each value for A is associated with a specific collection of values for B and, further, this collection is independent of any values for C

Multivalued Dependence

In a table with columns A, B, and C, there is a multivalued dependence of column B on column A (also read as "B is multidependent on A" or "A multidetermines B"), if each value for A is associated with a specific collection of values for B and, furthermore, this collection is independent of any values for C

Fourth normal form

In bcnf and has no multi-valued dependencies

Note

In general, when converting a table that is not in first normal form to first normal form, the primary key will usually include the original primary key concatenated with the key to the repeating group, which is the column that distinguishes one occurrence of the repeating group from another on a given row in the table

First Normal Form (1NF) Second Normal Form (2NF) Third Normal Form (3NF) Fourth Normal Form (4NF)

Most common normal forms

Does an ERD show how data is converted or processed to information in an Information System?

No

Entity Integrity

No field that is part of the primary key can accept null values.

First normal form (1NF)

No repeating roups and PK identified.

Question: Is OrderNum the primary key for the OrderLine table?

No, because it does not uniquely determine NumOrdered or QuotedPrice. The order number 21617, for example, appears on a row in the OrderLine table in which the number ordered is 2 and the quoted price is $794.95 and on a row in which the number ordered is 4 and the quoted price is $150.00.

Question: Is Class the primary key for the Part table?

No, because the other columns are not functionally dependent on the class. The item class HW, for example, appears on a row in the Part table in which the part number is AT94, a row in which the part number is DL71, and a row in which the part number is FD21. The item class HW is associated with three part numbers, so the part number is not functionally dependent on the class.

Question: Is QuotedPrice functionally dependent on PartNum?

No. A given part number, such as DR93, can occur on more than one row in the OrderLine table. The quoted price can be different on each row. With current data, the two rows on which the part number is DR93 actually have the same price, $495.00. W

Question: Is the combination of PartNum and Description the primary key for the Part table?

No. It is true that this combination functionally determines all columns in the Part table. PartNum alone, however, also has this property.

Question: In the OrderLine table, is QuotedPrice functionally dependent on OrderNum?

No. Order number 21617, for example, occurs on a row in which the quoted price is $794.95 and on another row in which the quoted price is $150.00. Thus, an order number can be associated with more than one quoted price.

Question: In the Customer table, is CustomerName functionally dependent on RepNum?

No. Rep number 20, for example, occurs on a row in which the customer name is Al's Appliance and Sport, on a row in which the customer name is Kline's, and on a row in which the customer name is All Season. Thus, a rep number can be associated with more than one customer name.

atomicity

Not being able to be divided into smaller units

Atomicity

Not being able to be divided into smaller units.

Non-prime

Not part of any key

Step 2 of conversion to 3NF:

Reassign corresponding dependent attributes (identify attributes dependent on each determinant ID'd in Step 1 and name the table to reflect its content and function)

Instance

Record, or row, in a table is called an Instance of the Entity

Normalizing table structure will:

Reduce data redundancies

granularity

Refers to the level of detail represented by the values stored in a table's row. Data stored at their lowest level of granularity are said to be atomic data.

ρ

Renaming Rename relations or columns ex. ρ vessels(vid, handle, shade) (Boats) Boats(bid, name, color) --> Vessels(vid, handle, shade) ex. ρ Barges (Boats) Boats --> Barges

Relational table must not contain:

Repeating groups

Normal Form

See first normal form, second normal form, third normal form, and fourth normal form.

σ

Selection ex. σ (viewers > 3M) Episodes gives subset σc(R) can have at most as many rows as R

Entity

Something you keep data on (a table)

Normalization

Technique used to design tables in which data redundancies are minimized

key attributes

The attributes that form a primary key

Key Attribute (SEE: PRIME ATTRIBUTE)

The attributes that form a primary key.

Key Attributes

The attributes that form a primary key. See also prime attribute.

Concatenation

The combination of two or more rows in an operation, such as join, or the combination of two or more columns for a primary key field to uniquely identify a given row in the table

QBE

The database acronym QBE stands for Query by Example.

SQL

The database acronym SQL stands for Structured Query Language.

1NF

The first stage in the normalization process. It describes a relation depicted in tabular format, with no repeating groups and a primary key identified. All nonkey attributes in the relation are dependent on the primary key

First Normal Form (1NF)

The first stage in the normalization process. It describes a relation depicted in tabular format, with no repeating groups and a primary key identified. All nonkey attributes in the relation are dependent on the primary key.

first normal form (1NF)

The first stage in the normalization process. It describes a relation depicted in tabular format, with no repeating groups and with a primary key identified. All nonkey attributes in the relation are dependent on the primary key.

Granularity

The level of detail represented by the values stored in a table's row. Data stored at its lowest level of granularity is said to be "atomic data".

granularity

The level of detail represented by the values stored in a table's row. Data stored at its lowest level of granularity is said to be atomic data

Granularity

The level of detail represented by the values stored in a table's row. Data stored at its lowest level of granularity is said to be atomic data.

Normalization

The process of eliminating redundant and interdependent data from a database in order to improve consistency, maintainability, storage efficiency and scalability - Does not necessarily mean database simplification or intuitiveness

second normal form (2NF)

The second stage in the normalization process in which a relation is in 1NF and there are no partial dependencies (dependencies in only part of the primary key).

2NF

The second stage in the normalization process, in which a relation is in 1NF and there are no partial dependencies (dependencies in only part of the primary key)

Second Normal Form (2NF)

The second stage in the normalization process, in which a relation is in 1NF and there are no partial dependencies (dependencies in only part of the primary key).

Anomalies

Things that could happen/ occur

By splitting relations to achieve third normal form tables, you create the need to express interrelation constraints.

True

Highest level of normalization is not always most desirable

True

In general, when converting a non-first normal form table to first normal form, the primary key will usually include the original primary key concatenated with the key to the repeating group.

True

Normalization works one relation at a time

True

Relational table must not contain repeating groups

True

Removal of repeating groups is the starting point in the quest to create tables that are as free of problems as possible.

True

Some tables contain partial dependencies

True

The most recent definition of third normal form is referred to as the Boyce-Codd normal form.

True

The normalization process used to convert a relation or collection of relations to an equivalent collection of third normal form tables is a crucial part of the database design process.

True

The primary key in a table will be a determinant.

True

To correct update anomalies in a database, tables must be converted into various types of normal forms.

True

Need for normalization

Used while designing a new database structure. Analyze the relationship among the attributes within each entity. Determine if structure can be improved. Improves existing data structure creates appropriate database design.

Deletion Anomaly (Question)

What if Judy resigns from her job? What happens to the Sludge Floatation and N2O generation projects?

Insertion Anomaly (Question)

What if a new project is negotiated when there is no employee assigned to it yet? What if a new employee joins who is not yet assigned to any project?

Deletion Anomaly

What if someone needs to be removed from the system? etc.

Update Anomaly (Question)

What if the name of the project PPV5678 is changed to Sludge disposal?

malware

What is the general term for software intended to penetrate

zombie

What is the term used to describe a computer that has been taken over?

phishing

What is the term used to describe email sent to get secure information?

four

What many entities would there be once the data is in a 3NF database

Non-PK attribute instance value determines the instance value of another non-PK attribute in the same entity

What you Violate at 3NF ERD?

When to use a surrogate key?

When PK is considered to be unsuitable

online forms, clickstreams, cookies

When is private data collected during web activities even when no private information is entered?

attempting to join fields from the wrong direction

When joins can't be set under relationship, what is the problem?

section id and student id

Which of these would be the primary key for the registration table

Question: Is the combination of OrderNum and PartNum the primary key for the OrderLine table?

Yes, because all columns are functionally dependent on this combination. Any combination of an order number and a part number occurs on only one row in the OrderLine table and is associated with only one value for NumOrdered and only one value for QuotedPrice. Further, neither OrderNum nor PartNum alone has this property. For example, order number 21617 appears on more than one row, as does part DR93.

Question: Is CustomerNum the primary key for the Customer table?

Yes, because customer numbers are unique. A given customer number cannot appear on more than one row. Thus, each customer number is associated with a single name, a single street, a single city, a single state, a single zip code, a single balance, a single credit limit, and a single rep number. In other words, all columns in the Customer table are functionally dependent on CustomerNum.

internal level schema

a low level view of the database; describes how the data are stored and accessed including record layouts, definitions, addressed, indexes

The benefits of Normalization are ... (4)

a model that better represents the actual business situation a model with less redundancy in the data a model which allows easy update of data a model that allows for accurate query results

Normalization is ...

a step by step process that enables a database designer to eliminate design problems by decomposing (or reorganizing) the existing table structures

If B (an attribute) is functionally dependent on A, we can also say that ____. a. A functionally determines B b. A functionally determines another attribute in the table c. B functionally determines A d. B does not determine any other attribute

a. A functionally determines B

The fact that column B is functionally dependent on column A can be written as ____. a. A ® B b. B ® A c. A ® ® B d. B ® ® A

a. A ® B

Which of the following contains a repeating group? a. Orders (OrderNum, OrderDate, (ItemNum, NumOrdered) b. Orders (OrderNum, OrderDate, ItemNum, NumOrdered ) c. Orders (OrderNum, OrderDate) d. Orders (OrderNum, ItemNum, NumOrdered)

a. Orders (OrderNum, OrderDate, (ItemNum, NumOrdered) )

Another name for a nonkey column is a ____. a. nonkey attribute b. key attribute c. nonkey row d. key table

a. nonkey attribute

Second normal form can be defined as a table that is in first normal form but that contains no ____. a. partial dependencies b. alternate keys c. nonkey columns d. interrelation constraints

a. partial dependencies

A table is in first normal form if it does not contain ____. a. repeating groups b. a foreign key c. a primary key d. alternate keys

a. repeating groups

A table that displays data redundancies yields ________.

anomalies

To put an ERD into 1NF, add ...

another entity and a 1:M relationship to it

To put an ERD into 2NF, add ...

another entity and a 1:M relationship to it

Non-Key Attribute

any attribute that is not part of the primary key

normalization

assumes that everything is initially stored in one large table Rules are followed to decompose that initial table into third normal form (3NF)

If all Singleton keys...

at least in 2NF, but can still violate 3NF

If there is more than one possible choice for the primary key, and one of the possibilities is chosen to be the primary key, the others are referred to as ____. a. canceled keys b. alternate keys c. nonkey attributes d. contributory keys

b. alternate keys

From all the ____ keys, one is chosen to be the primary key. a. alternate b. candidate c. functional d. normal

b. candidate

To convert a table to fourth normal form, split the third normal form table into separate tables, each containing the column that ____ the others. a. determines b. multidetermines c. defines d. identifies

b. multidetermines

A table that contains a repeating group is called a(n) ____. a. normalized relation b. unnormalized relation c. nominal relation d. non-nominal relation

b. unnormalized relation

data definition language (DDL)

builds the data dictionary, creates a database, described logical views for each user and specifies records or field security constraints

A table is in fourth normal form when it is in third normal form and there are no ____. a. alternate keys b. foreign keys c. multivalued dependencies d. primary dependencies

c. multivalued dependencies

Partial dependencies are dependencies on only a portion of the ____. a. nonkey column b. first column or attribute c. primary key d. index

c. primary key

The ____ is a column (or collection of columns) A such that all other columns are functionally dependent on A and no subcollection of the columns in A has this property. a. functional key b. composite key c. primary key d. declared key

c. primary key

The definition for ____ also defines a candidate key. a. functional key b. repeating group c. primary key d. nonkey column

c. primary key

32. ____ normal form has an additional condition that the only determinants the table contains are candidate keys. a. First b. Second c. Third d. Fourth

c. third

BCNF can be violated only if the table contains more than one ________ key.

candidate

When a table contains only one ________ key, the 3NF and the BCNF are equivalent.

candidate

data manipulation language (DML)

changes database content, including updating the elements, insertions, and deletions

A projection removes...

columns

A ________ key makes it more difficult to write search routines.

composite primary

The conflicts between design efficiency, information requirements, and performance are often resolved through ________.

compromises that may include denormalization

A column is a nonkey column if it is ____. a. in first normal form b. in second normal form c. a part of the primary key d. not a part of the primary key

d. not a part of the primary key

The conversion of an unnormalized table to first normal form requires the removal of ____. a. determinants b. interrelation constraints c. nonkey columns d. repeating groups

d. repeating groups

data independence

data and the programs are used independently, each acan be changed without changing the other

Storing a(n) ________ attribute in a table makes it easy to write the application software to produce the desired results.

derived

schema

described the logical structure of a database

semantic data modeling

designer uses knowledge of business processses and information needs to create a diagram that shows what to include in the database

first normal form

for every instance of an entity, each attribute of the instance is completely determinted by the instance of the PK

third normal form

for every instance of an entity, each secondary attribute is dependent only on the pk, not on any other secondary attribute

____ refers to the level of detail represented by the values stored in a table's row.

granularity

logical view of data

how people conceptually organize and understand the data

When designing a database you should _______.

make sure entities are in normal form before table structures are created

An associative entity is added to the ERD model to resolve a ...

many-to-many relationship between two entities

data integration

master files are combined into large "pools" of data that many application programs access

data sharing

more easily shared with authorized users

A→→ B signifies that B is ____________________ on A.

multidependent

insert anomaly

no way to store info until something else occurs

A(n) ________ attribute is not part of a key.

nonprime

Identification of the ________ will let you know where you are in the normalization process.

normal form

The ____________________ process enables you to identify the existence of potential problems in the design of a database.

normalization

π

projection ex. π(a1, a2, a3) (R) Has at most as many rows as R (should return all rows)

physical view

refers to how and where the data are physically arranged and stored in the computer system

The ________ model views the data as part of a table or collection of tables in which all key values must be identified.

relational

cross-functional analysis

relationships can be explicitly defined and used in the preparation of management reports

relational data model

represents conceptual and external level schemas as if data are stored in tables

database administrator

responsible for the database

Nonkey attribute

see nonprime attribute

To implement a surrogate key, MS Access uses an AutoNumber data type, MS SQL Server uses an identity column, and Oracle uses a(n) ________ object.

sequence

report writer

simplifies report creation

A ________ key is an artificial PK introduced by the designer with the purpose of simplifying the assignment of primary keys to tables.

surrogate

database management system

the interface between the database and the various application programs

Cardinality

the maximum number of times that an instance of an entity (table) can be associated with instances of another entity (table)

conceptual level schema

the organization wide view of the entire database

Update Anomaly

what happens if the name of a project changes- have to update it.

insertion anomaly

what if a new project is negotiated when there is no employee assigned to it

Insertion anomaly

what if partial info is inserted.

update anomaly

what if that name of the project PPV5678 is changed to sludge disposal

deletion anomaly

what is judy resigns from her job, what happens the the generation projects?

date of birth

what is not an example of biometrics

https

what protocol denotes and certificate use in e-commerce web sites

panopticon effect

what was the term used that typifies the environment in which we live today as far as our privacy

obscurity

which strategy would best be, only allowing logical access through intranet applications to the dbms

Advantages of a Database Approach

• Common database managed by a DBMS • Each entity appears only once in the system, reducing the duplication of data • With all data being in a single database, it is possible to list all information concerning the entities • Getting more information from the same amount of data - when all the data for various systems are stored in a single database, the information becomes available, as well as the process of retrieving the information can be quick and easy • Sharing of data - Several users can have access to the same piece of data • Balancing conflicting requirements - a person or group, often called Database Administrator (DBA) can structure the database in such a way that it benefits the entire organization, not just a single group • Controlling redundancy - not only saves space, but makes the updating process easier • Consistency - is a direct result of redundancy, so by reducing redundancy, there is much less potential for this sort of inconsistency with the database approach • Integrity - an integrity constraint is a rule that must be followed by data in the database. Example: Not allowing a person's age to be lower than zero • Security - the prevention of access to the database by unauthorized users • Increasing productivity - a good DBMS comes with many features that allow users to gain access to data without having to do any programming • Data independence - a property that allows the structure of a database to be changed without the programs that access the database having to change

Disadvantages of Database Approach

• DBMS size - DBMSs are large programs that occupy a large amount of disk space as well as internal memory • DBMS complexity - the complexity and breadth of the functions provided by a DBMS make it a complex product to use • Greater impact of a failure - a failure on the part of any one user that damages the database in some way may affect all the other users on the system • More difficult recovery - if the database is being updated by a large number of users, all updates must be redone since the time of its restoration

Disadvantages of Non-database Approach

• Duplication of data • Extremely difficult to fulfill requirements that involve data from more than one system

none

• Which one of these restricts company's internal use of customers information

1st Normal Form

•Each cell is single-valued •Entries in a column are same type •Rows uniquely identified - Add Unique ID, or Add more columns to make unique (the order of the rows and the order of the columns are irrelevant) •All atomic attributes

Natural Join

⋈ without condition Connects on similar attributes At most |R| * |S| tuples in R ⋈ S

prime attribute

主屬性

dependency diagram

依賴關係圖

transitive dependency

傳遞依賴

atomic attribute

原子屬性

atomicity

原子性

denormalization

反正規化

normalization

正規化

first normal form (1NF)

第一正規式(1NF)

third normal form (3NF)

第三正規式(3NF)

second normal form (2NF)

第二正規式(2NF)

fourth normal form (4NF)

第四正規式(4NF)

granularity

粒度

determinant

行列式

partial dependency

部分依賴

repeating group

重複組

key attribute

鍵屬性

nonprime attribute

非屬性

nonkey attribute

非鍵屬性


Set pelajaran terkait

Linux Ess. Ch 12, Linux Final, Linux, 211 FINAL, DCOm 142 Linux Final Study

View Set

Chapter 10 - Communicating Effectively, Chapter 9 - Thinking Critically, Making Decisions, Solving Problems

View Set

ACT study guide Math: Roots of Polynomials - Factoring to Find Solutions

View Set

Module 9: IPv4 and Network Segmentation

View Set

PATHOS - WEEK 2 - Neoplasm and Cancer

View Set

Characteristics of an Effective and Successful Team

View Set

fluid and electrolyte balance ATI

View Set

NASM Chapter 14 Flexibility Training Concepts

View Set

Chapter 10 accounting (and extra)

View Set