C175 Data Management Foundations
normalization stages
1NF, 2NF, and 3NF are _____.
composite
A _____ attribute is an attribute that can be further subdivided to yield additional attributes.
required
A _____ attribute is an attribute that must have a value.
simple
A _____ attribute is one that cannot be subdivided.
repeating group
A relational table must not contain a(n) _____.
data manipulation language (DML)
A(n) _____ defines the environment in which data can be managed and is used to work with the data in the database.
database language
A(n) _____ enables the user to perform complex queries designed to transform the raw data into useful information.
transitive dependency
A(n) _____ exists when there are functional dependencies such that Y is functionally dependent on X, Z is functionally dependent on Y, and X is the primary key.
contraint
A(n) _______ is a restriction placed on the data.
one
Ideally, an entity identifier is composed of this many attribute(s).
hierarchical
In the _______ model, each parent can have many children, but each child has only one parent.
hierarchical
In the _______ model, the basic logical structure is represented as an upside-down tree.
name node
The HDFS ____ stores all the metadata about a file system.
hierarchical
The _____ data model was developed in the 1960s-1970s.
relational
The _____ data model was developed in the 1970s.
object-oriented
The _____ data model was developed in the 1980s.
network model
The _____ has structural level dependence.
1:M
The _____ relationship is the "relational model ideal."
object-oriented
The _______ model uses the term connectivity to label the relationship types.
context
To reveal meaning, information requires _____.
performance tuning
_____ relates to activities that make a database operate more efficiently in terms of storage and access speed.
unary
_____ relationships are common in manufacturing industries.
business intelligence
_____ tools focus on the strategic and tactical use of information.
relational models
_____ view data as part of a table or collection of tables in which all key values must be identified.
Metadata
______ provide a description of the data characteristics and the set of relationships that link the data found within the database.
constraint
_______ are normally expressed in the form of rules.
partial dependency
__________ specifies that a non-prime attribute is functionally dependent on part of a candidate key
composite
A _____ attribute can be further subdivided to yield additional attributes.
derived
A _____ attribute need not be physically stored within the database.
repeating group
A _____ derives its name from the fact that a collection of multiple entries of the same type can exist for any single key attribute occurrence.
database language
A _____ enables the user to create database and table structures to perform basic data management chores.
weak
A _____ entity has a primary key that is partially or totally derived from the parent entity in the relationship.
weak entity
A _____ has a primary key that is partially or totally derived from the parent entity in the relationship.
segment
A _____ in a hierarchical model is the equivalent of a record in a file system.
unary
A _____ relationship exists when an association is maintained within a single entity.
ternary
A _____ relationship exists when three entities are associated.
strong
A _____ relationship is also known as an identifying relationship.
data model
A ______ is a relatively simple representation of more complex real-world data structures.
graphical
A data model is usually _____.
partial dependency
A dependency based on only a part of a composite primary key is called a _____.
partial dependency
A dependency based on only a part of a composite primary key is known as a _____.
dashed line
A derived attribute is indicated in the Chen notation by a _____ that connects the attribute and an entity.
entity
A noun in a business rule translates to a(n) _______ in the data model.
single valued
A person's Social Security number would be an example of a single valued attribute.
cardinality
A relationship _____ indicates the number of entities or participants associated with a relationship.
entities
A relationship is an association between _____.
Boolean
A specialty field in mathematics, known as _____ algebra, is dedicated to the use of logical operators.
candidate key
A table is in BCNF if every determinant in the table is a _____.
anomalies
A table that displays data redundancies yields _____.
first
A table that has all key attributes defined, has no repeating groups, and all its attributes are dependent on the primary key is said to be in _____ normal form.
second
A table that is in 1NF and includes no partial dependencies is said to be in _____ normal form.
3NF
A table that is in 2NF and contains no transitive dependencies is said to be in _____.
relationship
A verb associating two nouns in a business rule translates to a(n) _______ in the data model.
existence
A weak entity must be _____ dependent.
atomic
A(n) _____ attribute cannot be further subdivided.
class
A(n) _____ is a collection of similar objects with a shared structure and behavior.
composite identifier
A(n) _____ is a primary key composed of more than one attribute.
alias
A(n) _____ is an alternate name given to a column or table in any SQL statement.
index
A(n) _____ is an orderly arrangement used to logically access rows in a table.
transitive dependency
A(n) _____ is one in which an attribute is functionally dependent on another nonkey attribute.
schema
A(n) _____ is the conceptual organization of an entire database as viewed by a database administrator.
domain
A(n) _____ is the set of possible values for a given attribute.
application
A(n) _____ might be written by a programmer or it might be created through a DBMS utility program.
cascading
A(n) _____ order sequence is a multilevel ordered sequence that can be created easily by listing several attributes, separated by commas, after the ORDER BY clause.
iterative
A(n) _____ process is based on repetition of processes and procedures.
enterprise
A(n) ______ database is used by an organization and supports many users across many departments.
model
A(n) ______'s main function is to help one understand the complexities of the real world environment.
data definition language (DDL)
A(n) _______ enables a database administrator to describe schema components.
entity
A(n) _______ is anything about which data are to be collected and stored.
relationship
A(n) _______ is bidirectional.
attribute
A(n) _______ is the equivalent of a field in a file system.
entity
A(n) _______ represents a particular type of object in the real world.
parentheses
According to the rules of precedence, performing operations within ____ should be completed first.
prime attribute
An attribute that is part of a key is known as a(n) _____.
relational
An entity in the entity relationship model corresponds to a table in the _____ environment.
existence
An entity is said to be _____-dependent if it can exist in the database only when it is associated with another related entity occurrence.
data structure
An implementation-ready data model should contain a description of the _____ that will store the end-user data.
attributes
An index key can have multiple _____ (a composite index).
bridge
Another name for a composite entity is a(n) _____ entity.
prime attribute
Any attribute that is at least part of a key is known as a _____, or key attribute.
determinant
Any attribute whose value determines other values within a row is known as a(n) _____.
candidate
BCNF can be violated only if the table contains more than one _____ key.
primary key
Because a partial dependency can exist only when a table's primary key is composed of several attributes, a table whose _____ consists of only a single attribute is automatically in 2NF once it is in 1NF.
architecture
Business intelligence (BI) _____ is composed of data, people, processes, technology, and the management of such components.
decision making
Business intelligence is a term used to describe a comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate, store, and analyze data with the purpose of generating and presenting information used to support business _____.
data warehouse
Business intelligence tools use the _____ data as the raw materials for data analytics to generate business knowledge.
writing
Business rules must be rendered in _____.
information
Complex _____ requirements may dictate data transformations, and they may expand the number of entities and attributes within the design.
business rules
Connectivities and cardinalities are established by concise statements known as _____.
partial
Dependencies based on only a part of a composite primary key are known as _____ dependencies.
dependency diagram
Dependencies can be identified with the help of a(n) _____.
column
Each _____ in a relation represents an attribute.
tuple
Each row in a relation is called a _____.
relational
Each row in the _____ table is known as an entity instance or entity occurrence in the ER model.
entity instance
Each row in the relational table is known as a(n) _____.
third
For most business transactional databases, we should normalize relations into _____ normal form.
business rules
From a database point of view, the collection of data becomes meaningful only when it reflects properly defined _______ .
derived
From a system functionality point of view, _____ attribute values can be calculated when they are needed to write reports or invoices.
relationships
Identifying the attributes of entities helps in the better understanding of _____ among entities.
partial dependencies
If a table has multiple candidate keys and one of those candidate keys is a composite key, the table can have _____ based on this composite candidate key even when the primary key chosen is a single attribute.
recursive
If an employee within an EMPLOYEE entity has a relationship with itself, that relationship is known as a _____ relationship.
strong
If an entity can exist apart from all of its related entities, then it is existence-independent, and it is referred to as a(n) _____ entity.
circle
In Crow's Foot notation, an optional relationship between entities is shown by drawing a _____ on the side of the optional entity.
conditional
In SQL, all _____ or boolean expressions evaluate to true or false.
crow foot notation
In _____, a three pronged symbol represents the "many" side of the relationship.
one
In a 1:M relationship, to avoid the possibility of referential integrity errors, the data of the _____ side must be loaded first.
attribute
In a relational table, each column has a specific range of values known as the _____ domain.
dependency
In a(n) _____ diagram, the arrows above the attributes indicate all desirable dependencies.
attributes
In an SQL-based relational database, rows in different tables are related based on common values in common _____.
method
In object oriented terms, a _____ defines an object's behavior.
high processing speeds
In organizations that generate large number of transactions, _____ are often a top priority in database design.
rectangle
In the Chen and Crow's Foot notations, an entity is represented with a _____ containing the entity's name.
query
In the SQL environment, the word _____ covers both questions and actions.
fourth
In the _____ normal form, no row may contain two or more multivalued facts about an entity.
network
In the _______ model, the user perceives the database as a collection of records in 1:M relationships, where each record can have more than one parent.
partial dependencies
In the context of _____, data redundancies occur because every row entry requires duplication of data.
entity
In the context of data models, a(n) _____ is a person, place, thing, or event about which data will be collected and stored.
row
In the entity relationship model, a table _____ corresponds to an entity instance.
oval
In the original Chen notation, each attribute is represented by an _____ with the attribute name connected to an entity rectangle with a line.
keys
In the relational model, _____ are important because they are used to ensure that each row in a table is uniquely identifiable.
entity
Knowing the minimum and maximum number of _____ occurrences is very helpful at the application software level.
third
Most designers consider the BCNF as a special case of the _____ normal form.
relational data model
MySQL is an example of the _____.
relational
NoSQL databases are not based on the _____ model.
fault tolerance
NoSQL databases provide _____.
three
Normalization works through a series of stages called normal forms. For most purposes in business database design, _____ stages are as high as you need to go in the normalization process.
DKNF
Of the following normal forms, _____ is mostly of theoretical interest.
hierarchical
One of the limitations of the _______ model is that there is a lack of standards.
XML/Hybrid data model
Oracle 11g is an example of the _______.
optional
Participation is _____ if one entity occurrence does not require a corresponding entity occurrence in a particular relationship.
left to right
Question 15 :String comparisons are made from _____.
single entity
Repeating groups must be eliminated by ensuring that each row defines a _____.
logical
SQL allows the use of _____ restrictions on its inquiries such as OR, AND, and NOT.
many-to-many
Students and classes have a _______ relationship.
rectangle
The Chen notation identifies a weak entity by using a double-walled entity _____.
solid
The Crow's Foot notation depicts the strong relationship with a(n) _____ line between the entities.
client node
The HDFS _____ acts as the interface bewtween the user application and the HDFS.
insert
The SQL command that allows a user to insert rows into a table is _____.
select
The SQL command that allows a user to list the contents of a table is _____.
commit
The SQL command that allows a user to permanently save data changes is _____.
SELECT
The ____ command, coupled with appropriate search conditions, is an incredibly powerful tool that enables a user to transform data into information.
between
The ____ operator is used to check whether an attribute value lies within two bounds?
system catalog
The _____ can be described as a detailed system data dictionary that describes all objects within the database, including data about table names, the table's creator and creation date, the number of columns in each table, the data type corresponding to each column, index filenames, index creators, authorized users, and access privileges.
not null
The _____ constraint can be placed on a column to ensure that every row in the table has a value for that column.
system catalog
The _____ is actually a system-created database whose tables store the user/designer-created database characteristics and contents.
database structure
The _____ itself is stored as a collection of files, and the only way to access the data in those files is through the DBMS.
UML
The _____ notation of entity-relationship modelling can be used for both conceptual and implementation modelling.
1:M
The _____ relationship is the relational database norm.
1:1
The _____ relationship should be rare in any relational database design.
is null
The _____ special operator is used to check whether an attribute value is null.
object-oriented
The _______ data model uses the concept of inheritance.
entity relationship
The _______ model uses the term connectivity to label the relationship types.
entity relationship
The _______ model was developed to allow designers to use a graphical tool to examine structures rather than describing them with text.
primary key
The concept of relationship strength is based on how the _____ of a related entity is defined.
collection of files
The database structure in a DBMS is stored as a _____.
derived
The decision to store _____ attributes in database tables depends on the processing requirements and the constraints placed on a particular application.
conceptual
The entity relationship diagram (ERD) represents the _____ database as viewed by the end user.
optional
The existence of a(n) _____ entity indicates that its minimum cardinality is zero.
entity-relationship diagram (ERD)
The first step in building an _____ is creating a detailed narrative of the organization's description of operations.
normalization
The objective of _____ is to ensure that each table conforms to the concept of well-formed relations.
rows and columns
The order of the _____ is important to the DBMS.
expected use
The organization of data within folders in a manual file system is determined by _____.
logical view
The practical significance of taking the _____ is that it serves as a reminder of the simple file concept of data storage.
anomalies
The problem with transitive dependencies is that they still yield data _____, inconsistencies, and irregularities.
data redundancy
The proper use of foreign keys is crucial to controlling _____.
relation
The relational model's foundation is a mathematical concept known as a _____.
query result set
The response of the DBMS to a query is the _____.
domain
The row's range of permissible values is known as its _____.
exists
The special operator used to check whether a subquery returns any rows is _____.
like
The special operator used to check whether an attribute value matches a given string pattern is _____.
database system
The term _____ refers to an organization of components that define and regulate the collection, storage, management and use of data within a database environment.
constraint
These are important because they help to ensure data integrity.
select
To make the output more readable, the SQL standard permits the use of aliases for any column in a _____ statement.
binary
To simplify the conceptual design, most higher-order relationships are decomposed into appropriate equivalent _____ relationships whenever possible.
file system
VMS/VSAM is an example of the _______ data model.
normalized
When designing a database, you should make sure that the table entities are _____ before table structures are created.
minimum
When indicating cardinality, the first value represents the _____ number of associated entities.
symbols
When the specific cardinalities are not included on the diagram in Crow's Foot notation, cardinality is implied by the use of _____.
solid
When using the Crow's Foot notation, the associative entity is indicated by _____ relationship lines between the parents and the associative entity.
unique
When you define a table's primary key, the DBMS automatically creates a(n) _____ index on the primary key column(s) you declared.
data model
Within the database environment, a(n) _____ represents data structures with the purpose of supporting a specific problem domain.
persistent
You can think of a table as a _____ representation of a logical relation.
normalization
____ works through a series of stages called normal forms.
primary keys
____, also known as Identifiers, are underlined in an ER diagram.
business rules
_____ are brief, precise, and unambiguous description of a policy, procedure, or principle within a specific organization.
attributes
_____ are characteristics of entities.
dependency diagrams
_____ are very helpful in getting a bird's eye view of all the relationships among a table's attributes.
business rules
_____ can serve as a communication tool between the users and designers.
Metadata
_____ describe the data characteristics and the set of relationships that links the data found within the database.
RDBMSs
_____ enforce integrity rules automatically.
spreadsheet applications
_____ enhance the user's ability to understand the data.
Data inconsistency
_____ exists when different and conflicting versions of the same data appear in different places.
cardinality
_____ expresses the minimum and maximum number of entity occurrences associated with one occurrence of the related entity.
business intelligence
_____ functionality ranges from simple data gathering and transformation to very complex data analysis and presentation.
business intelligence
_____ is a framework that allows a business to transform data into information, information into knowledge, and knowledge into wisdom.
normalization
_____ is a process of organizing the data in the database to avoid insertion anomaly, data redundancy, update and deletion anomaly
normalization
_____ is a process to help reduce the likelihood of data anomalies.
documentation
_____ not only helps database designers to stay on track during the design process, it also enables them to pick up the design thread when the time comes to modify the design.
data redundancy
_____ produces data anomalies.
granularity
_____ refers to the level of detail represented by the values in a table's row.
granularity
_____ refers to the level of detail represented by the values stored in a table's row.
data redundancy
_____ reflects a situation where the same data is stored unnecessarily at different places.
data anomalies
data redundancy produces _____.
data warehouse
A _____ can store data derived from many sources.
data dictionary
A _____ contains at least all of the attribute names and characteristics for each table in the system.
unique
A _____ index is an index in which the index key can have only one pointer value (row) associated with it.
field
A _____ is a character or group of characters that has a specific meaning.
database management system
A _____ is a collection of programs that manages the database structure and controls access to the data stored in the database.
file
A _____ is a collection of related records.
superkey
A _____ is a key that can uniquely identify any row in the table.
record
A _____ is a logically connected set of one or more fields that describes a person, place, or thing.
query
A _____ is a specific request issued to the DBMS for data manipulation.
null
A _____ is created when you press the Enter key or the Tab key to move to the next entry without making a prior entry of any kind.
data dictionary
A _____ is sometimes described as "the database designer's database" because it records the design decisions about tables and their structures.
foreign key
A _____ is the primary key of one table that has been placed into another table to create a common attribute.
candidate
A _____ key can be described as a minimal superkey, a superkey without any unnecessary attributes.
secondary
A _____ key is defined as a key that is used strictly for data retrieval purposes.
data dictionary
A _____ provides a detailed description of all tables found within the user/designer-created database
ad-hoc
An _____ query is a spur-of-the-moment question.
verifiable
Data is said to be _____ if the data always yields consistent results.
operational databases
Data warehouse contains historical data obtained from the_____.
Julian
Date attributes contain calendar dates stored in a special format known as the _____ date format.
structured
____ data are the result of formatting disorganized data in order to facilitate storage, use and generation of information.
Data
____ is the the result of processing raw facts to reveal its meaning.
primary keys
_____ are required in relational databases to ensure data integrity.
unstructured
_____ data exist in a format that does not lend itself to processing that yields information.
unstructured
_____ data exist in the format in which they were collected.
Analytical
_____ databases focus primarily on storing data used to generate information required to make tactical or strategic decisions.
performance tuning
_____ relates to the activities that make the database execute transactions more efficiently in terms of storage and access speed.
DBMSs
_____ serve as the intermediary between the user and the database.
information
______ is the result of revealing the meaning of raw facts.
field
______ refers to a collection of related records.
single-user
A desktop database is a _____ database.
candidate
A primary key is a _____ key selected to uniquely identify all other attribute values in any given row.
relation
A table is also called a _____ because the relational model's creator, E. F. Codd, used the two terms as synonyms.
multi-user
A workgroup database is a(n) _____ database.
workgroup
A(n) _____ database supports a relatively small number of users (usually fewer than 50) or a specific department within an organization.
data anomaly
A(n) _____ develops when all required changes in the redundant data are not made successfully.
good decision making
Accurate, relevant, and timely information is the key to _____.
semistructured
An XML database supports the storage and management of _____ XML data.
string data type
Character data type also known as _____, can contain any character or symbol not intended for mathematical manipulation, but character sets can include digits or a numeric string.
null
Depending on the sophistication of the application development software, _____ can create problems when functions such as COUNT, AVERAGE, and SUM are used
column
Each table _____ represents an attribute.
1:1
If one department chair "a professor" can chair only one department, and one department can have only one department chair. The entities PROFESSOR and DEPARTMENT exhibit a _____ relationship.
synonym
In a database context, a(n) _____ indicates the use of different names to describe the same attribute.
keys
In a relational model, _____ are also used to establish relationships among tables and to ensure the integrity of the data.
semistructured
Most data that can be encountered are best classified as _____.
1:1
One characteristic of generalization hierarchies is that they are implemented as _____ relationships.
security breaches
One disadvantage of the DBMS is that it increases the risk of data _____.
Data Processing
Organization of data for the purpose of producing desired information; involves recording, classifying, sorting, summarizing, calculating, disseminating and storing data.
data warehouse
Proper _____ design requires carefully defined and controlled data redundancies to function properly.
formatted
Raw data must be properly _____ for storage, processing and presentation.
integrity
Referential _____ dictates that the foreign key must contain values that match the primary key in the related table, or must contain null.
entity
The CUSTOMER table's primary key is CUS_CODE. The CUSTOMER primary key column has no null entries, and all entries are unique. This is an example of _____ integrity.
data dictionary
The DBMS uses the _____ to look up the required data component structures and relationships, thus relieving programmers from having to code such complex relationships in each program.
islands of information
The term _____ refers to scattered locations storing the same basic data.
flags
To avoid nulls, some designers use special codes, known as _____, to indicate the absence of some value.
data inconsistency
_____ exists when different versions of the same data appear in different places
Data independence
_____ exists when it is possible to make changes in the data storage characteristics without affecting an application program's ability to access data.
Structural dependence
_____ exists when it is possible to make changes in the file structure without affecting the application program's ability to access the data.
Data redundancy
_____ exists when the same data are stored unnecessarily at different places.
Extensible Markup Language (XML)
_____ is a special language used to represent and manipulate data elements in a textual format.
Metadata
_____ is data about data through which the end-user data are integrated and managed.
Data anomaly
_____ is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.
data integrity
_____ is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.
end-user data
_____ is raw facts of interest to the end user.
knowledge
_____ is the body of information and facts about a specific subject.
predicate
_____ logic, used extensively in mathematics, provides a framework in which an assertion (statement of fact) can be verified as either true or false.
database programming languages
_____ receive all application requests and translate them into the complex operations required to fulfill those requests.
in-memory databases
_____ refer to a type of database that stores most of its data in RAM rather than in hard disks.