330 Final
________ is a scripting language that allows for the creation of customized tags, and is often used to facilitate exchange of data between applications over the Web.
Extensible Markup Language (XML)
PHP is a popular API for MySQL because of all of the following EXCEPT:
Microsoft integration
________ are anomalies that can be caused by editing data in tables.
Modification
Which of the following are properties of relations?
No two rows in a relation are identical
The need for consensus on data definitions is an example of which type of risk in the database environment?
Organizational conflict
Which language would you expect to see in a stored procedure?
PL/SQL
Which of the following is an advantage of stored procedures?
Performance improves for compiled SQL statements.
________ database specification indicates all the details of data storage that are then input to database implementation.
Physical
________ are established between entities in a data model in order to depict how they interact with each other.
Relationships
Which of the following violates the atomic property of relations?
Sam Hinz
Which of the following is NOT a reason to create an instance of a relational schema with sample data?
Sample data can reverse database implementation errors.
SOAP stands for:
Simple Object Access Protocol.
Which of the following is an entity that exists independently of other entity types?
Strong
________ is a tool even non-programmers can use to access relevant and desired information from a database.
Structured Query Language (SQL)
The traditional methodology used to develop, maintain and replace information systems is called the:
Systems Development Life Cycle.
________ analyze the business situation and identify the need for information and information services to meet the problems or opportunities of the business.
Systems analysts
(T/F) Data structures include data organized in the form of tables with rows and columns.
True
(T/F) Informational systems are designed to support decision making based on historical point-in-time and prediction data.
True
(T/F) Sample data are useful for developing prototype applications and for testing queries.
True
(T/F) The need for data warehousing in an organization is driven by its need for an integrated view of high-quality data.
True
An alternative name for an attribute is called a(n):
alias
Extensible Business Reporting Language (XBRL) is an example of:
an XML-based vocabulary.
The SDLC phase in which every data attribute is defined, every category of data is listed and every business relationship between data entities is defined is called the ________ phase.
analysis
The SDLC phase in which the detailed conceptual data model is created is the ________ phase.
analysis
At a basic level, analytics refers to:
analysis and interpretation of data.
The client/server architectures that have evolved can be distinguished by the distribution of ________ across clients and servers.
application logic components
In a file processing environment, descriptions for both the data and the logic for accessing the data are built into:
application programs
Web services:
are a set of emerging standards for protocols for automatic communication between software over the Web.
An entity that serves as a relationship between one or more entity types and contains attributes specific to the relationships is called a(n):
associative entity
A property or characteristic of an entity type that is of interest to the organization is called a(n):
attribute
The property by which subtype entities possess the values of all attributes of a supertype is called:
attribute inheritance
A person's name, birthday, and social security number are all examples of:
attributes
Unlike a project-level data model, an enterprise-level model will typically not include _________.
attributes
A(n) ________ constraint specifies the number of instances of one entity that can be associated with each instance of another entity.
cardinality
The main idea behind ________ computing is a model for providing ubiquitous, convenient and on-demand access to shared computing resources.
cloud
A(n) ________ constraint is a type of constraint that addresses whether an instance of a supertype must also be an instance of at least one subtype.
completeness
An attribute that can be broken down into smaller parts is called a(n) ________ attribute.
composite
The Hadoop Distributed File System (HDFS) is the foundation of a ________ infrastructure of Hadoop.
data management
All of the following are ways to consolidate data EXCEPT:
data rollup and integration
A technique using pattern recognition to upgrade the quality of raw data is called:
data scrubbing
Converting data from the format of its source to the format of its destination is called:
data transformation
The attribute on the left-hand side of the arrow in a functional dependency is the:
determinant
The ________ rule specifies that an entity can be a member of only one subtype at a time.
disjoint
Of the following, ________ focus the LEAST on determining the requirements for the database component of an information system.
end users
Database development begins with ________, which establishes the range and general contents of organizational databases.
enterprise data modeling
A primary key whose value is unique across all relations is called a(n):
enterprise key
A person, place, object, event, or concept that the organization wishes to model for a database is called a(n):
entity
In an ER diagram, a box represents a(n) ___________.
entity
The ________ states that no primary key attribute may be null.
entity integrity rule
Using a packaged data model, projects take less time and cost because:
essential components and structures are already defined.
A(n) _______ client is a PC that is responsible for processing presentation logic, extensive application and business rules logic and even DBMS functions.
fat
Program-data dependence is caused by:
file descriptions being stored in each database application.
When all multivalued attributes have been removed from a relation, it is said to be in:
first normal form
An attribute in a relation of a database that serves as the primary key of another relation in the same database is called a
foreign key
The normal form which deals with multivalued dependencies is called:
fourth normal form.
A constraint between two attributes is called a(n):
functional dependency
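A minimal sketch of what a functional dependency means in practice, assuming a small made-up employee table (the attribute names and the helper fd_holds are hypothetical, not from the course material). It checks whether each determinant value maps to exactly one dependent value in the sample rows. Sample data can only disprove a dependency; it cannot prove that one holds in general.

```python
def fd_holds(rows, determinant, dependent):
    """Return True if determinant -> dependent is not violated by the rows."""
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in determinant)
        value = tuple(row[a] for a in dependent)
        # The first value seen for a determinant must match every later one.
        if seen.setdefault(key, value) != value:
            return False
    return True


# Hypothetical sample rows, for illustration only.
employees = [
    {"emp_id": 1, "name": "Ann", "dept": "Sales", "dept_phone": "555-0100"},
    {"emp_id": 2, "name": "Bob", "dept": "Sales", "dept_phone": "555-0100"},
    {"emp_id": 3, "name": "Cy", "dept": "IT", "dept_phone": "555-0199"},
]

id_determines_name = fd_holds(employees, ["emp_id"], ["name"])
dept_determines_phone = fd_holds(employees, ["dept"], ["dept_phone"])
dept_determines_name = fd_holds(employees, ["dept"], ["name"])
```

Here emp_id -> name and dept -> dept_phone survive the check, while dept -> name is violated because two Sales rows carry different names.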
The process of creating a new supertype based on finding common attributes or relationships among several entity types is called:
generalization
The process of combining data from various sources into a single table or view is called:
joining
It is true that in an HDFS cluster the DataNodes are the:
large number of slaves
The need to ________ relations commonly occurs when different views need to be integrated.
merge
________ are data that describe the properties of other data.
metadata
________ is/are any of several classes of software that allow an application to interoperate with other software without requiring the user to understand all software involved.
middleware
The entity integrity rule states that:
no primary key attribute can be null.
The number of entity types that participate in a recursive relationship is:
one
Understanding the steps involved in transforming EER diagrams into relations is important because:
one must be able to check the output of a CASE tool.
The ________ rule states that an entity instance can simultaneously be a member of two (or more) subtypes.
overlap
The subtype discriminator is a composite attribute when there is a(n):
overlap rule
A functional dependency in which one or more nonkey attributes are functionally dependent on part, but not all, of the primary key is called a ________ dependency.
partial functional
The ________ rule specifies that an instance of a supertype entity is allowed not to belong to any of its subtypes.
partial specialization
Descriptive, predictive, and ________ are the three main types of analytics.
prescriptive
In an ER diagram, a line represents a(n) ___________.
relationship
A centralized knowledge base of all data definitions, data relationships, screen and report formats, and other system components is called a(n)
repository
With the database approach, data descriptions are stored in a central location known as a:
repository
A(n) _________ attribute of an entity type must have a value for every entity instance of that type.
required
XPath and XQuery are both technologies used to:
retrieve data from XML documents.
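As a rough illustration of retrieving data from an XML document with path expressions, the sketch below uses Python's standard xml.etree.ElementTree module, which supports only a limited subset of XPath and does not implement XQuery. The catalog document and its element names are invented for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML document, for illustration only.
doc = """
<catalog>
  <product id="p1"><name>Desk</name><price>175</price></product>
  <product id="p2"><name>Chair</name><price>80</price></product>
</catalog>
"""

root = ET.fromstring(doc)

# Path expression: every <name> element that is a child of a <product>.
names = [element.text for element in root.findall("./product/name")]

# Path expression with a predicate on an attribute value.
chair_price = root.find("./product[@id='p2']/price").text
```

The first expression selects by position in the tree, the second filters on the id attribute, which is the kind of retrieval the card describes.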
Data quality ROI stands for:
risk of incarceration
A PC configured to handle user interface with little or no local storage is called a(n) _______ client.
thin
Cloud computing relies most heavily on:
three-tier architectures.
One characteristic of quality data which pertains to the expectation for the time between when data are expected and when they are available for use is:
timeliness
The ________ rule specifies that each entity instance of the supertype must be a member of some subtype in the relationship.
total specialization
A functional dependency between two or more nonkey attributes is called a:
transitive dependency
A data warehouse derives its data from:
various operational data sources
The three 'v's commonly associated with big data include:
volume, variety, and velocity.
An entity type whose existence depends on another entity type is called a ________ entity.
weak
A relation that contains minimal redundancy and allows easy use is considered to be:
well-structured.
E. F. Codd developed the relational model in the:
1970s
Operational and informational systems are generally separated because of which of the following factors?
A data warehouse centralizes data that are scattered throughout disparate operational systems and makes them readily available for decision support applications.
Which of the following advances in information systems contributed to the emergence of data warehousing?
Advances in middleware products that enabled enterprise database connectivity across heterogeneous platforms.
A(n) ________ is a set of application routines that programs use to direct the performance of procedures by the computer's operating system.
Application Program Interface (API)
________ is the process of assigning pieces of application code to clients or servers.
Application partitioning
The normal form which removes any remaining functional dependencies because there was more than one primary key for the same nonkeys is called:
Boyce-Codd normal form
Which of the following is a component of processing logic?
Business rules
Which of the following factors drive the need for data warehousing?
Businesses need an integrated view of company information.
Which of the following criteria should be considered when selecting an identifier?
Choose an identifier that doesn't have large composite attributes.
Which of the following is NOT a method for storing XML documents?
Convert to text
________ is a component of the relational data model included to specify business rules to maintain the integrity of data when they are manipulated.
Data integrity
________ problems are encountered when removing data with transitive dependencies.
Deletion
All of the following are newer XML schema languages EXCEPT:
Document Type Declarations (DTDs)
Which of the following organizational trends does not encourage the need for data warehousing?
Downsizing
________ is the most popular data modeling notation for people who design relational databases.
ERD
(T/F) A composite key consists of only one attribute.
False
(T/F) A foreign key is a primary key of a relation that also is a primary key in another relation.
False
(T/F) The development of the relational data model did not contribute to the emergence of data warehousing.
False
(T/F) When multiple systems in an organization are synchronized, the need for data warehousing increases.
False
FLWOR is an acronym for:
For, Let, Where, Order by, Return.
Which of the following is NOT a characteristic of a good business rule?
Implementational
Which of the following is NOT a cost and/or risk of the database approach?
Increased long-term programmer labor costs.
The three-schema approach, as defined by ANSI, includes which of the following schemas?
Internal
If you write a database application in a Java program, you are most likely to use the ________ API.
JDBC
The Hadoop framework consists of the ________ algorithm to solve large scale problems.
MapReduce
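The MapReduce idea can be sketched in a few lines of single-machine Python. This is only a toy word count under assumed function names (map_phase, shuffle, reduce_phase); it is not Hadoop's API, and a real cluster would distribute each phase across many nodes.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: collapse each key's list of values into a single count."""
    return {key: sum(values) for key, values in groups.items()}


# Hypothetical input documents, for illustration only.
documents = ["big data big clusters", "data nodes store data"]
pairs = [pair for doc in documents for pair in map_phase(doc)]
counts = reduce_phase(shuffle(pairs))
```

Each document can be mapped independently and each key can be reduced independently, which is what lets the approach scale to large problems.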
An iterative methodology that rapidly repeats the analysis, design, and implementation phases of the SDLC is called:
RAD.
Which of the following is NOT an advantage of database systems?
Redundant data
Which of the following is false about three-tier architectures?
Tends to involve fatter clients than two-tier architectures
Which of the following is NOT an objective that drove the development and evolution of database technology?
The desire to require programmers to write all file handling functionality
Which of the following is an INCORRECT statement about stored procedures?
There can be significant network traffic increases using stored procedures as processing moves from client to server.
(T/F) A primary key is an attribute that uniquely identifies each row in a relation.
True
(T/F) Advances in computer hardware, particularly the emergence of affordable mass storage and parallel computer architectures, were among the key advances that led to the emergence of data warehousing.
True
A technical specification for creating a distributed registry of Web services and businesses that are open to communicating through Web services is called:
UDDI
The promise of Web services is the development of a standardized communication system using:
XML
An XML transformation language that allows applications to query both XML data and relational databases is called:
XQuery
________ facilitates the ability of applications to query relational data along with associated structured data.
XQuery
________ is a language used to transform complex XML documents and also to create HTML pages from XML documents.
XSLT
Service-oriented architectures (SOA) are:
a collection of services that communicate with each other in some manner.
Extensible Markup Language (XML) is:
a scripting language that allows the creation of customized tags to enable easier sharing of data across organizations.
An entity cluster is:
a set of one or more entity types and associated relationships grouped into a single abstract entity type.
The process of transforming data from a detailed to a summary level is called:
aggregating
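A small sketch of aggregating, assuming an invented sale table held in an in-memory SQLite database through Python's standard sqlite3 module. Detail rows are rolled up to one summary row per region.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sale (region TEXT, amount REAL)")

# Hypothetical detail-level rows, for illustration only.
conn.executemany(
    "INSERT INTO sale VALUES (?, ?)",
    [("East", 100.0), ("East", 50.0), ("West", 75.0)],
)

# Aggregate from detail level to one summary row per region.
summary = conn.execute(
    "SELECT region, SUM(amount), COUNT(*) "
    "FROM sale GROUP BY region ORDER BY region"
).fetchall()
```

The three detail rows become two summary rows, each holding a regional total and the number of sales behind it.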
One of the most popular RAD methods, which even has a "manifesto," is called:
agile
A primary key that consists of more than one attribute is called a:
composite key
A rule that CANNOT be violated by database users, and that helps to ensure data quality, is called a:
constraint
In the SQL language, the ________ statement is used to make table definitions.
create table
When a regular entity type contains a multivalued attribute, one must:
create two new relations, one containing the multivalued attribute.
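One way to picture this rule, sketched with Python's standard sqlite3 module and made-up names (employee, employee_skill): the regular attributes stay in one relation, and the multivalued attribute moves to a second relation whose primary key combines the owner's key with the attribute value.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite checks foreign keys only when enabled
conn.executescript("""
CREATE TABLE employee (
    emp_id INTEGER PRIMARY KEY,
    name   TEXT NOT NULL
);
CREATE TABLE employee_skill (
    emp_id INTEGER NOT NULL REFERENCES employee(emp_id),
    skill  TEXT NOT NULL,
    PRIMARY KEY (emp_id, skill)
);
""")

# Hypothetical entity instances with a multivalued "skills" attribute.
unnormalized = [(1, "Ann", ["SQL", "Java"]), (2, "Bob", ["SQL"])]

for emp_id, name, skills in unnormalized:
    conn.execute("INSERT INTO employee VALUES (?, ?)", (emp_id, name))
    conn.executemany(
        "INSERT INTO employee_skill VALUES (?, ?)",
        [(emp_id, skill) for skill in skills],
    )

skill_rows = conn.execute(
    "SELECT emp_id, skill FROM employee_skill ORDER BY emp_id, skill"
).fetchall()
```

Each skill now occupies its own row, so neither relation stores a list of values in a single column.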
All of the following are tasks of data cleansing EXCEPT:
creating foreign keys
Information is processed __________.
data
Conformance means that:
data are stored, exchanged or presented in a format that is specified by its metadata.
Including data capture controls (e.g., dropdown lists) helps reduce ________ deteriorated data problems.
data entry
When we consider data in the data warehouse to be time-variant, we mean:
data in the warehouse contain a time dimension so that they may be used to study trends and changes.
A graphical system used to capture the nature and relationships among data is called a(n):
data model
The number of entity types that participate in a relationship is called its _________.
degree
An attribute that can be calculated from related attribute values is called a ________ attribute.
derived
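A common textbook example of a derived attribute is a person's age, which can be calculated from a stored date of birth rather than stored itself. The helper below (age_on is a hypothetical name) is a minimal sketch of that calculation.

```python
from datetime import date


def age_on(birth_date, as_of):
    """Derive age in whole years from a stored birth date."""
    had_birthday = (as_of.month, as_of.day) >= (birth_date.month, birth_date.day)
    return as_of.year - birth_date.year - (0 if had_birthday else 1)


# Hypothetical dates, for illustration only.
age_day_before = age_on(date(2000, 6, 15), date(2024, 6, 14))
age_on_birthday = age_on(date(2000, 6, 15), date(2024, 6, 15))
```

Because the value changes with the as-of date, computing it on demand avoids storing a number that would go stale.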
Allowing users to dive deeper into the view of data with online analytical processing (OLAP) is an important part of
descriptive analytics
When online analytical processing (OLAP) studies last year's sales, this represents:
descriptive analytics
A nonkey attribute is also called a(n):
descriptor
Customers, cars, and parts are examples of:
entities
Data governance can be defined as:
high-level organizational groups and processes that oversee data stewardship.
An attribute that may have more than one meaning is called a(n):
homonym
A(n) ________ is the relationship between a weak entity type and its owner.
identifying relationship
Testing and installing the database and its associated application programs is done in the ________ phase.
implementation
The best place to improve data entry across all applications is:
in the database definitions
All of the following are characteristics of cloud technologies EXCEPT:
infinite bandwidth
The analysis of summarized data to support decision making is called:
informational processing.
A domain definition consists of all of the following components EXCEPT:
integrity constraints
Big Data includes:
large volumes of data with many different data types that are processed at very high speeds.
Informational and operational data differ in all of the following ways EXCEPT:
level of detail
The SDLC phase in which a conceptual model is transformed into a relational database structure is called:
logical design
A form of database specification which maps conceptual requirements is called:
logical specifications.
A database is an organized collection of ________ related data.
logically
A relationship where the minimum and maximum cardinality are both one is a(n) ________ relationship.
mandatory one
All of the following are the main goals of normalization EXCEPT:
maximize storage space use.
An attribute (or attributes) that uniquely identifies each row in a relation is called a:
primary key
A Web server:
processes client requests and returns HTML pages to the client.
All of the following are properties of metadata EXCEPT:
processing logic
Data federation is a technique which:
provides a virtual view of integrated data without actually creating one centralized database.
Event-driven propagation:
pushes data to duplicate sites as an event occurs.
A rule that states that each foreign key value must match a primary key value in the other relation is called the:
referential integrity constraint.
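The constraint can be seen in action with a short sketch using Python's standard sqlite3 module. The customer and orders tables are invented for the example, and note that SQLite only enforces foreign keys after PRAGMA foreign_keys = ON has been issued on the connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # off by default in SQLite
conn.execute(
    "CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT NOT NULL)"
)
conn.execute("""
    CREATE TABLE orders (
        order_id    INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customer(customer_id)
    )
""")

conn.execute("INSERT INTO customer VALUES (1, 'Ann')")
conn.execute("INSERT INTO orders VALUES (100, 1)")  # matches customer 1

try:
    # Customer 99 does not exist, so this foreign key value has no match.
    conn.execute("INSERT INTO orders VALUES (101, 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

order_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```

The second insert is refused because its foreign key value matches no primary key value in customer, so only the valid order remains.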
A two-dimensional table of data sometimes is called a:
relation
Relational databases establish the relationships between entities by means of common fields (keys) included in a file (or table) called a(n):
relation
Informational systems are designed for all of the following EXCEPT:
running a business in real time.
A relation that contains no multivalued attributes and has nonkey attributes solely dependent on the primary key but contains transitive dependencies is in which normal form?
second
A workgroup database is stored on a central device called a:
server
It is true that in an HDFS cluster the NameNode is the:
single master server
One simple task of a data quality audit is to:
statistically profile all files
When an XML document is shredded, each element is:
stored in a relational table
A(n) ________ is a module of code written in SQL or some proprietary language to run business rules from within a database server.
stored procedure
The most common types of entities are:
strong entities
The characteristic that indicates that a data warehouse is organized around key high-level entities of the enterprise is:
subject-oriented.
An attribute of a supertype that determines the target subtype(s) is called the __________ __________ (two words).
subtype discriminator
Two or more attributes having different names but the same meaning are called:
synonyms
Which of the following is a basic method for single field transformation?
table lookup
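Table lookup as a single-field transformation can be sketched with a plain dictionary: a source code is translated to its target value by looking it up in a separate table. The lookup contents and the function name transform_state are assumptions made for the example.

```python
# Hypothetical lookup table mapping source codes to target values.
STATE_LOOKUP = {"IN": "Indiana", "OH": "Ohio"}


def transform_state(code, lookup=STATE_LOOKUP):
    """Transform one source field by looking its value up in a table."""
    # Normalize the key first so " in " and "IN" resolve to the same entry.
    return lookup.get(code.strip().upper(), "UNKNOWN")


source_field_values = ["IN", " oh ", "ZZ"]
transformed = [transform_state(value) for value in source_field_values]
```

Values missing from the table are mapped to a placeholder here; a real load process might instead reject or log them.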
Data is represented in the form of:
tables
A candidate key must satisfy all of the following conditions EXCEPT:
the key must indicate the row's position in the table.
Subtypes should be used when:
there are attributes that apply to some but not all instances of an entity type.
External data sources present problems for data quality because:
there is a lack of control over data quality
Quality data can be defined as being:
unique
Because applications are often developed independently in file processing systems:
unplanned duplicate data files are the rule rather than the exception
A(n) ________ is often developed by identifying a form or report that a user needs on a regular basis.
user view