Chapter 6
You are creating a database to store temperature and wind data from various airports. Which of the following fields is the most likely candidate to use as the basis for a primary key in the Airport table?
Airport code
Microsoft SQL Server is a(n):
DBMS for midrange computers.
Which common database challenge is illustrated by a person receiving multiple copies of an L.L. Bean catalog, each addressed to a slightly different variation of his or her full name?
Data inconsistency
Which of the following occurs when the same attribute in related data files has different values?
Data inconsistency
Which of the following is a specialized language that programmers use to add and change data in the database?
Data manipulation language
________ creates confusion that hampers the creation of information systems that integrate data from different sources.
Data redundancy
In a large organization, which of the following functions would be responsible for policies and procedures for managing internal data resources?
Database administration
Which of the following is software that handles all application operations between browser-based computers and a company's back-end business applications or databases?
Database server software
Which of the following is a grouping of characters into a word, a group of words, or a complete number?
Field
You work for a retail clothing chain whose primary outlets are in shopping malls and are conducting an analysis of your customers and their preferences. You wish to find out if there are any particular activities that your customers engage in, or the types of purchases made in the month before or after purchasing select items from your store. To do this, you will want to use the data mining software you are using to do which of the following?
Identify sequences
Which of the following is not one of the benefits of a blockchain database?
It prevents data from being altered retroactively.
The fact that a traditional file system cannot respond to unanticipated information requirements in a timely fashion is an example of which of the following issues with traditional file systems?
Lack of flexibility
Which of the following is a DBMS for desktop computers?
Microsoft Access
MongoDB and SimpleDB are both examples of:
NoSQL databases
In data mining, which of the following involves events linked over time?
Sequences
Which of the following is not one of the techniques used in web mining?
Server mining
Which of the following is not a method for performing a data quality audit?
Surveying data definition and query files
Which of the following best illustrates the relationship between entities and attributes?
The entity CUSTOMER with the attribute PURCHASE
Which of the following would you use to find patterns in user interaction data recorded by a web server?
Web usage mining
All of the following are technologies used to analyze and manage big data except:
cloud computing.
The join operation:
combines relational tables to provide the user with more information than is otherwise available.
The term big data refers to all of the following except:
datasets with fewer than a billion records.
DBMSs typically include report generating tools in order to:
display data in a more structured and polished format than would be possible just by querying
Data cleansing not only corrects errors but also:
enforces consistency among different sets of data
A schematic of the entire database that describes the relationships in a database is called a(n):
entity-relationship diagram..
Data mining allows users to:
find hidden relationships in data
An organization's rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information is called a(n):
information policy.
The process of streamlining data to minimize redundancy and awkward many-to-many relationships is called:
normalization
In the context of data relationships, the term associations refers to:
occurrences linked to a single event.
A DBMS makes the:
physical database available for different logical views.
The logical view of a database:
presents data as they would be perceived by end users.
A(n) ________ represent data as two-dimensional tables.
relational DBMS
In a table for customers, the information about a single customer resides in a single:
row.
Microsoft Access's data dictionary displays all of the following information about a field except the:
the organization within the organization that is responsible for maintaining the data.
In a relational database, a record is referred to in technical terms as a(n):
tuple.
A one-to-many relationship between two entities is symbolized in a diagram by a line that ends with:
two short marks.
A data lake is composed of:
unstructured and structured data that has not been analyzed
OLAP enables:
users to obtain online answers to ad-hoc questions in a rapid amount of time. programmers to quickly diagram data relationships. programmers to normalize data. users to quickly generate summary reports. *users to view both logical and physical views of data.
Which of the following enables a DBMS to reduce data redundancy and inconsistency?
Ability to minimize isolated files with repeated data
Which of the following enables you to create a script that allows a web server to communicate with a back-end database?
CGI
In data mining, which of the following involves recognizing patterns that describe the group to which an item belongs by examining existing items and inferring a set of rules?
Classifications
You work for a car rental agency and want to determine what characteristics are shared among your most loyal customers. To do this, you will want to use the data mining software you are using to do which of the following?
Classify data
Which of the following is an automated or manual file that stores information about data elements and data characteristics such as usage, physical representation, ownership, authorization, and security?
Data dictionary
In a large organization, which of the following functions would be responsible for physical database design and maintenance?
Database administration
In data mining, which of the following involves using a series of existing values to determine what other future values will be?
Forecasting
Which of the following technologies would you use to analyze the social media data collected by a major online retailer?
Hadoop
Which of the following is the most prominent data manipulation language today?
SQL
A household appliances manufacturer has hired you to help analyze its social media datasets to determine which of its refrigerators are seen as the most reliable. Which of the following tools would you use to analyze this data?
Sentiment analysis software
________ tools are used to analyze large unstructured data sets, such as email, memos, and survey responses to discover patterns and relationships.
Text mining
A characteristic or quality that describes a particular database entity is called a(n)
attribute
A characteristic or quality that describes a particular database entity is called a(n):
attribute.
The select operation:
creates a subset consisting of all records in the file that meet stated criteria.
The project operation:
creates a subset consisting of columns in a table.
Detecting and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant is called:
data scrubbing.
All of the following are issues with the traditional file environment except:
inability to develop specialized applications for functional areas.
A DBMS makes the
is a database that is stored in multiple physical locations.
A field identified in a table as holding the unique identifier of the table's records is called the:
primary key.
A database ________ describes a database entity.
record
Data ________ occurs when the same data is duplicated in multiple files of a database.
redundancy
In a relational database, the three basic operations used to develop useful sets of data are:
select, project, and join.
A distributed database:
uses SQL. *is a database that is distributed across many business firms. uses predictive analysis. uses Hadoop to process information. is a database that is stored in multiple physical locations.