chapter 6
MongoDB and SimpleDB are both examples of:
NoSQL databases.
________ tools are used to analyze large unstructured data sets, such as e-mail, memos, and survey responses to discover patterns and relationships.
Text mining
In a relational database, a record is referred to in technical terms as a(n):
tuple.
OLAP enables:
users to obtain online answers to ad-hoc questions in a rapid amount of time.
Which of the following enables a DBMS to reduce data redundancy and inconsistency?
Ability to minimize isolated files with repeated data
You are creating a database to store temperature and wind data from various airports. Which of the following fields is the most likely candidate to use as the basis for a primary key in the Airport table?
Airport code
Which of the following is software that handles all application operations between browser-based computers and a company's back-end business applications or databases?
Application server software
Which of the following enables you to create a script that allows a web server to communicate with a back-end database?
CGI
In data mining, which of the following involves recognizing patterns that describe the group to which an item belongs by examining existing items and inferring a set of rules?
Classifications
You work for a car rental agency and want to determine what characteristics are shared among your most loyal customers. To do this, you will want to use the data mining software you are using to do which of the following?
Classify data
Microsoft SQL Server is a(n):
DBMS for midrange computers.
In a large organization, which of the following functions would be responsible for physical database design and maintenance?
Data administration
In a large organization, which of the following functions would be responsible for policies and procedures for managing internal data resources?
Data administration
Which of the following is an automated or manual file that stores information about data elements and data characteristics such as usage, physical representation, ownership, authorization, and security?
Data dictionary
)Which common database challenge is illustrated by a person receiving multiple copies of an L.L. Bean catalog, each addressed to a slightly different variation of his or her full name?
Data inconsistency
Which of the following occurs when the same attribute in related data files has different values?
Data inconsistency
Which of the following is a specialized language that programmers use to add and change data in the database?
Data manipulation language
________ creates confusion that hampers the creation of information systems that integrate data from different sources.
Data redundancy
Which of the following is a grouping of characters into a word, a group of words, or a complete number?
Field
In data mining, which of the following involves using a series of existing values to determine what other future
Forecasting
Which of the following technologies would you use to analyze the social media data collected by a major online retailer?
Hadoop
You work for a retail clothing chain whose primary outlets are in shopping malls and are conducting an analysis of your customers and their preferences. You wish to find out if there are any particular activities that your customers engage in, or the types of purchases made in the month before or after purchasing select items from your store. To do this, you will want to use the data mining software you are using to do which of the following?
Identify sequences
The fact that a traditional file system cannot respond to unanticipated information requirements in a timely fashion is an example of which of the following issues with traditional file systems?
Lack of flexibility
Which of the following is a DBMS for desktop computers?
Microsoft Access
Which of the following tools enables users to view the same data in different ways using multiple dimensions?
OLAP
Which of the following is the most prominent data manipulation language today?
SQL
A household appliances manufacturer has hired you to help analyze its social media datasets to determine which of its refrigerators are seen as the most reliable. Which of the following tools would you use to analyze this data?
Sentiment analysis software
In data mining, which of the following involves events linked over time?
Sequences
Which of the following is not one of the techniques used in web mining?
Server mining
Which of the following is not a method for performing a data quality audit?
Surveying data definition and query files
Which of the following best illustrates the relationship between entities and attributes?
The entity CUSTOMER with the attribute PURCHASE
Which of the following would you use to find patterns in user interaction data recorded by a web server?
Web usage mining
A one-to-many relationship between two entities is symbolized in a diagram by a line that ends with:
a crow's foot topped by a short mark.
inability to develop specialized applications for functional areas.
attribute.
All of the following are technologies used to analyze and manage big data except:
cloud computing.
The join operation:
combines relational tables to provide the user with more information than is otherwise available.
The select operation:
creates a subset consisting of all records in the file that meet stated criteria.
The project operation
creates a subset consisting of columns in a table.
Detecting and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant is called:
data scrubbing.
The term big data refers to all of the following except:
datasets with fewer than a billion records.
DBMSs typically include report generating tools in order to:
display data in an easier-to-read format.
Data cleansing not only corrects errors but also:
enforces consistency among different sets of data.
A schematic of the entire database that describes the relationships in a database is called a(n):
entity-relationship diagram.
Data mining allows users to:
find hidden relationships in data.
A data warehouse is composed of:
historic and current internal data.
)A one-to-one relationship between two entities is symbolized in a diagram by a line that ends:
in two short marks.
An organization's rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information is called a(n):
information policy.
A field identified in a record as holding the unique identifier for that record is called the:
key field.
A ________ is an example of pre-digital data storage that is comparable to a database.
library card catalog
The process of streamlining data to minimize redundancy and awkward many-to-many relationships is called
normalization.
In the context of data relationships, the term associations refers to:
occurrences linked to a single event.
A DBMS makes the:
physical database available for different logical views.
The logical view of a database:
presents data as they would be perceived by end users.
A field identified in a table as holding the unique identifier of the table's records is called the:
primary key.
Data ________ occurs when the same data is duplicated in multiple files of a database.
redundancy
A(n) ________ represent data as two-dimensional tables.
relational DBMS
In a table for customers, the information about a single customer resides in a single:
row
In a relational database, the three basic operations used to develop useful sets of data are:
select, project, and join.
Microsoft Access's data dictionary displays all of the following information about a field except the:
the organization within the organization that is responsible for maintaining the data.