Chapter 6
Which of the following best describes a data dictionary?
A data dictionary is an automated or manual file for storing and organizing information about the data maintained in a database.
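As a rough illustration, an automated data dictionary can be read straight from the DBMS. The sketch below uses Python's built-in `sqlite3` module; the `student` table and its columns are invented for the example, and `PRAGMA table_info` is SQLite's own way of exposing this metadata (other DBMS use different catalog views).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE student ("
    " student_id INTEGER PRIMARY KEY,"
    " name TEXT NOT NULL,"
    " age INTEGER)"
)

# PRAGMA table_info returns one row per column:
# (cid, name, type, notnull, dflt_value, pk)
dictionary = [
    {"field": name, "type": col_type, "required": bool(notnull), "key": bool(pk)}
    for _cid, name, col_type, notnull, _default, pk in conn.execute(
        "PRAGMA table_info(student)"
    )
]

for entry in dictionary:
    print(entry)
```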
Which of the following best describes a data manipulation language?
A data manipulation language is used to add, change, delete, and retrieve the data in the database.
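A minimal sketch of the four operations, using SQL as the data manipulation language through Python's built-in `sqlite3` module. The `student` table and its rows are hypothetical; the `CREATE TABLE` line is data definition, not manipulation, and is there only to set up the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (student_id INTEGER PRIMARY KEY, name TEXT)")

# Add
conn.execute("INSERT INTO student VALUES (100, 'Ana')")
conn.execute("INSERT INTO student VALUES (101, 'Ben')")
# Change
conn.execute("UPDATE student SET name = 'Benjamin' WHERE student_id = 101")
# Delete
conn.execute("DELETE FROM student WHERE student_id = 100")
# Retrieve
dml_rows = conn.execute("SELECT student_id, name FROM student").fetchall()
print(dml_rows)  # [(101, 'Benjamin')]
```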
Microsoft's DBMS that includes built-in report generation and reporting is called ________.
Access
__________ is a method for documenting databases illustrating the relationship between various entities in the database.
An entity relationship diagram
A person's name or age is called a ________.
Field
For handling unstructured and semi-structured data in vast quantities, as well as structured data, organizations are using _____________.
Hadoop
__________ breaks a big data problem down into sub-problems, distributes them among up to thousands of inexpensive computer processing nodes, and then combines the result into a smaller data set that is easier to analyze.
Hadoop
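The break-down-and-combine idea can be sketched in plain Python. This is not Hadoop's API: the `map_chunk` and `combine` functions and the sample chunks are made up for illustration, and each chunk stands in for the work a separate processing node would do.

```python
from collections import Counter
from functools import reduce


def map_chunk(lines):
    """Sub-problem: count the words in one chunk of the data."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def combine(left, right):
    """Merge two partial results into one smaller summary."""
    return left + right


# Each inner list plays the role of the data handed to one node.
chunks = [["big data big"], ["data nodes"], ["big nodes nodes"]]

partials = [map_chunk(chunk) for chunk in chunks]
total = reduce(combine, partials, Counter())
print(total)
```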
________ is the most popular NoSQL DBMS.
Hadoop
What makes a Blockchain system possible and attractive to business firms?
Its encryption and authentication processes make it highly secure for storing many kinds of data.
The SQL ________ statement allows you to combine two different tables.
JOIN
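For example, a join can match each part to its supplier. The sketch below runs the SQL through Python's built-in `sqlite3` module; the `supplier` and `part` tables and their contents are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE supplier (supplier_id INTEGER PRIMARY KEY, supplier_name TEXT);
    CREATE TABLE part (part_id INTEGER PRIMARY KEY, part_name TEXT, supplier_id INTEGER);
    INSERT INTO supplier VALUES (1, 'Acme'), (2, 'Bolt Co');
    INSERT INTO part VALUES (10, 'Door latch', 1), (11, 'Hinge', 2), (12, 'Seal', 1);
    """
)

# Combine the two tables on the column they share.
joined = conn.execute(
    "SELECT part.part_name, supplier.supplier_name "
    "FROM part JOIN supplier ON part.supplier_id = supplier.supplier_id "
    "ORDER BY part.part_id"
).fetchall()

for part_name, supplier_name in joined:
    print(part_name, "<-", supplier_name)
```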
An emerging non-relational database management system is called ________.
NoSQL
Which of the following is the process of creating small, stable, yet flexible data structures from complex groups of data when designing a relational database?
Normalization
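A small sketch of the idea, assuming a made-up flat order list in which customer details repeat on every row. Splitting it into two structures stores each customer's city once, so a change of city touches a single entry.

```python
# Unnormalized: the customer's city is repeated on every order row.
flat_orders = [
    {"order_id": 1, "customer": "Ana", "customer_city": "Lima", "item": "Helmet"},
    {"order_id": 2, "customer": "Ana", "customer_city": "Lima", "item": "Pump"},
    {"order_id": 3, "customer": "Ben", "customer_city": "Oslo", "item": "Lock"},
]

customers = {}  # one entry per customer, keyed by customer name
orders = []     # one entry per order, pointing back to the customer

for row in flat_orders:
    customers[row["customer"]] = {"city": row["customer_city"]}
    orders.append(
        {"order_id": row["order_id"], "customer": row["customer"], "item": row["item"]}
    )

print(customers)
print(orders)
```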
__________ refers to the coupling of data stored in files and the specific programs required to update and maintain those files such that changes in programs require changes to the data.
Program-data dependence
SQL stands for ________.
Structured Query Language
Which of the following is not a step a firm might take to make sure it has a high level of data quality?
Using in-memory computing; Using data mining
Another name for a database field is a(n) ________.
attribute
To describe datasets with volumes so huge that they are beyond the ability of typical DBMS to capture, store, and analyze, the term __________ is used.
big data
A ________ is the smallest component in the computer system data hierarchy.
bit
Which of the following is a distributed database technology that enables firms and organizations to create and verify transactions on a peer-to-peer network nearly instantaneously without a central authority?
blockchain
A ________ is a component in the computer system data hierarchy that represents an ASCII character like a letter, a number, or another symbol.
byte
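A quick check of the bit and byte cards in Python: one ASCII character occupies one byte, and that byte is made of eight bits. The variable names are only for illustration.

```python
char = "A"
encoded = char.encode("ascii")     # the character as raw bytes
bits = format(encoded[0], "08b")   # the single byte written as 8 bits

print(len(encoded))  # 1
print(bits)          # 01000001
```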
A(n) ________ is a collection of data organized to serve many applications efficiently.
database
A ________ is software that enables an organization to centralize data, manage them efficiently, and provide access to the stored data by application programs.
database management system (DBMS)
DBMS have a data ________ capability that is used to specify the structure of the content of the database.
definition
Organizations perform data quality audits to __________.
determine the accuracy and level of completeness of data
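The completeness half of an audit can be sketched as a simple fill-rate per field. The records and the `completeness` helper below are hypothetical; checking accuracy would additionally require comparing values against a trusted source, which this sketch does not attempt.

```python
records = [
    {"student_id": 100, "course": "IS 101", "grade": "A"},
    {"student_id": 101, "course": "IS 101", "grade": None},
    {"student_id": 102, "course": "", "grade": "B"},
    {"student_id": 103, "course": "IS 101", "grade": "C"},
]


def completeness(rows, field):
    """Return the share of rows in which the field is filled in."""
    filled = sum(1 for row in rows if row.get(field) not in (None, ""))
    return filled / len(rows)


report = {field: completeness(records, field) for field in ("student_id", "course", "grade")}
print(report)
```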
Data redundancy occurs when ________.
different groups in an organization independently collect the same piece of data and store it independently of each other
Data mining techniques can be used to ________.
find patterns in data to help managers create forecasts
A ________ provides the link from one table to another table's primary key.
foreign key
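A minimal sketch of a foreign key at work, using Python's built-in `sqlite3` module. The `supplier` and `part` tables are hypothetical, and SQLite only enforces foreign keys after `PRAGMA foreign_keys = ON`; other DBMS enforce them by default.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.execute("CREATE TABLE supplier (supplier_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE part ("
    " part_id INTEGER PRIMARY KEY,"
    " part_name TEXT,"
    " supplier_id INTEGER REFERENCES supplier(supplier_id))"  # the foreign key
)

conn.execute("INSERT INTO supplier VALUES (1, 'Acme')")
conn.execute("INSERT INTO part VALUES (10, 'Hinge', 1)")  # supplier 1 exists

try:
    # Supplier 99 does not exist, so the link cannot be made.
    conn.execute("INSERT INTO part VALUES (11, 'Seal', 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

part_count = conn.execute("SELECT COUNT(*) FROM part").fetchone()[0]
print(rejected, part_count)
```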
The term data ________ is used to describe the policies and processes for managing the availability, usability, integrity, and security of the data employed in an enterprise.
governance
Text mining tools _________.
help businesses analyze data by extracting key elements from unstructured big data sets, discover patterns and relationships, and summarize the information
The organization's rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information are specified by __________.
information policies
A(n) ________ is an important resource for an organization to create, illustrating rules on how data is organized and who has access to what data.
information policy
The formal process to remove data problems such as repeating groups is called ________.
normalization
Although "Big Data" doesn't refer to a specific amount of data, it usually does involve ________.
petabytes
Another term for a key field is a ________.
primary key
The field that uniquely identifies a row in a relational database table is called a ________.
primary key
Another term for a database row is a(n) ________.
record
Microsoft SQL Server is a ________ for large mainframes and midrange computers.
relational DBMS
What is data administration?
Data administration is responsible for information policy, as well as for data planning, data dictionary development, and monitoring data usage in the firm.
Which of the following consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted, or redundant?
Data cleansing
__________ is a DBMS capability that specifies the structure and content of the database.
Data definition
The sales, manufacturing, and inventory systems at a bicycle manufacturer use different codes to represent bicycle frame sizes. For example, in the sales system, a large bike frame is coded with "L", while the inventory system codes a large bike frame with "LG". This situation illustrates ________.
data inconsistency
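One way to resolve this kind of inconsistency is to map every system's code onto a single standard value. In the sketch below only "L" and "LG" come from the example above; the "M" and "MD" codes, the standard names, and the `standardize` helper are assumptions for illustration.

```python
# Map each system-specific code to one agreed standard value.
FRAME_SIZE_CODES = {
    "L": "LARGE",    # sales system
    "LG": "LARGE",   # inventory system
    "M": "MEDIUM",   # hypothetical
    "MD": "MEDIUM",  # hypothetical
}


def standardize(code):
    """Translate a system-specific frame-size code to the standard value."""
    return FRAME_SIZE_CODES[code.strip().upper()]


sales_code, inventory_code = "L", "LG"
print(standardize(sales_code) == standardize(inventory_code))  # True
```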
Data cleansing, also known as _________, consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted, or redundant.
data scrubbing
Data cleansing, working to identify and fix incorrect data, is also called ________.
data scrubbing
A ________ is a database that stores current and historical data of potential interest to decision makers throughout the company.
data warehouse
A(n) ________ database makes it possible to store information across millions of machines in hundreds of data centers around the globe, with special time-keeping tools to synchronize the data and ensure the data are always consistent.
distributed
A ________ is a component in the computer system data hierarchy that includes content such as Student ID 100, Course IS 101, and Grade A.
field
A ________ is a component in the computer system data hierarchy that consists of multiple records of the same type.
file
In the most popular type of DBMS, data is represented as two-dimensional tables called ________.
relations
An information policy __________.
specifies the organization's rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information
Tools that help businesses analyze data by extracting key elements from unstructured big data sets, discover patterns and relationships, and summarize the information are called __________ tools.
text mining
The most popular type of database today for PCs as well as for larger computers and mainframes is ___________ database.
the relational
A(n) ________ is the most popular method to model a database.
entity-relationship (ER) model
A record describes a(n) ________, which is a person, place, thing, or event on which we store and maintain information. Each characteristic or quality describing a particular ________ is called a(n) ________.
entity; entity; attribute