Chapter 3
_____ is the term used to describe enormous and complex data collections that traditional data management software, hardware, and analysis processes are incapable of handling.
big data
The position of a _____ is a non-technical position responsible for defining and implementing consistent principles for setting data standards and data definitions that apply across all the databases in an organization.
data administrator
A ______ is a collection of instructions and commands used to define and describe data and relationships in a specific database.
data definition language
At Coast-to-Coast Moving and Van Lines, Ryan is responsible for managing the flow of the company's data, from initial acquisition of new data to deletion of old, outdated data. Ryan is involved in ___________.
data life cycle management
In a relational database model, the range of allowable values for a data attribute is called its _____.
domain
Starting from monthly sales data, if you wish to get weekly sales data, use the ___ feature.
drill down
_____ is an approach that starts by investigating the general data and information needs of an organization at the strategic level and then examines more specific data and information needs for the functional areas and departments within the organization.
enterprising data model
In the _____ process, all of the data from various sources is converted into a single format suitable for processing.
extract
A database is a collection of integrated and related _____.
files
For the ____ operation, it is required that the the two tables have a common attribute.
join
In a relational database model, _____ involves combining two or more tables.
joining
After entering data into a relational database, users can make all of the following basic data manipulations except:
organizing
A(n) _____ is a field or set of fields in a record that uniquely identifies the record.
primary key
_____ is a method of dealing with a situation in which two or more users or applications need to access the same record at the same time.
Concurrency control
With _____, the database is stored on a service provider's server and accessed by the client over a network, typically the Internet.
Database as service
A database administrator (DBA) is an important nontechnical position responsible for defining and implementing consistent principles for a variety of data issues, including setting data standards and data definitions that apply across all the databases in an organization.
FALSE
An attribute is a general class of people, places, or things for which data is collected, stored, and maintained.
FALSE
An enterprise data model is created only for documentation, but is otherwise inconsequential.
FALSE
DBMSs are used primarily for modeling data.
FALSE
Data modeling usually involves setting a specific business goal and then analyzing the data and information needed to reach that goal.
FALSE
In the context of the relational database model, selecting involves combining two or more tables.
FALSE
Organizations can collect many types of data from a wide variety of sources, but typically they only collect structured data that fits neatly into traditional relational database management systems.
FALSE
Suppose a manager wishes to analyze historical trends in sales. He would use the online transaction processing (OLTP) system.
FALSE
The relational model describes data using a standard tabular format; all data elements are placed in three-dimensional tables called relations, which are the logical equivalent of files.
FALSE
Two sets of fields can have a common primary key.
FALSE
Which of the following statements is true of a database?
It helps companies analyze information to open new market opportunities.
_____ is a special-purpose programming language for accessing and manipulating data stored in a relational database.
Standard Query Language (SQL)
A database administrator (DBA) must have a clear understanding of the fundamental business of an organization, be proficient in the use of selected database management systems, and stay abreast of emerging technologies and new design approaches.
TRUE
A database management system (DBMS) serves as an interface between an application program and a database.
TRUE
A database must reflect the business processes of an organization.
TRUE
A database, a database management system, and the application programs that use the data, make up a database environment.
TRUE
A relational model splits the information up into many two dimensional data structures.
TRUE
Carlos is working with an existing database, but he wants to eliminate some of the columns in the table to create a new, more streamlined database. He will use projection to create his new database.
TRUE
For each data item, a data dictionary usually contains information such as the name of the data item, the range of values that can be used, and the amount of storage needed for the item.
TRUE
If "color" is an attribute, then "green" is a data item.
TRUE
In the context of the relational database model, projecting involves eliminating columns in a table.
TRUE
One of the primary advantages of a relational database is that it allows tables to be linked to reduce data redundancy and allow data to be organized more logically.
TRUE
Sometimes a database can contain "bad data," meaning incomplete, incorrect, inaccurate, or irrelevant records, which can be corrected or deleted through data cleansing.
TRUE
The relational database model is currently the most widely used.
TRUE
Two of the major challenges business users face when using big data are finding the information they need to make decisions and knowing that the data they have is valid.
TRUE
Unstructured data comes from sources such as word-processing documents and scientific research.
TRUE
Suppose a new tech startup wants to define the data model. They would then ____.
create the entity-relationship model
To ensure reliability and integrity, SQL databases conform to four specific properties. Which of the following is NOT one of those four properties?
currency
A _____ is a subset of a data warehouse that is used by small- and medium-sized businesses and departments within large companies to support decision making.
data mart
Kyle works for National Sporting Goods, where he manages critical data elements, such as identifying and acquiring new data sources, creating consistent reference data, and analyzing data for quality. Kyle is a _______.
data steward
Each attribute in a relational database model can be constrained to a range of allowable values called a _____.
domain
A(n) _____ is a field or set of fields that uniquely identify a record.
primary key
The action of including only some columns for output while retrieving data is known as ____.
projecting
A collection of fields all related to a single object, activity, or individual is a(n) _____.
record
The ____ model database makes it easy to develop queries that involve joining two tables.
relational
The _____ model has become the most popular database model.
relational
Suppose you wish to find what tables are in a database, what fields they have, and how they are related you would review the _____.
schema
Haley's employer has asked her to review tens of thousands of social media posts about their company's products and compile this data into a database. In terms of the characteristics of big data, Haley is dealing with ________.
variety
You would use Query by Example, if _____.
you wish to use a visual approach to query building