Chapter 6
You are creating a database to store temperature and wind data from various airports. Which of the following fields is the most likely candidate to use as the basis for a primary key in the Airport table?
Airport Code
Data Cleansing
Also known as data scrubbing; consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted, or redundant
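As a rough illustration of data cleansing, the sketch below normalizes formatting, sets aside incomplete records, and drops redundant ones. The record layout, the sample values, and the `cleanse` helper are all hypothetical, and real cleansing tools apply far richer rules.

```python
# Hypothetical raw records with typical quality problems.
raw_records = [
    {"name": "  alice smith ", "email": "ALICE@EXAMPLE.COM"},  # improperly formatted
    {"name": "Alice Smith", "email": "alice@example.com"},     # redundant duplicate
    {"name": "Bob Jones", "email": ""},                        # incomplete
    {"name": "carol  white", "email": "carol@example.com "},   # improperly formatted
]


def cleanse(records):
    """Return (clean, rejected): normalized unique records and incomplete ones."""
    clean, rejected, seen = [], [], set()
    for rec in records:
        # Collapse stray whitespace and standardize capitalization.
        name = " ".join(rec["name"].split()).title()
        email = rec["email"].strip().lower()
        if not name or not email:
            rejected.append(rec)  # incomplete: set aside for manual review
            continue
        if email in seen:
            continue  # redundant: this email was already kept
        seen.add(email)
        clean.append({"name": name, "email": email})
    return clean, rejected


clean, rejected = cleanse(raw_records)
print(clean)
print(rejected)
```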
Attribute
Characteristic or quality describing an entity
Program data dependence
Coupling of data stored in files and the specific programs required to update and maintain those files such that changes in programs require changes to the data
Microsoft SQL Server is a(n):
DBMS for midrange computers
Which of the following is a specialized language that programmers use to add and change data in the database?
Data manipulation language
data governance
Deals with the policies and processes for managing the availability, usability, integrity, and security of the data employed in an enterprise, with special emphasis on promoting privacy, security, data quality, and compliance with government regulations
Which of the following is a grouping of characters into a word, a group of words, or a complete number?
Field
In data mining, which of the following involves using a series of existing values to determine what other future values will be?
Forecasting
Which of the following technologies would you use to analyze the social media data collected by a major online retailer?
Hadoop
Hadoop
Handles unstructured and semi-structured data in vast quantities, as well as structured data
Analytic platforms
High-speed platforms using both relational and non-relational tools optimized for large datasets
entity-relationship diagram
Illustrates the relationship between entities
Data mart
Is a subset of a data warehouse in which a summarized or highly focused portion of the organization's data is placed in a separate database for a specific population of users
sentiment analysis
Is able to mine text comments in an email message, blog, social media conversation, or survey form to detect favorable and unfavorable opinions about specific subjects
Database Management System (DBMS)
Is software that permits an organization to centralize data, manage them efficiently, and provide access to the stored data by application programs
The fact that a traditional file system cannot respond to unanticipated information requirements in a timely fashion is an example of which of the following issues with traditional file systems?
Lack of flexibility
Which of the following is a DBMS for desktop computers?
Microsoft Access
Data redundancy
Presence of duplicate data in multiple data files so that the same data are stored in more than one place or location
data mining
Provides insights into corporate data that cannot be obtained with OLAP by finding hidden patterns and relationships in large databases and inferring rules from them to predict future behavior
In-memory computing
Relies on a computer's main memory (RAM) for data storage.
Tuples
Rows, records
Referential Integrity
Rules to ensure that the relationships between coupled tables remain consistent
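A minimal sketch of referential integrity, using Python's built-in sqlite3 module. The airport and reading tables are hypothetical, chosen to echo the airport example at the top of this set. SQLite enforces foreign keys only after `PRAGMA foreign_keys = ON`.

```python
import sqlite3

# Hypothetical schema for illustration: airports and their weather readings.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite checks foreign keys only when enabled
conn.execute("CREATE TABLE airport (code TEXT PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE reading ("
    " id INTEGER PRIMARY KEY,"
    " airport_code TEXT NOT NULL REFERENCES airport(code),"
    " temperature_c REAL)"
)
conn.execute("INSERT INTO airport VALUES ('SEA', 'Seattle-Tacoma')")
conn.execute("INSERT INTO reading (airport_code, temperature_c) VALUES ('SEA', 11.5)")

# A reading that points at a nonexistent airport would break the relationship,
# so the database rejects it.
try:
    conn.execute("INSERT INTO reading (airport_code, temperature_c) VALUES ('XXX', 20.0)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

print(orphan_rejected)
```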
Which of the following is the most prominent data manipulation language today?
SQL
In data mining, which of the following involves events linked over time?
Sequences
Which of the following is not one of the techniques used in web mining?
Server mining
Data definition
Specifies the structure of the content of the database
Data dictionary
Stores definitions of data elements and their characteristics
Online analytical processing
Supports multidimensional data analysis, enabling users to view the same data in different ways using multiple dimensions
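To make "the same data in different ways" concrete, here is a toy sketch in plain Python that totals one set of sales facts along different dimensions. The sales figures, the dimension names, and the `summarize` helper are invented for illustration; real OLAP tools do this at much larger scale.

```python
from collections import defaultdict

# Hypothetical sales facts: (product, region, quarter, units sold).
sales = [
    ("bolts", "East", "Q1", 120),
    ("bolts", "West", "Q1", 80),
    ("nuts", "East", "Q1", 50),
    ("bolts", "East", "Q2", 100),
    ("nuts", "West", "Q2", 70),
]
DIMENSIONS = {"product": 0, "region": 1, "quarter": 2}


def summarize(facts, *dims):
    """Total the units sold, grouped by the chosen dimensions."""
    totals = defaultdict(int)
    for fact in facts:
        key = tuple(fact[DIMENSIONS[d]] for d in dims)
        totals[key] += fact[3]
    return dict(totals)


# The same facts viewed two ways: by region, then by product and quarter.
by_region = summarize(sales, "region")
by_product_quarter = summarize(sales, "product", "quarter")
print(by_region)
print(by_product_quarter)
```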
Information policy
The organization's rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information
database administration
The process of managing a database
Primary Key
The unique identifier for all the information in any row of the table; it cannot be duplicated
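A small sketch of the "cannot be duplicated" rule, using Python's built-in sqlite3 module. The airport table is hypothetical; the point is only that a second row with the same primary key value is refused.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table: the airport code serves as the primary key.
conn.execute("CREATE TABLE airport (code TEXT PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO airport VALUES ('JFK', 'John F. Kennedy International')")

# A second row reusing the same key value is rejected by the database.
try:
    conn.execute("INSERT INTO airport VALUES ('JFK', 'Duplicate entry')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

print(duplicate_rejected)
```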
Non-relational database management systems
Use a more flexible data model and are designed for managing large data sets across many distributed machines and for easily scaling up or down
Data manipulation language
Used to add, change, delete, and retrieve the data in the database
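The four actions map onto SQL's INSERT, UPDATE, DELETE, and SELECT statements. The sketch below runs each once through Python's built-in sqlite3 module against a hypothetical customer table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table for illustration.
conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")

# Add
conn.execute("INSERT INTO customer (id, name, city) VALUES (1, 'Ada', 'London')")
conn.execute("INSERT INTO customer (id, name, city) VALUES (2, 'Grace', 'New York')")
# Change
conn.execute("UPDATE customer SET city = 'Paris' WHERE id = 1")
# Delete
conn.execute("DELETE FROM customer WHERE id = 2")
# Retrieve
rows = conn.execute("SELECT id, name, city FROM customer ORDER BY id").fetchall()
print(rows)
```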
Data inconsistency
When the same attribute has different values
byte
a group of bits: represents a single letter, number, or symbol
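A quick sketch of the bit and byte relationship in Python, assuming ASCII encoding: the letter "A" occupies one byte, which can be written out as eight bits.

```python
letter = "A"
encoded = letter.encode("ascii")  # one character becomes one byte in ASCII
bits = format(encoded[0], "08b")  # the same byte written as eight bits

print(len(encoded), bits)
```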
field
a grouping of characters into a word, a group of words, or a complete number
text mining
analyzes unstructured data to find trends and patterns in words and sentences
All of the following are technologies used to analyze and manage big data except:
cloud computing
The join operation:
combines relational tables to provide the user with more information than is otherwise available
The select operation:
creates a subset consisting of all records in the file that meet stated criteria.
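The select and join operations above can be sketched in SQL through Python's built-in sqlite3 module. The supplier and part tables and their contents are hypothetical. In SQL, the relational select operation corresponds to the WHERE clause, not to the SELECT keyword itself.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical tables and rows for illustration.
conn.executescript(
    """
    CREATE TABLE supplier (supplier_no INTEGER PRIMARY KEY, supplier_name TEXT);
    CREATE TABLE part (part_no INTEGER PRIMARY KEY, part_name TEXT, supplier_no INTEGER);
    INSERT INTO supplier VALUES (8259, 'Acme Hardware'), (8444, 'Bright Glass');
    INSERT INTO part VALUES
        (137, 'Door latch', 8259),
        (145, 'Side mirror', 8444),
        (150, 'Door seal', 8259);
    """
)

# Select operation: the subset of part records that meet a stated criterion.
selected = conn.execute(
    "SELECT part_no, part_name FROM part WHERE supplier_no = 8259 ORDER BY part_no"
).fetchall()

# Join operation: combine the two tables so each part appears with its supplier's name.
joined = conn.execute(
    "SELECT part.part_name, supplier.supplier_name "
    "FROM part JOIN supplier ON part.supplier_no = supplier.supplier_no "
    "ORDER BY part.part_no"
).fetchall()

print(selected)
print(joined)
```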
The term big data refers to all of the following except:
datasets with fewer than a billion records.
DBMSs typically include report generating tools in order to:
display data in an easier-to-read format.
Multiple data marts are combined and streamlined to create a data warehouse
false
key field
field used to uniquely identify each record
file
group of records of the same type
record
group of related fields
A data warehouse is composed of:
historic and current internal data.
All of the following are issues with the traditional file environment except:
inability to develop specialized applications for functional areas.
entity
is a person, place, thing, or event on which we store and maintain information.
A ________ is an example of pre-digital data storage that is comparable to a database
library card catalog
The process of streamlining data to minimize redundancy and awkward many-to-many relationships is called:
normalization.
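As a simplified sketch of what normalization achieves, the plain Python below splits a flat list, in which supplier names repeat on every part, into two smaller structures linked by supplier number. The rows and field names are hypothetical, and real normalization follows formal rules (normal forms) that this toy example does not cover.

```python
# Hypothetical unnormalized rows: supplier details repeat on every part.
flat_rows = [
    {"part_no": 137, "part_name": "Door latch", "supplier_no": 8259, "supplier_name": "Acme Hardware"},
    {"part_no": 145, "part_name": "Side mirror", "supplier_no": 8444, "supplier_name": "Bright Glass"},
    {"part_no": 150, "part_name": "Door seal", "supplier_no": 8259, "supplier_name": "Acme Hardware"},
]

suppliers = {}  # supplier_no -> supplier_name, stored once per supplier
parts = []      # each part keeps only a reference to its supplier

for row in flat_rows:
    suppliers[row["supplier_no"]] = row["supplier_name"]
    parts.append(
        {"part_no": row["part_no"], "part_name": row["part_name"], "supplier_no": row["supplier_no"]}
    )

print(suppliers)
print(parts)
```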
The logical view of a database:
presents data as they would be perceived by end users.
A field identified in a table as holding the unique identifier of the table's records is called the:
primary key
Relational DBMS
represents data as two dimensional tables
data administration
responsible for specific policies and procedures through which data can be managed as a resource
In a table for customers, the information about a single customer resides in a single:
row
database server
stores and provides access to a database
data quality audit
structured survey of the accuracy and completeness of data
Microsoft Access's data dictionary displays all of the following information about a field except the:
organization within the organization that is responsible for maintaining the data.
bit
the smallest unit of data a computer can handle
Database
A collection of data organized to serve many applications efficiently by centralizing the data and controlling redundant data
Data warehouse
A database that stores current and historical data of potential interest to decision-makers throughout the company
Structured Query Language (SQL)
A language used to create and manipulate databases.