Chapter 6
-it typically focuses on a single subject area or line of business
A data mart usually can be constructed more rapidly and at lower cost than a data warehouse because: -it typically focuses on a single subject area or line of business -all the information is historical -it uses a Web interface -all of the information belongs to a single company
False
A data warehouse is typically comprised of several smaller data marts (T/F)
True
A data warehouse may include information from legacy systems (T/F)
-key field
A field identified in a record as holding the unique identifier for that record is called the: -primary key -key field -primary field -foreign key
-primary key
A field identified in a table as holding the unique identifier of the table's records is called the: -primary key -key field -primary field -foreign key
True
A firm's information policy lays out who is responsible for updating and maintaining the information in a database system (T/F)
True
A foreign key is a field that links to a separate table (T/F)
False
A grouping of characters into a word, a group of words, or a complete number is called a record (T/F)
-with a crow's foot preceded by a short mark
A one-to-many relationship between two entities is symbolized in a diagram by a line that ends: -in two short marks -in one short mark -with a crow's foot -with a crow's foot preceded by a short mark
-in two short marks
A one-to-one relationship between two entities is symbolized in a diagram by a line that ends: -in two short marks -in one short mark -with a crow's foot -with a crow's foot topped by a short mark
True
A physical view shows data as it is organized on the storage media (T/F)
-entity-relationship diagram
A schematic of the entire database that describes the relationships in a database is called a(n): -data dictionary -intersection relationship diagram -entity-relationship diagram -data definition diagram
-intersection relation
A table that links two tables that have a many-to-many relationship is often called a(n): -derived table -intersection relation -foreign table -entity-relationship table
-database server
A(n) ________ is a dedicated computer in a client/server environment that hosts a DBMS. -web server -application server -database server -client server
-data quality audit
A(n) ________ is a structured survey of the accuracy and level of completeness of the data in an information system. -data quality audit -systems analysis -systems audit -data analysis
-DBMS for desktop PC systems
Access is a: -DBMS for small handheld computing devices -popular open-source DBMS -DBMS for midrange computers -DBMS for desktop PC systems
-SQL queries
All of the following are tools or technologies for extracting information from unstructured data sets except: -sentiment analysis software -SQL queries -Hadoop -Non-relational DBMS
-data dictionary
An automated or manual file that stores information about data elements and data characteristics such as usage, physical representation, ownership, authorization, and security is the: -data dictionary -data definition diagram -entity-relationship diagram -relationship dictionary
-library's card-catalog
An example of a pre-digital database is a: -library's card-catalog -cash register receipt -doctor's office invoice -list of sales totals on a spreadsheet
False
An information policy specifies the procedures and rules for defining the structure and content of corporate databases (T/F)
True
Both Oracle and Microsoft Access use SQL to retrieve information from the database (T/F)
-text mining
Businesses use ________ tools to search and analyze unstructured data sets, such as e-mails and memos. -Hadoop -Web mining -text mining -OLAP
True
Common Gateway Interface is a specification for processing data on a Web server (T/F)
True
DBMS simplify how end users work with databases by separating the logical and physical views of the data (T/F)
-display data in an easier-to-read format
DBMS typically include report-generating tools in order to: -retrieve and display data -display data in an easier-to-read format -display data in graphs -perform predictive analysis
True
Data administration is a special organizational function that manages the policies and procedures through which data can be managed as an organizational resource (T/F)
True
Data cleansing is the same process as data scrubbing (T/F)
-find hidden relationships in data
Data mining is a tool for allowing users to: -quickly compare transaction data gathered over many years -find hidden relationships in data -obtain online answers to ad hoc questions in a rapid amount of time -summarize massive amounts of data into much smaller, traditional reports
-discovery driven
Data mining is more ________ than OLAP. -data focused -multidimensional -query oriented -discovery driven
-business intelligence
Tools for consolidating, analyzing, and providing access to vast amounts of data to help users make better business decisions are known as: -DSS -business intelligence -OLAP -data mining
True
Web mining is the discovery of useful patterns on the Web (T/F)
-Relationships between entities
What are the relationships that the relational database is named for? -Relationships between rows and columns -Relationships between entities -Relationships between fields and records -Relationships between databases
-Identify the data needed to run the business
What is the first step you should take in managing data for a firm? -Identify the data needed to run the business -Cleanse the data before importing it to any database -Normalize the data before importing to a database -Audit your data quality
-Structuring the database
What is the purpose of a DBMS's data definition function? -Storing and managing the data in the database -Storing definitions of data elements -Normalizing the database -Structuring the database
-Critical, as without one, your data may be inaccurate, incomplete, or difficult to retrieve
Which of the following best describes the importance of creating an accurate data model for your business's database? -It is not essential, as the data model itself is modified by the database continuously -Critical, as without one, your data may be inaccurate, incomplete, or difficult to retrieve -Essential, as table relationships cannot be created without an accurate data model -Somewhat important, as the type of data model will define what kinds of data will be stored
-The entity CUSTOMER with the attribute ADDRESS
Which of the following best illustrates the relationship between entities and attributes? -The entity CUSTOMER with the attribute PRODUCT -The entity CUSTOMER with the attribute ADDRESS -The entity PRODUCT with the attribute PURCHASE -The entity PRODUCT with the attribute CUSTOMER
-Non-relational DBMS
Which of the following database types would be best suited for storing multimedia? -SQL DBMS -Open-source DBMS -Non-relational DBMS -Cloud-based database
-Query wizard tool
Which of the following is not a typical feature of DBMS? -Data manipulation language -Report generation tools -Data dictionary -Query wizard tool
-Data warehouse systems provide easy-to-use tools for managers to easily update data
Which of the following statements about data warehouses is not true? -They store supply data to be used across the enterprise for management analysis and decision making -Data warehouse systems provide a range of ad hoc and standardized query tools, analytical tools, and graphical reporting facilities -They may include data from Web site transactions -Data warehouse systems provide easy-to-use tools for managers to easily update data
-Web structure mining
Which of the following would you use to find out which Web sites with content related to database design were the most often linked to by other Web sites? -Web usage mining -Web server mining -Web structure mining -Web content mining
-Web usage mining
Which of the following would you use to find patterns in user interaction data recorded by Web servers? -Web usage mining -Web server mining -Web structure mining -Web content mining
-database administration
Within a corporate information services department, the task of creating the physical database and its logical relations are responsibilities of the ________ function. -database administration -data administration -server administration -data modeling
-one-to-one
You are working in the IT department of a small paper supply company and planning a new database that monitors employee benefits and productivity. What would be the relationship you need to establish between Employee_ID in the Employee table and Parking_spot_number in the Perks table? -one-to-many -one-to-one -many-to-many -many-to-one
-data scrubbing
Detecting and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant is called: -data auditing -defragmentation -data scrubbing -data optimization
True
End users use a data manipulation language to manipulate data in the database (T/F)
True
Every record in a file should contain at least one key field (T/F)
True
Hadoop can process large quantities of any type of data, including video and Twitter feeds (T/F)
-open-source software framework designed for distributing data-processing over inexpensive computers
Hadoop is a(n): -NoSQL database technology that stores both data and procedures acting on the data as objects -data mining platform designed for storing and analyzing big data produced by Web communications and transactions -open-source software framework designed for distributing data-processing over inexpensive computers -pre-configured hardware-software system designed for analyzing big data
False
Implementing a Web interface for a database requires changes to the database itself (T/F)
-tuple
In a relational database, a record is also called a(n): -tuple -row -entity -field
-select, project, and join
In a relational database, the three basic operations used to develop useful sets of data are: -select, project, and where -select, join, and where -select, project, and join -select, from, and join
-row
In a table for customers, the information about a single customer would reside in a single: -field -row -column -table
-find new groupings within data
In clustering, a data mining tool will: -find new groupings within data -find related predictions from existing values -find several events grouped by time -find new associations
False
In linking databases to the Web, the role of the application server is to host the DBMS (T/F)
-occurrences linked to a single event
In terms of the data relationships found by data mining, associations refers to: -events linked over time -patterns that describe a group to which an item belongs -occurrences linked to a single event -undiscovered groupings
-events linked over time
In terms of the data relationships found by data mining, sequences refers to: -events linked over time -patterns that describe a group to which an item belongs -occurrences linked to a single event -undiscovered groupings
True
In-memory computing relies on a computer's main memory for storing data (T/F)
-data redundancy
The confusion created by ________ makes it difficult for companies to create customer relationship management, supply chain management, or enterprise systems that integrate data from different sources. -batch processing -data redundancy -data independence -online processing
-presents data as they would be perceived by end users
The logical view of a database: -shows how data are organized and structured on the storage media -presents an entry screen to the user -allows the creation of supplementary reports -presents data as they would be perceived by end users
-five tables: for customers, employees, suppliers, parts, and sales.
The most basic business database is comprised of: -three tables: for suppliers, parts, and sales -four tables: for customers, suppliers, parts, and sales -four tables: for customers, suppliers, sales, and employees -five tables: for customers, employees, suppliers, parts, and sales
-SQL
The most prominent data manipulation language today is: -Access -DB2 -SQL -Crystal Reports
-normalization
The process of streamlining data to minimize redundancy and awkward many-to-many relationships is called: -normalization -data scrubbing -data cleansing -data administration
-creates a subset consisting of columns in a table
The project operation: -combines relational tables to provide the user with more information than is otherwise available -creates a subset consisting of columns in a table -organizes elements into segments -identifies the table from which the columns will be selected
-creates a subset consisting of all records in the file that meet stated criteria
The select operation: A) combines relational tables to provide the user with more information than is otherwise available creates a subset consisting of columns in a table -identifies the table from which the columns will be selected -creates a subset consisting of all records in the file that meet stated criteria
-OLAP
The tool that enables users to view the same data in different ways using multiple dimensions is: -Hadoop -SQL -OLAP -data mining
False
The two key services of Hadoop are HDFS and in-memory computing (T/F)
-relational DBMS.
The type of logical database model that treats data as if they were stored in two-dimensional tables is the: -two-tiered DBMS -pre-digital DBMS -relational DBMS -hierarchical DBMS
False
The use of different terms for identifying data in a firm's various information systems is an example of redundant data (T/F)
True
A DBMS is special software for creating, storing, organizing, and accessing data from a database (T/F)
-physical database available for different logical views
A DBMS makes the: -physical database available for different logical views -logical database available for different analytical views -physical database available for different relational views -relational database available for different physical views
-attribute.
A characteristic or quality describing an entity is called a(n): -field -tuple -key field -attribute
-warehouse
A data ________ stores current and historical data of potential interest to decision makers throughout the company. -warehouse -mart -archive -mine
-NoSQL databases
MongoDB and SimpleDB are both examples of: -Open-source databases -SQL databases -NoSQL databases -Cloud databases
False
Most back-end databases are able to interpret HTML commands (T/F)
-popular open-source DBMS
MySQL is a: -DBMS for small handheld computing devices -popular open-source DBMS -mainframe relational DBMS -DBMS for desktop systems
True
OLAP is a key tool of BI (T/F)
-users to obtain online answers to ad hoc questions in a rapid amount of time
OLAP is a tool for enabling: -users to obtain online answers to ad hoc questions in a rapid amount of time -users to view both logical and physical views of data -programmers to quickly diagram data relationships -programmers to normalize data
False
OLAP is used to find hidden patterns and relationships in large databases, and infer rules from these to infer future behavior (T/F)
-Analytic platforms
Pre-configured hardware-software systems that use both relational and non-relational technology optimized for analyzing large datasets are referred to as: -Hybrid DBMS -Hadoop -BI -Analytic platforms
-referential integrity rules
Relational database systems use ________ to ensure that relationships between coupled tables remain consistent. -intersection relations -referential integrity rules -entity-relationship diagrams -foreign keys