MIS - Chapter 6
Select
- creates a subset consisting of all records (rows) in the file that meet stated criteria.
physical database available for different logical views
A DBMS makes the A) physical database available for different logical views. B) logical database available for different analytical views. C) physical database available for different analytical views. D) logical database available for different physical views.
minimizing isolated files with repeated data
A DBMS reduces data redundancy and inconsistency by A) enforcing referential integrity B) uncoupling program and data C) utilizing a data dictionary D) minimizing isolated files with repeated data
attribute
A characteristic or quality describing an entity is called a(n) A) field B) tuple C) key field D) attribute
a data mart typically focuses on a single subject area or line of business
A data mart usually can be constructed more rapidly and at lower cost than a data warehouse because A) a data mart typically focuses on a single subject area or line of business B) all the information is historical C) a data mart uses a Web interface D) all of the information belongs to a single company
historic and current internal data
A data warehouse is composed of A) historical data from legacy systems B) current data C) internal and external data sources D) historic and current internal data
key field
A field identified in a record as holding the unique identifier for that record is called the A) primary key B) key field C) primary field D) unique ID
primary key
A field identified in a table as holding the unique identifier of the table's records is called the A) primary key B) key field C) primary field D) unique ID
two short marks
A one-to-one relationship between two entities is symbolized in a diagram by a line that ends with A) two short marks B) one short mark C) a crow's foot D) a crow's foot topped by a short mark
entity-relationship diagram
A schematic for the entire database that describes the relationships in a database is called a(n) A) data dictionary B) intersection relationship diagram C) entity-relationship diagram D) data definition diagram
CGI
An alternative to using application server software for interfacing between a Web server and back-end databases is A) CGI B) HTML C) Java D) SQL
data dictionary
An automated or manual file that stores information about data elements and data characteristics such as usage, physical representation, ownership, authorization, and security is the A) data dictionary B) data definition diagram C) entity-relationship diagram D) relationship dictionary
Data warehouses, data marts, Hadoop, in memory computing, analytical platforms
Array of tools for obtaining useful information form internal and external systems and big data
Example of big datasets
Customer behavior, Weather patterns
Microsoft Access.
DBMS for midrange computers include all of the following EXCEPT A) DB2. B) Oracle. C) Microsoft SQL Server. D) Microsoft Access.
display data in an easier-to-read format
DBMSs typically include report-generating tools in order to A) retrieve and display data B) display data in an easier-to-read format C) display data in graphs D) perform predictive analysis
enforces consistency among different sets of data
Data cleansing not only corrects errors but also A) establishes logical relationships between data B) structures data C) normalizes data D) enforces consistency among different sets of data
find hidden relationships in data
Data mining is a tool for allowing users to A) quickly compare transaction data gathered over many years. B) find hidden relationships in data. C) obtain online answers to ad hoc questions in a rapid amount of time. D) summarize massive amounts of data into much smaller, traditional reports.
data scrubbing
Detectign and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant is called A) data auditing B) defragmentation C) data scrubbing D) data optimization
redundancy
Duplicate data in multiple data files is called data ______ A) redundancy B) repetition C) independence D) partitions
Referential integrity
Example: when one table has a foreign key that points to another table, you may not add a record to the table with foreign key unless there is a corresponding record in the linked table.
tuple
In a relational database, a record is referred to in technical terms as a(n) A) tuple B) row C) entity D) field
select, project, and join
In a relational database, the three basic operations used to develop useful sets of data are A) select, project, and where B) select, join, and where C) select, project, and join D) select, from, and join
row
In a table for customers, the information about a single customer would reside in a single A) field B) row C) column D) table
occurrences linked to a single event
In terms of data relationships, /associations/ refers to A) events linked over time B) patterns that describe a group to which an item belongs C) occurrences linked to a single event D) undiscovered groupings
DBMS for midrange computers.
Microsoft SQL Server is a(n) A) DBMS for small handheld computing devices. B) Internet DBMS. B) Desktop relational DBMS. D) DBMS for midrange computers.
users to obtain online answers to ad-hoc questions in a rapid amount of time
OLAP is a tool for enabling A) users to obtain online answers to ad-hoc questions in a rapid amount of time. B) users to view both logical and physical views of data C) programmers to quickly diagram data relationships D) programmers to normalize data.
DBMS for small handheld computing devices.
Oracle Database Lite is a(n) A) DBMS for small handheld computing devices B) Internet DBMS C) mainframe relational DBMS D) DBMS for midrange computers.
One to one relationship, one to many relationship, many to many relationshiop
Relational database tables may have:
Many to many relationship
Requires join table or intersection relation that links the two tables to join information.
combines relational tables to provide the user with more information than is otherwise available.
The /join/ operation A) combines relational tables to provide the user with more information than is otherwise available. B) identifies the table from which the columns will be selected. C) creates a subset consisting of columns in a table. D) organizes elements in to segments
creates a subset consisting of all records in the file that meet stated criteria.
The /select/ operation A) combines relational tables to provide the user with more information than is otherwise available. B) creates a subset of consisting of columns in a table. C) identifies the table from which the columns will be selected. D) creates a subset consisting of all records in the file that meet stated criteria.
data redundancy
The confusion created by _____ makes it difficult for companies to create customer relationship management, supply chain management, or enterprise systems that integrate data from different sources. A) batch processing B) data redundancy C) data independence D) online processing
creating an inventory of the data elements contained in the database
The data dictionary serves as an important data management tool by A) assigning attributes to the data B) creating an inventory of the data elements contained in the database. C) presenting data as end users or business specialists would perceive them D) maintaining data in updated form
presents data as they would be perceived by end users.
The logical view A) shows how data are organized and structured on the storage media. B) presents an entry screen to the user. C) allows the creation of supplementary reports. D) presents data as they would be perceived by end users.
SQL
The most prominent data manipulation language today is A) Access B) DB2 C) SQL D) Crystal Reports
information policy
The organizations rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information is called a(n) A) information policy B) data definition file C) data quality audit D) data governance policy
normalization
The process of streamlining data to minimize redundancy and awkward many-to-many relationships is called A) normalization B) data scrubbing C) data cleansing D) data defining
creates a subset consisting of columns in a table.
The project operation A) combines relational tables to provide the user with more information than is otherwise available. B) creates a subset consisting of columns in a table. C) organizes elements into segments. D) identifies the table from which the columns will be selected
database administration
The special organizational function whose responsibilities include the technical and operational aspects of managing data, including physical database design and maintenance, is called A) data administration B) database administration C) information policy administration D) data auditing
a data manipulation language
The specialized language programmers use to add and change data in the database is called A) a data access language B) a data manipulation language C) Structured Query Language D) a data definition language
OLAP
The tool that enables users to view the same data in different ways using multiple dimensions is A) predictive analysis B) SQL C) OLAP D) data mining
object-oriented DBMS
The type of database management approach that can handle multimedia is the A) hierarchical DBMS B) relational DBMS C) network DBMS D) object-oriented DBMS
relational DBMS
The type of logical database model that treats data as if they were stored in two-dimensional tables is the A) OODBMS B) pre-digital DBMS C) relational DBMS D) hierarchical DBMS
business intelligence
Tools for analyzing data to help users find patterns, relationships, and insights and make better business decisions are known as A) DSS B) Business intelligence C) OLAP D) Data mining
Nonrelational DBMS
Use more flexible data model, Don't required extensive structuring, Can manage unstructured data, such as social media and graphics, For example: Amazon's SimpleDB.
data inconsistency
Which common database challenge is illustrated by a person receiving multiple copies of an L.L. Bean catalog, each addressed to a slightly different variation of his or her full name? A) data normalization B) data accuracy C) data redundancy D) data inconsistency
the entity CUSTOMER with the attribute PURCHASE
Which of the following best illustrates the relationship between entities and attributes? A)the entity CUSTOMER with the attribute PRODUCT B)the entity CUSTOMER with the attribute PURCHASE C)the entity PRODUCT with the attribute PURCHASE D)the entity product with the attribute CUSTOMER
program-data independence
Which of the following is /not/ one of the main problems with a traditional file environment? A) data inconsistency B) program-data independence C) lack of flexibility in creating ad-hoc reports D) poor security
surveying data definition and query files
Which of the following is not a method for performing a data quality audit? A) surveying entire data files B) surveying samples from data files C) surveying definition and query files D) surveying end users about their perceptions of data quality
user mining
Which of the following is not one of the techniques used in Web mining? A) content mining B) structure mining C) usage mining D) user mining
library card catalog
Which of the following non-digital data storage items is most similar to a database? A) library card catalog B) cash register receipt C) doctor's office invoice D) list of sales totals on a spreadsheet
airport code
You are creating a database to store temperature and wind data from national airport locations. Which of the following field is the most likely candidate to use as the basis for a primary key in the airport table? A) address B) city C) airport code D) state
object-relational DBMS
You are creating a video and animation sharing Web sit whose content will be supplied by content, video, and applets stored in a database and you anticipate very high loads on the server. Which of the following DBMSs will most likely serve your needs? A) object-relational DBMS B) relational DBMS C) hierarchical DBMS D) OODBMS
identifying sequences
You work for a retail chain whose primary outlets are in shopping mall and are conducting an analysis of your customers and their preferences. You wish to find out if there are any particular activities that your customers engage in, or the types of purchases made in the month before or after purchasing select items from your store. To do this, you will want to use the data mining software that is capable of. A) identifying associations B) identifying clusters C) identifying sequences D) classification
classification
You work for an national car rental agency and want to determine what characteristics are shared among your most loyal customers. To do this, you will want to use data mining software that is capable of A) identifying associations B) identifying clusters C) identifying sequences D) classification
Text mining
_____ tools are used to analyze large unstructured data sets, such as e-mail, memos, survey responses, etc., to discover patterns and relationships. A) OLAP B) Text mining C) Web mining D) Web content mining
Data definition
a capability of a DBMS to specify the structure of the content of the database.
Data warehouse
a database that stores current and historical data of potential interest to decision makers throughout the company. Data can be accessed but not altered.
Entity
a generalized category representing a person, place or thing on which information is stored.
Bytes
a group of bits and represents a single character, which can be a letter, number, or another symbol.
File
a group of records
Record
a group of related fields.
Database
a group of related files. The heart of all information systems because they keep track of the people, places, and things that a business must deal with on a continuing, often instant basis.
Field
a grouping of characters into a word, a group of words, or a complete number (such as a person's name or age).
Crystal reports
a popular report generator.
Query
a request for data from a database.
Attribute
a specific characteristic of an entity.
Data mart
a subset of a data warehouse, in which a summarized or highly focused portion of the organization's data is placed in a separate database for a specific population of users.
A database management system
a type of software used to create, store, organize, and access data from a database
Data dictionary
an automated or manual file that stores definitions of data elements and their characteristics.
Hadoop
an open-source software that enables distributed parallel processing of huge amounts of data across inexpensive computers.
Hadoop
breaks a big data problem down into subproblems, distributes them among up to thousands of inexpensive computer processing nodes, then combines the result into a smaller data set that is easier to analyze.
Data definition, data dictionary, querying and reporting
capabilities of a DBMS.
Entity relationship diagram
clarifies the table relationships in a relational database.
Join
combines relational tables to provide the user with more information than is available in individual tables.
Data warehouse
consolidates and standardizes data from many systems, operational and transactional databases.
Project
creates a subset consisting of columns in a table, permitting the user to create new tables that contain only the information required.
Big data
data sets with volumes so huge that they are beyond the ability of typical DBMS to capture, store, and analyze.
Nonrelational database management systems
designed for managing large data sets across many distributed machines and for easily scaling up or down.
In memory computing
eliminates bottlenecks in retrieving and reading data from hard disk based databases.
Foreign key
essentially the look-up field to find data from one table in another table.
Analytic platforms
feature preconfigured hardware-software systems that are specifically designed for query processing and analytics.
Relational database
organizes data into two dimensional tables (called relations) with columns and rows.
The logical view
presents the data as they would be perceived by the users
Big datasets
provide more patterns and insights than smaller datasets
Cloud databases
relational database engines provided by cloud computing services.
In memory computing
relies primarily on a computer's main memory (RAM) for data storage.
Tuples
rows or records in the relational database.
Referential integrity
rules to ensure that relationships between coupled tables remain consistent.
The physical view
shows how the data is organized on the physical storage media
Online analytical processing (OLAP)
supports multidimensional data analysis, enabling users to view the same data in different ways using multiple dimensions.
Report generation
the data of interest can be displayed in a more structured and polished format that would be possible just by querying.
Structured Query Language (SQL)
the most prominent data manipulation language.
Normalization
the process of streamlining complex groups of data to minimize redundant data elements and awkward many to many relationships and increase stability and flexibility.
Bit
the smallest unit of data a computer can handle.
Select, project, join
the three basic operations of a relational database.
Primary key
the unique identifier for all the information in any row of the table, and this primary key cannot be duplicated.
Key field
uniquely identifies each record so that the record can be retrieved, updated, or sorted.
Data manipulation language
used to add, change, delete, and retrieve the data in the database.