Mis Final chapter 9
Which of the following statements is true of a data warehouse?
A data warehouse is larger than a data mart.
Which of the following statements is true of unsupervised data mining?
Analysts create hypotheses only after performing an analysis.
Which of the following statements is true of BigData?
BigData refers to data sets that are at least a petabyte in size.
process operational and other data in organizations to analyze past performance and make predictions.
Business intelligence systems
Which of the following statements is true of business intelligence (BI) systems?
Business intelligence systems analyze an organization's past performance to make predictions
is an unsupervised data mining technique in which statistical techniques identify groups of entities that have similar characteristics.
Cluster analysis
is the process of obtaining, cleaning, organizing, relating, and cataloging source data.
Data acquisition
is the application of statistical techniques to find patterns and relationships among data for classification and prediction.
Data mining
techniques emerged from the combined discipline of statistics, mathematics, artificial intelligence, and machine-learning.
Data mining
reports are business intelligence documents that are updated at the time they are requested.
Dynamic
are reports produced when something out of predefined bounds occurs.
Exception reports
refers to the level of detail represented by data.
Granularity
Which of the following statements is true of data with granularity?
Granularity refers to the level of detail represented by the data.
is an open-source program supported by the Apache Foundation that manages thousands of computers and implements MapReduce.
Hadoop
Which of the following statements is true of Hadoop?
Hadoop is an open-source program that implements MapReduce
In the ________ phase, a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest.
Map
Which of the following statements is true of business intelligence (BI) publishing alternatives?
Publishing dynamic BI is more difficult than publishing static content.
requires users to request business intelligence results.
Pull publishing
is the process of delivering business intelligence to users without any request from the users.
Push publishing
The results generated in the Map phase are combined in the ________ phase
Reduce
is used to measure the impact of a set of variables on another variable during data mining.
Regression analysis
is the process of sorting, grouping, summing, filtering, and formatting structured data.
Reporting analysis
are business intelligence documents that are fixed at the time of creation and do not change.
Static reports
are user requests for particular business intelligence results on a particular schedule or in response to particular events.
Subscriptions
A ________ is a data collection that addresses the needs of a particular department or functional area of a business.
data mart
Which of the following is a fundamental category of business intelligence (BI) analysis?
data mining
A ________ is a facility for managing an organization's business intelligence data.
data warehouse
The purpose of a ________ is to extract data from operational systems and other sources, clean the data, and store and catalog that data for processing by business intelligence tools.
data warehouse
Users in a data mart obtain data that pertain to a particular business function from a
data warehouse
Problematic data are referred to as
dirty data
Which of the following problems is particularly common for data that have been gathered over time?
lack of consistency
The ________ of business intelligence servers maintains metadata about the authorized allocation of business intelligence results to users.
management function
The source, format, assumptions, constraints, and other facts concerning certain data are called _
metadata
Which of the following activities in the business intelligence process involves delivering business intelligence to the knowledge workers who need it?
publish results
The goal of ________, a type of business intelligence analysis, is to create information about past performance.
reporting
The use of an organization's operational data as the source data for a BI system is not usually recommended because it
requires considerable processing and can drastically reduce system performance
Which of the following refers to data in the form of rows and columns?
structured data
In the case of ________, data miners develop models prior to conducting analyses and then apply statistical techniques to data to estimate parameters of the models.
supervised data mining
Regression analysis is used in
supervised data mining
The more attributes there are in a sample data, the easier it is to build a model that fits the sample data, but that is worthless as a predictor. Which of the following best explains this phenomenon?
the curse of dimensionality