CH 9


A

A ________ is a data collection, smaller than the data warehouse, that addresses the needs of a particular department or functional area of a business. A) data mart B) data room C) datasheet D) dataspace

C

A ________ is a facility for managing an organization's business intelligence data. A) datasheet B) dataspace C) data warehouse D) data table

D

A ________ is designed to extract data from operational systems and other sources, clean the data, and store and catalog that data for processing by business intelligence tools. A) data mart B) data center C) data room D) data warehouse

D

In the ________ phase, a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest. A) crash B) break C) reduce D) map

B

In the case of ________, data miners develop models prior to conducting analyses and then apply statistical techniques to data to estimate parameters of the models. A) pull publishing techniques B) supervised data mining C) push publishing techniques D) unsupervised data mining

D

Problematic data are termed ________. A) random data B) macro data C) vague data D) dirty data

C

Regression analysis is used in ________. A) progress reporting B) bug reporting C) supervised data mining D) unsupervised data mining

D

The ________ of business intelligence servers maintains metadata about the authorized allocation of business intelligence results to users. A) exception report B) dynamic report C) delivery function D) management function

C

The goal of ________, a type of business intelligence analysis, is to create information about past performance. A) push publishing B) data mining C) reporting analyses D) BigData

B

The more attributes there are in sample data, the easier it is to build a model that fits the sample data but is worthless as a predictor. Which of the following best explains this phenomenon? A) the free rider problem B) the curse of dimensionality C) the tragedy of the commons D) the zero-sum game

C

The results generated in the map phase are combined in the ________ phase. A) pig B) control C) reduce D) construct
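The map and reduce phases described in the two MapReduce cards can be illustrated with a small word count. This is only a sketch on a single machine, assuming three tiny text chunks; the names `map_phase`, `reduce_phase`, and `chunks` are hypothetical, and a real Hadoop job would spread the chunks across many independent processors.

```python
from collections import Counter
from functools import reduce

def map_phase(chunk):
    """Process one piece of the collection independently of the others."""
    return Counter(chunk.lower().split())

def reduce_phase(partials):
    """Combine the partial results produced by the map phase."""
    return reduce(lambda left, right: left + right, partials, Counter())

# Hypothetical data: each string stands in for one piece of a large collection.
chunks = ["big data big", "data mining", "big reports"]

partials = [map_phase(chunk) for chunk in chunks]  # map: one result per piece
totals = reduce_phase(partials)                    # reduce: merge the results
```

Each call to `map_phase` needs only its own chunk, which is what lets the map work run in parallel before the reduce step merges the results.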

A

The source, format, assumptions and constraints, and other facts concerning certain data are called ________. A) metadata B) data structures C) microdata D) network packets

C

The use of an organization's operational data as the source data for a business intelligence system is not usually recommended because it ________. A) is not possible to create reports based on operational data B) is not possible to perform business intelligence analyses on operational data C) requires considerable processing and can drastically reduce system performance D) considers only the external data and not the internal data regarding the organization's

D

Users in a data mart obtain data that pertain to a particular business function from a ________. A) data room B) data center C) datasheet D) data warehouse

C

Which of the following activities in the business intelligence process involves delivering business intelligence to the knowledge workers who need it? A) data acquisition B) BI analysis C) publish results D) data mining

B

Which of the following is a fundamental category of business intelligence (BI) analysis? A) data acquisition B) reporting C) push publishing D) pull publishing

C

Which of the following problems is particularly common for data that have been gathered over time? A) wrong granularity B) lack of integration C) lack of consistency D) missing values

B

Which of the following refers to data in the form of rows and columns? A) granulated data B) structured data C) micro data D) coarse data

D

Which of the following statements is true of BigData? A) BigData contains only structured data. B) BigData has low velocity and is generated slowly. C) BigData cannot store graphics, audio, and video files. D) BigData refers to data sets that are at least a petabyte in size

C

Which of the following statements is true of Hadoop? A) Hadoop is written in C++ and runs on Linux. B) Hadoop includes a query language called Big. C) Hadoop is an open source program that implements MapReduce. D) Technical skills are not required to run and use Hadoop

A

Which of the following statements is true of a data warehouse? A) A data warehouse is larger than a data mart. B) A data warehouse functions like a retail store in a supply chain. C) Users in a data warehouse obtain data pertaining to a business function from a data mart. D) Data analysts who work with a data warehouse are experts in a particular business function

B

Which of the following statements is true of business intelligence (BI) publishing alternatives? A) The skills required to publish static content are extremely high. B) It is more difficult to publish dynamic BI than to publish static content. C) The skills required to create a publishing application for static content are high. D) Push options for Web servers are manual

D

Which of the following statements is true of business intelligence (BI) systems? A) Business intelligence systems are primarily used for developing software systems and data mining applications. B) The four standard components of business intelligence systems are software, procedures, applications, and programs. C) The software component of a business intelligence system is called an intelligence database. D) Business intelligence systems analyze an organization's past performance to make predictions.

A

Which of the following statements is true of data with granularity? A) It can be too fine or too coarse and also have wrong granularity. B) If granularity is too coarse, data can be made finer by summing and combining. C) It is not possible for data to have the wrong granularity. D) If granularity is too coarse, data can be separated into constituent parts using regression
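As a rough illustration of granularity, the sketch below sums data that is too fine (individual click events) into coarser daily counts. The click records and the `daily_counts` helper are hypothetical. The reverse direction is not possible: once only the daily totals are kept, the individual clicks cannot be recovered from them.

```python
from collections import defaultdict

# Hypothetical fine-grained data: one (day, page) record per click.
clicks = [
    ("2024-01-01", "home"),
    ("2024-01-01", "cart"),
    ("2024-01-02", "home"),
]

def daily_counts(events):
    """Make fine data coarser by summing clicks per day."""
    counts = defaultdict(int)
    for day, _page in events:
        counts[day] += 1
    return dict(counts)

daily = daily_counts(clicks)
```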

B

Which of the following statements is true of unsupervised data mining? A) Analysts apply unsupervised data mining techniques to estimate the parameters of a developed model. B) Analysts create hypotheses only after performing an analysis. C) Regression analysis is the most commonly used unsupervised data mining technique. D) Data miners develop models prior to performing an analysis

C

_______ are business intelligence documents that are fixed at the time of creation and do not change. A) Critical reports B) Dynamic reports C) Static reports D) Exception reports

D

_______ are business intelligence documents that are updated at the time they are requested. A) Subscriptions B) Third-party cookies C) Static reports D) Dynamic reports

A

_______ are user requests for particular business intelligence results on a particular schedule or in response to particular events. A) Subscriptions B) Third-party cookies C) Static reports D) Dynamic reports

A

_______ is an unsupervised data mining technique in which statistical techniques identify groups of entities that have similar characteristics. A) Cluster analysis B) Content indexing C) Regression analysis D) Cloud computing

C

_______ is the application of statistical techniques to find patterns and relationships among data for classification and prediction. A) Data encryption B) Data warehousing C) Data mining D) Data decryption

D

_______ is the process of sorting, grouping, summing, filtering, and formatting structured data. A) Push publishing B) Publish results C) Cloud computing D) Reporting analysis
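The five reporting operations named in this card (sorting, grouping, summing, filtering, and formatting) can be sketched on a small structured data set. The `sales` records, the `region_report` function, and the `min_total` threshold are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical structured data: rows with a region and a sale amount.
sales = [
    {"region": "East", "amount": 120.0},
    {"region": "West", "amount": 80.0},
    {"region": "East", "amount": 45.5},
    {"region": "North", "amount": 10.0},
]

def region_report(rows, min_total=50.0):
    """Group by region, sum amounts, filter small totals, sort, and format."""
    totals = defaultdict(float)
    for row in rows:  # grouping and summing
        totals[row["region"]] += row["amount"]
    kept = {r: t for r, t in totals.items() if t >= min_total}  # filtering
    ordered = sorted(kept.items(), key=lambda item: item[1], reverse=True)  # sorting
    return [f"{region:<6}{total:>10.2f}" for region, total in ordered]  # formatting

report = region_report(sales)
```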

B

_______ requires users to request business intelligence results. A) Push publishing B) Pull publishing C) Data acquisition D) Data mining

A

________ are reports produced when something out of predefined bounds occurs. A) Exception reports B) Static reports C) Dynamic reports D) Subscription reports

A

________ is an open source program supported by the Apache Foundation that manages thousands of computers and that implements MapReduce. A) Hadoop B) BigData C) Linux D) Apache Wave

A

________ is the process of delivering business intelligence to users without any request from the users. A) Push publishing B) Pull publishing C) Data acquisition D) Data mining

D

________ is the process of obtaining, cleaning, organizing, relating, and cataloging source data. A) Data manipulation B) BI analysis C) Publish results D) Data acquisition

D

________ is used to measure the impact of a set of variables on another variable during data mining. A) Cluster analysis B) Context indexing C) Cloud computing D) Regression analysis
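A minimal sketch of how regression measures the impact of one variable on another, using ordinary least squares for a single predictor. The `fit_line` helper and the `ad_spend` and `revenue` figures are hypothetical; the slope estimates how much the outcome changes per unit change in the predictor.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical sample: advertising spend and resulting revenue.
ad_spend = [1.0, 2.0, 3.0, 4.0]
revenue = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(ad_spend, revenue)
```

In this made-up sample the points lie exactly on a line, so the fitted parameters reproduce it. Real data would leave residual error.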

C

________ process operational and other data in organizations to analyze past performance and make predictions. A) Virtualization techniques B) Live migration techniques C) Business intelligence systems D) Windowing systems

B

________ refers to the level of detail represented by data. A) Abstraction B) Granularity C) Dimensionality D) Aggregation

C

________ techniques emerged from the combined disciplines of statistics, mathematics, artificial intelligence, and machine learning. A) Push publishing B) Pull publishing C) Data mining D) Exception reporting

