Chapter 8 & Appendix J exam review
Excel cannot import Access data directly into a PivotTable report, but must first place the data into a worksheet.
false
In a common form of RFM analysis, customers with an R score of 5 are in the 20% of customers who have the most recent orders.
false
Operational databases contain a fact table.
false
Report delivery is more difficult for data mining than it is for reporting systems.
false
Reports that do not change once prepared are called static reports.
true
The term drill down refers to the capability of seeing the data in smaller and smaller units.
true
IBM defines big data in terms of the Four V's, in which veracity refers to the:
uncertainty of data
BI reporting systems summarize the current status of business activities and compare that status with past events but not with predicted future activities.
false
Business Intelligence (BI) systems do which of the following?
Analyze current and past activities and Predict future events
Non-relational DBMSs associated with the NoSQL movement include:
Key-Value, Document, Column Family, Graph.
A Business Intelligence (BI) reporting system that uses extensions to SQL is:
OLAP
To arrange the PivotTable columns and rows in Excel, we use the
PivotTable Field List.
Business Intelligence (BI) systems obtain their data by which of the following means?
Read and process data from an operational database, Process extracts from operational databases, Process data purchased from data vendors
Which of the following is true about data mining applications?
They use sophisticated mathematical techniques, They use sophisticated statistical techniques.
We have obtained access to the company's operational data. We have been asked to produce a report with an item by item analysis of sales, but the only sales figure available is the total sale value for each order. This is an example of:
a "wrong format" problem.
Market basket analysis is:
a data mining technique.
IBM defines big data in terms of the Four V's, in which velocity refers to the:
analysis of streaming data
We have obtained access to the company's operational data. In one record, we find that a customer's age has been recorded as "337." This is an example of:
dirty data
Which of the following is a common unsupervised data mining technique?
cluster analysis
A data warehouse database differs from an operational database because:
data warehouse data are often stored in a dimensional database.
We have done an RFM analysis on our customer data. John Smith has a score of "1 5 5." This means that John:
hasn't ordered recently, but orders a lot when he orders.
We have obtained access to the company's operational data. We examine 50 records for customers with phone numbers that should use the current area code of 345. Of these 50 records, we find 10 that still use an older area code of 567. This is an example of:
inconsistent data
In the MapReduce process, the first step is the ________ step.
map
Market basket analysis is a data mining technique for determining which sets of products customers tend to purchase at the same time.
true
OLAP stands for:
online analytical processing
The "R" in RFM analysis stands for:
recent
Which of the following is a common supervised data mining technique?
regression analysis
A data warehouse is a database system that has data and programs for, as well as personnel specialized in, BI processing.
true
A report that is sent to users on a predetermined schedule is called a push report.
true
Big Data is the name given to the enormous datasets generated by Web 2.0 applications.
true
Business Intelligence (BI) reporting systems are used to filter data, sort data, group data and make simple calculations based on the data.
true
Business Intelligence (BI) systems are information systems that help users analyze and use data.
true
Data mining is the application of mathematical and statistical techniques to find patterns and relationships that can be used to classify and predict.
true
Data warehouses are populated with data prepared by Extract, Transform, and Load (ETL) systems.
true
In a common form of RFM analysis, an RFM score of 5 1 1 means that the customer orders frequently and orders items of high monetary value but has not ordered anything for some time.
true
In a snowflake table, each dimension table is normalized.
true
In market basket analysis, support is defined as the probability that two items will be purchased together.
true
In supervised data mining, statistical techniques are used to estimate the parameters of the model.
true