CH 8 BIG DATA, DATA WAREHOUSES, AND BUSINESS INTELLIGENCE SYSTEMS
To format a PivotTable in Excel 2013, we use the: A) Design command tab in the PivotTable Tools contextual command tab. B) Format command tab in the PivotTable Tools contextual command tab. C) Design command tab in the Report Tools contextual command tab. D) Format command tab in the Report Tools contextual command tab. E) Report Tools command tab.
A) Design command tab in the PivotTable Tools contextual command tab.
OLAP stands for: A) Online Analytical Processing. B) Offline Analytical Processing. C) Online Analysis Process. D) Offline Analysis Process. E) Old, Lazy And Particular.
A) Online Analytical Processing.
We have obtained access to the company's operational data. In one record, we find that a customer's age has been recorded as "337." This is an example of: A) dirty data. B) inconsistent data. C) non-integrated data. D) a "wrong format" problem. E) a "too much data" problem.
A) dirty data.
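A small sketch of how the "337" age could be caught automatically. The record layout and the 0 to 120 plausible range are assumptions made for illustration; they do not come from the question.

```python
# Hypothetical customer records; the field names are illustrative only.
records = [
    {"customer_id": 1, "age": 34},
    {"customer_id": 2, "age": 337},   # the impossible value from the question
    {"customer_id": 3, "age": None},  # a missing value, which is a separate problem
]

MIN_AGE, MAX_AGE = 0, 120  # assumed plausible range, not taken from the source


def is_dirty_age(age):
    """Return True when an age is present but outside the plausible range."""
    return age is not None and not (MIN_AGE <= age <= MAX_AGE)


# Collect the ids of records whose age fails the range check.
dirty = [r["customer_id"] for r in records if is_dirty_age(r["age"])]
print(dirty)
```

Missing values are deliberately not flagged here, since the study set lists them as a separate reason operational data are hard to read.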
Star schemas have a ________ at the center of the star. A) fact table B) dimension table C) map table D) reduce table E) None of the above
A) fact table
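To picture the layout, here is a minimal star schema built with Python's built-in sqlite3 module. The table and column names (product_sales, timeline, customer, product) are hypothetical; the point is only that the fact table sits at the center and holds a foreign key to each dimension table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables: the points of the star.
CREATE TABLE timeline (time_id INTEGER PRIMARY KEY, year INTEGER, quarter INTEGER);
CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE product  (product_id INTEGER PRIMARY KEY, name TEXT);

-- Fact table: the center of the star, one foreign key per dimension.
CREATE TABLE product_sales (
    time_id     INTEGER REFERENCES timeline(time_id),
    customer_id INTEGER REFERENCES customer(customer_id),
    product_id  INTEGER REFERENCES product(product_id),
    quantity    INTEGER,
    total       REAL
);

INSERT INTO timeline VALUES (1, 2013, 1), (2, 2013, 2);
INSERT INTO customer VALUES (1, 'Ann'), (2, 'Bob');
INSERT INTO product  VALUES (1, 'Widget'), (2, 'Gadget');
INSERT INTO product_sales VALUES
    (1, 1, 1, 2, 20.0),
    (2, 1, 1, 1, 10.0),
    (2, 2, 2, 3, 45.0);
""")

# Join the fact table to two of its dimensions and total sales per product.
rows = conn.execute("""
    SELECT t.year, p.name, SUM(f.total)
    FROM product_sales AS f
    JOIN timeline AS t ON t.time_id = f.time_id
    JOIN product  AS p ON p.product_id = f.product_id
    GROUP BY t.year, p.name
    ORDER BY p.name
""").fetchall()
print(rows)
```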
When distributed databases create copies of the database on different servers, this is known as: A) replication. B) partitioning. C) disbursing. D) distributed two-phase locking. E) None of the above
A) replication.
Dimensional databases are used to track historical data and therefore must have a: A) time dimension. B) customer dimension. C) sales dimension. D) Either A or B E) Both A and B
A) time dimension.
A Business Intelligence (BI) reporting system that uses extensions to SQL is: A) cluster analysis. B) OLAP. C) regression analysis. D) RFM analysis. E) All of the above
B) OLAP.
Snowflake schemas have normalized: A) fact tables. B) dimension tables. C) map tables. D) reduce tables. E) None of the above
B) dimension tables.
Data warehouses use a(n): A) operational database. B) dimensional database. C) structured storage. D) Either A or B E) All of the above
B) dimensional database.
Slowly changing dimensions are handled by a(n): A) operational database. B) dimensional database. C) structured storage. D) Either A or B E) All of the above
B) dimensional database.
We have obtained access to the company's operational data. We examine 50 records for customers with phone numbers that should use the current area code of 345. Of these 50 records, we find 10 that still use an older area code of 567. This is an example of: A) dirty data. B) inconsistent data. C) non-integrated data. D) a "wrong format" problem. E) a "too much data" problem.
B) inconsistent data.
When distributed databases break the database into sections and store the sections on different servers, this is known as: A) replication. B) partitioning. C) disbursing. D) distributed two-phase locking. E) None of the above
B) partitioning.
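The difference between replication and partitioning can be sketched with plain dictionaries standing in for servers. Everything here (the toy database, the server names, the helper functions) is hypothetical and only illustrates the two ideas; a real distributed DBMS does far more.

```python
import copy

# A toy "database": table name -> list of rows.
database = {
    "customers": [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Bob"}],
    "orders": [{"id": 10, "customer_id": 1}],
}


def replicate(db, server_names):
    """Replication: every server receives a full copy of the database."""
    return {name: copy.deepcopy(db) for name in server_names}


def partition(db, assignment):
    """Partitioning: each table (section) is stored on exactly one server.

    `assignment` maps a table name to the server that stores it.
    """
    servers = {}
    for table, server in assignment.items():
        servers.setdefault(server, {})[table] = copy.deepcopy(db[table])
    return servers


replicas = replicate(database, ["server_a", "server_b"])
partitions = partition(database, {"customers": "server_a", "orders": "server_b"})

print(sorted(replicas["server_b"]))    # table names held by a replica
print(sorted(partitions["server_b"]))  # table names held by a partition
```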
The term for the enormous datasets generated by Web 2.0 applications is ________.
Big Data
A(n) ________ is an information system that helps users analyze and use data.
Business Intelligence (BI) system
Apache Cassandra is an example of a(n) ________ non-relational DBMS. A) Key-Value B) Document C) Column Family D) Graph E) Object-Oriented
C) Column Family
To arrange the PivotTable columns and rows in Excel 2013, we use the: A) PivotTable Control. B) PivotTable dialog box. C) PivotTable Field List. D) PivotTable Wizard. E) PivotTable command tab.
C) PivotTable Field List.
A data warehouse database differs from an operational database because: A) data warehouse data are not stored in tables. B) data warehouse databases do not have metadata. C) data warehouse data are often stored in a dimensional database. D) Both B and C E) All of the above
C) data warehouse data are often stored in a dimensional database.
Hadoop is a(n): A) RDBMS. B) OODBMS. C) distributed file system (DFS). D) print system. E) Web server.
C) distributed file system (DFS).
Business Intelligence (BI) reporting systems can do which of the following operations? A) Filter data B) Group data C) Modify data D) Both A and B E) All of the above
D) Both A and B
Business Intelligence (BI) systems do which of the following? A) Analyze current and past activities B) Predict future events C) Record and process transactions D) Both A and B E) All of the above
D) Both A and B
Which of the following is true about data mining applications? A) They use sophisticated mathematical techniques. B) They use sophisticated statistical techniques. C) Their report delivery is more difficult than report delivery for reporting systems. D) Both A and B E) All of the above
D) Both A and B
The term drill down means the user wants to: A) summarize data. B) get older data. C) sort data. D) get more details. E) aggregate data.
D) get more details.
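Drilling down can be sketched as the same totals computed at a finer grouping level. The sales rows and the totals_by helper below are made up for illustration.

```python
from collections import defaultdict

# Hypothetical sales rows.
sales = [
    {"year": 2013, "quarter": 1, "total": 100.0},
    {"year": 2013, "quarter": 1, "total": 50.0},
    {"year": 2013, "quarter": 2, "total": 75.0},
    {"year": 2014, "quarter": 1, "total": 20.0},
]


def totals_by(rows, *levels):
    """Sum `total` for each distinct combination of the given grouping levels."""
    out = defaultdict(float)
    for row in rows:
        key = tuple(row[level] for level in levels)
        out[key] += row["total"]
    return dict(out)


by_year = totals_by(sales, "year")                # summary view
by_quarter = totals_by(sales, "year", "quarter")  # drilled down one level

print(by_year)
print(by_quarter)
```

Adding "quarter" to the grouping breaks each yearly total into smaller units, which is the sense of "get more details" in the answer above.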
In the MapReduce process, the first step is the ________ step. A) column B) super column C) column family D) map E) reduce
D) map
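The ordering of the two steps can be seen in a single-process word count, a common teaching example for MapReduce. This is only a sketch of the idea: the function names are hypothetical, and a real framework would run the map and reduce work across many machines.

```python
from collections import defaultdict


def map_step(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def reduce_step(pairs):
    """Reduce: combine all pairs that share a key by summing their values."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


documents = ["big data big tables", "data warehouse"]

# The map step runs first, once per document.
mapped = [pair for doc in documents for pair in map_step(doc)]

# The reduce step runs second, over the combined map output.
counts = reduce_step(mapped)
print(counts)
```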
A data mart differs from a data warehouse in that: A) it has a smaller database. B) it deals with a particular component or functional area of the business. C) data mart users do not have the data management expertise of data warehouse employees. D) Both A and B E) All of the above
E) All of the above
Business Intelligence (BI) systems obtain their data by which of the following means? A) Read and process data from an operational database B) Process extracts from operational databases C) Process data purchased from data vendors D) Both A and B E) All of the above
E) All of the above
Data mining applications are used to accomplish which of the following tasks? A) Perform what-if analysis B) Make predictions C) Facilitate decision making D) Both A and B E) All of the above
E) All of the above
Non-relational DBMSs associated with the NoSQL movement include: A) Key-Value. B) Document. C) Column Family. D) Graph. E) All of the above
E) All of the above
Which of the following is a reason that operational data are difficult to read? A) Dirty data B) Missing values C) Non-integrated data D) Both A and B E) All of the above
E) All of the above
Business Intelligence (BI) systems fall into which of the following categories? A) Processing B) Reporting C) Data mining D) Both A and B E) Both B and C
E) Both B and C
Data warehouses are populated with data prepared by ________ programs.
Extract, Transform and Load (ETL)
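A toy ETL pass might look like the following. The source rows, the cleaning rules, and the sales_fact table are all assumptions chosen for illustration; real ETL programs are considerably more involved.

```python
import sqlite3

# Hypothetical rows pulled from an operational system: (customer, amount as text).
operational_rows = [("  Ann ", "12.50"), ("Bob", None), ("Cy", "7")]


def extract(rows):
    """Extract: read the raw rows from the source."""
    return list(rows)


def transform(rows):
    """Transform: trim names, drop rows with a missing amount, convert text to float."""
    return [(name.strip(), float(amount)) for name, amount in rows if amount is not None]


def load(conn, rows):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales_fact (customer TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales_fact VALUES (?, ?)", rows)
    conn.commit()


warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract(operational_rows)))

loaded = warehouse.execute(
    "SELECT customer, amount FROM sales_fact ORDER BY customer"
).fetchall()
print(loaded)
```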
A star schema resembles a star, with a dimension table at the center and fact tables radiating out from the center.
FALSE
BI reporting systems summarize the current status of business activities and compare that status with past events but not with predicted future activities.
FALSE
Business Intelligence (BI) systems support operational activities.
FALSE
Column family databases are indistinguishable from relational database tables.
FALSE
In the MapReduce process, the Reduce step is followed by the Map step.
FALSE
Operational databases contain a fact table.
FALSE
Operational databases store historical data.
FALSE
________ provides the ability to sum, count, average and perform other simple arithmetic operations on groups of data.
OLAP
OLAP reports can be created using the Excel 2013 ________ tool.
PivotTable
Business Intelligence (BI) systems obtain data in ________ different ways.
three
A data mart is a collection of data that addresses a particular component or functional area of a business.
TRUE
A data warehouse is a database system that has data and programs for, as well as personnel specialized in, BI processing.
TRUE
Apache Cassandra is an example of a column family database.
TRUE
Big Data is the name given to the enormous datasets generated by Web 2.0 applications.
TRUE
Business Intelligence (BI) reporting systems are used to filter data, sort data, group data and make simple calculations based on the data.
TRUE
Business Intelligence (BI) reporting systems can analyze data using standard SQL.
TRUE
Business Intelligence (BI) systems are information systems that help users analyze and use data.
TRUE
Business Intelligence (BI) systems obtain data in three different ways.
TRUE
Data Mining uses sophisticated statistical and mathematical techniques to perform what-if analyses, to make predictions, and to facilitate decision making.
TRUE
Data warehouses are populated with data prepared by Extract, Transform, and Load (ETL) systems.
TRUE
Dimensional databases are used for analytical data processing.
TRUE
Dimensional databases use the star schema.
TRUE
In a snowflake schema, each dimension table is normalized.
TRUE
NoSQL really stands for "Not only SQL."
TRUE
OLAP provides the ability to sum, count, average and perform other simple arithmetic operations on groups of data.
TRUE
Problematic data include so-called "dirty data."
TRUE
Report delivery is more difficult for data mining than it is for reporting systems.
TRUE
The term drill down refers to the capability of seeing the data in smaller and smaller units.
TRUE
Apache Cassandra is an example of a(n) ________ non-relational DBMS.
column family
A(n) ________ is a collection of data that addresses a particular component or functional area of a business.
data mart
Business Intelligence (BI) ________ uses sophisticated statistical and mathematical techniques to perform what-if analyses, to make predictions, and to facilitate decision making.
data mining
In the snowflake schema, the ________ tables are normalized.
dimension
Data warehouse data is often stored in a(n) ________.
dimensional database
Problematic data are called ________.
dirty data
Hadoop is a(n) ________.
distributed file system (DFS)
The term ________ refers to the capability of seeing the data in smaller and smaller units.
drill down
Objects have ________, which are computer programs that perform some task.
methods
Distributed databases can be distributed by partitioning and ________.
replication
Business Intelligence (BI) ________ are used to filter data, sort data, group data and make simple calculations based on the data.
reporting systems
Business Intelligence (BI) systems fall into two broad categories: ________ and ________.
reporting; data mining
To deal with data values that change over time, dimensional databases use a(n) ________.
slowly changing dimension
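One common way to handle a slowly changing dimension is to keep the old row and add a new one with validity dates, often called a "Type 2" approach. The study set does not say which approach the textbook uses, so treat this as one possible sketch; the customer_dim rows and the change_city helper are hypothetical.

```python
from datetime import date

# Hypothetical customer dimension: one row per version of a customer.
customer_dim = [
    {"customer_key": 1, "customer_id": "C1", "city": "Seattle",
     "valid_from": date(2012, 1, 1), "valid_to": None},  # None marks the current row
]


def change_city(dim, customer_id, new_city, change_date):
    """Close the customer's current row and append a new current row."""
    current = next(
        r for r in dim
        if r["customer_id"] == customer_id and r["valid_to"] is None
    )
    current["valid_to"] = change_date
    dim.append({
        "customer_key": max(r["customer_key"] for r in dim) + 1,
        "customer_id": customer_id,
        "city": new_city,
        "valid_from": change_date,
        "valid_to": None,
    })


change_city(customer_dim, "C1", "Portland", date(2013, 6, 1))

for row in customer_dim:
    print(row["customer_key"], row["city"], row["valid_from"], row["valid_to"])
```

Because the old row is kept, facts recorded before the change can still be reported against the city that was correct at the time.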
A variant of the star schema is the ________.
snowflake schema
A dimensional database uses a(n) ________.
star schema
To deal with historical data values, dimensional databases use a(n) ________.
time dimension