CH 8 BIG DATA, DATA WAREHOUSES, AND BUSINESS INTELLIGENCE SYSTEMS

Ace your homework & exams now with Quizwiz!

To format a PivotTable in Excel 2013, we use the: A) Design command tab in the PivotTable Tools contextual command tab. B) Format command tab in the PivotTable Tools contextual command tab. C) Design command tab in the Report Tools contextual command tab. D) Format command tab in the Report Tools contextual command tab. E) Report Tools command tab.

A) Design command tab in the PivotTable Tools contextual command tab.

OLAP stands for: A) Online Analytical Processing. B) Offline Analytical Processing. C) Online Analysis Process. D) Offline Analysis Process. E) Old, Lazy And Particular.

A) Online Analytical Processing.

We have obtained access to the company's operational data. In one record, we find that a customer's age has been recorded as "337." This is an example of: A) dirty data. B) inconsistent data. C) non-integrated data. D) a "wrong format" problem. E) a "too much data" problem.

A) dirty data.

Star schemas have a ________ at the center of the star. A) fact table B) dimension table C) map table D) reduce table E) None of the above

A) fact table

When distributed databases create copies of the database on different servers, this is known as: A) replication. B) partitioning. C) disbursing. D) distributed two-phase locking. E) None of the above

A) replication.

Dimensional databases are used to track historical data and therefore must have a: A) time dimension. B) customer dimension. C) sales dimension. D) Either A or B E) Both A and B

A) time dimension.

A Business Intelligence (BI) reporting system that uses extensions to SQL is: A) cluster analysis. B) OLAP. C) regression analysis. D) RFM analysis. E) All of the above

B) OLAP.

Snowflake schemas have normalized: A) fact tables. B) dimension tables. C) map tables. D) reduce tables. E) None of the above

B) dimension tables.

Data warehouses use a(n): A) operational database. B) dimensional database. C) structured storage. D) Either A or B E) All of the above

B) dimensional database.

Slowly changing dimensions are handled by a(n): A) operational database. B) dimensional database. C) structured storage. D) Either A or B E) All of the above

B) dimensional database.

We have obtained access to the company's operational data. We examine 50 records for customers with phone numbers that should use the current area code of 345. Of these 50 records, we find 10 that still use an older area code of 567. This is an example of: A) dirty data. B) inconsistent data. C) non-integrated data. D) a "wrong format" problem. E) a "too much data" problem.

B) inconsistent data.

When distributed databases break the database into sections and store the sections on different servers, this is known as: A) replication. B) partitioning. C) disbursing. D) distributed two-phase locking. E) None of the above

B) partitioning.

The term for the enormous datasets generated by Web 2.0 applications is ________.

BIG DATA

A(n) ________ is an information system that helps users analyze and use data.

Business Intelligence (BI) system

Apache Cassandra is an example of a(n) ________ non-relational DBMS. A) Key-Value B) Document C) Column Family D) Graph E) Object-Oriented

C) Column Family

To arrange the PivotTable columns and rows in Excel 2013, we use the: A) PivotTable Control. B) PivotTable dialog box. C) PivotTable Field List. D) PivotTable Wizard. E) PivotTable command tab.

C) PivotTable Field List.

A data warehouse database differs from an operational database because: A) data warehouse data are not stored in tables. B) data warehouse databases do not have metadata. C) data warehouse data are often stored in a dimensional database. D) Both B and C E) All of the above

C) data warehouse data are often stored in a dimensional database.

Hadoop is a(n): A) RDMBS. B) OODBMS. C) distributed file system (DFS). D) print system. E) Web server.

C) distributed file system (DFS).

Business Intelligence (BI) reporting systems can do which of the following operations? A) Filter data B) Group data C) Modify data D) Both A and B E) All of the above

D) Both A and B

Business Intelligence (BI) systems do which of the following? A) Analyze current and past activities B) Predict future events C) Record and process transactions D) Both A and B E) All of the above

D) Both A and B

Which of the following is true about data mining applications? A) They use sophisticated mathematical techniques. B) They use sophisticated statistical techniques. C) Their report delivery is more difficult than report delivery for reporting systems. D) Both A and B E) All of the above

D) Both A and B

The term drill down means the user wants to: A) summarize data. B) get older data. C) sort data. D) get more details. E) aggregate data.

D) get more details.

In the MapReduce process, the first step is the ________ step. A) column B) super column C) column family D) map E) reduce

D) map

A data mart differs from a data warehouse in that: A) it has a smaller database. B) it deals with a particular component or functional area of the business. C) data mart users do not have the data management expertise of data warehouse employees. D) Both A and B E) All of the above

E) All of the above

Business Intelligence (BI) systems obtain their data by which of the following means? A) Read and process data from an operational database B) Process extracts from operational databases C) Process data purchased from data vendors D) Both A and B E) All of the above

E) All of the above

Data mining applications are used to accomplish which of the following tasks? A) Perform what-if analysis B) Make predications C) Facilitate decision making D) Both A and B E) All of the above

E) All of the above

Non-relational DBMSs associated with the NoSQL movement include: A) Key-Value. B) Document. C) Column Family. D) Graph. E) All of the above

E) All of the above

Which of the following is a reason that operational data are difficult to read? A) Dirty data B) Missing values C) Non-integrated data D) Both A and B E) All of the above

E) All of the above

Business Intelligence (BI) systems fall into which of the following categories? A) Processing B) Reporting C) Data mining D) Both A and B E) Both B and C

E) Both B and C

Data warehouses are populated with data prepared by ________ programs.

Extract, Transform and Load (ETL)

A star schema resembles a star, with a dimension table at the center and fact tables radiating out from the center.

FALSE

BI reporting systems summarize the current status of business activities and compare that status with past events but not with predicted future activities.

FALSE

Business Intelligence (BI) systems support operational activities.

FALSE

Column family databases are indistinguishable from relational database tables.

FALSE

In the MapReduce process, the Reduce step is followed by the Map step.

FALSE

Operational databases contain a fact table.

FALSE

Operational databases store historical data.

FALSE

________ provides the ability to sum, count, average and perform other simple arithmetic operations on groups of data.

OLAP

OLAP reports can be created using the Excel 2013 ________ tool.

PivotTable

Business Intelligence (BI) systems obtain data in ________ different ways.

THREE

A data mart is a collection of data that addresses a particular component or functional area of a business

TRUE

A data warehouse is a database system that has data and programs for, as well as personnel specialized in, BI processing.

TRUE

Apache Cassandra is an example of a column family database.

TRUE

Big Data is the name given to the enormous datasets generated by Web 2.0 applications.

TRUE

Business Intelligence (BI) reporting systems are used to filter data, sort data, group data and make simple calculations based on the data.

TRUE

Business Intelligence (BI) reporting systems can analyze data using standard SQL.

TRUE

Business Intelligence (BI) systems are information systems that help users analyze and use data.

TRUE

Business Intelligence (BI) systems obtain data in three different ways.

TRUE

Data Mining uses sophisticated statistical and mathematical techniques to perform what-if analyses, to make predictions, and to facilitate decision making.

TRUE

Data warehouses are populated with data prepared by Extract, Transform, and Load (ETL) systems.

TRUE

Dimensional databases are used for analytical data processing.

TRUE

Dimensional databases use the star schema.

TRUE

In a snowflake table, each dimension table is normalized.

TRUE

NoSQL really stands for "Not only SQL."

TRUE

OLAP provides the ability to sum, count, average and perform other simple arithmetic operations on groups of data.

TRUE

Problematic data include so-called "dirty data."

TRUE

Report delivery is more difficult for data mining than it is for reporting systems.

TRUE

The term drill down refers to the capability of seeing the data in smaller and smaller units.

TRUE

Apache Cassandra is an example of a(n) ________ non-relational DBMS.

column family

A(n) ________ is a collection of data that addresses a particular component or functional area of a business.

data mart

Business Intelligence (BI) ________ uses sophisticated statistical and mathematical techniques to perform what-if analyses, to make predictions, and to facilitate decision making.

data mining

In the snowflake schema, the ________ tables are normalized.

dimension

Data warehouse data is often stored in a(n) ________.

dimensional database

Problematic data are called ________.

dirty data

Hadoop is a(n) _______

distributed file system (DFS)

The term ________ refers to the capability of seeing the data in smaller and smaller units.

drill down

Objects have ________, which are computer programs that perform some task.

methods

Distributed databases can be distributed by partitioning and ________

replication

Business Intelligence (BI) ________ are used to filter data, sort data, group data and make simple calculations based on the data.

reporting systems

Business Intelligence (BI) systems fall into two broad categories: ________ and ________.

reporting; data mining

To deal with data values that change over time, dimensional databases use a(n) ________.

slowly changing dimension

A variant of the star schema is the ________.

snowflake schema

A dimensional database uses a(n) ________.

star schema

To deal with historical data values, dimensional databases use a(n) ________.

time dimension


Related study sets

Lesson 19 Dialogue II Planning an Itinerary

View Set

Mastery Level Quiz Qs: Communication (Chapter 8)

View Set

Chapter 6 - Establishing Trust and Building a Relationship

View Set

Business Life Cycle and Expansion

View Set

Mixture of Solving Quadratic and Linear Equations

View Set