Tableau Multiple-Choice Test, 2019 Tableau Specialist Exam, isds 415 ch 3, BD Chap2, Business Intelligence (CH3), Business Analytics Exam 1, CIS 4093 Chapter 3, isds 2001 ch. 2 test bank, Data Science Exam 1

Ace your homework & exams now with Quizwiz!

What is the profitable moving average in November 2013, including four months prior and four months after? $8,553 $8,256 $8,441 $7,501

$8,441.

A Reference Distribution plot cannot be along a continuous axis. True False

False, A Reference Distribution plot can be along a continuous axis.

A Reference Line cannot be added from the Analytics pane. True False

False, A reference line can be added from the analytics pane.

14) In the Starwood Hotels case, up-to-date data and faster reporting helped hotel managers better manage their occupancy rates.

Answer: TRUE

17) The data warehousing maturity model consists of six stages: prenatal, infant, child, teenager, adult, and sage.

Answer: TRUE

2) The "islands of data" problem in the 1980s describes the phenomenon of unconnected data being stored in numerous locations within an organization.

Answer: TRUE

5) One way an operational data store differs from a data warehouse is the recency of their data.

Answer: TRUE

7) Without middleware, different BI programs cannot easily connect to the data warehouse.

Answer: TRUE

3 categories of business analytics

1. descriptive or reporting analytics 2. predictive analytics 3. prescriptive analytics

In which stage of extraction, transformation, and load (ETL) into a data warehouse are data aggregated? A) transformation B) extraction C) load D) cleanse

A

Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are A) subject-oriented and nonvolatile. B) product-oriented and nonvolatile. C) product-oriented and volatile. D) subject-oriented and volatile.

A

The Internet emerged as a new medium for visualization and brought all the following EXCEPT A) new forms of computation of business logic. B) new graphics displays through PC displays. C) worldwide digital distribution of visualization. D) immersive environments for consuming data.

A

What is the management feature of a dashboard? A) operational data that identify what actions to take to resolve a problem B) summarized dimensional data to monitor key performance metrics C) graphical, abstracted data to monitor key performance metrics D) summarized dimensional data to analyze the root cause of problems

A

When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure? A) star schema B) snowflake schema C) relational schema D) dimensional schema

A

Which characteristic of data means that all the required data elements are included in the data set? A) data richness B) data accessibility C) data source reliability D) data granularity

A

Which of the following is LEAST related to data/information visualization? A) graphic artwork B) scientific visualization C) information graphics D) statistical graphics

A

Which type of question does visual analytics seeks to answer? A) Why is it happening? B) When did it happen? C) What happened yesterday? D) What is happening today?

A

Which of the following can you use to create a calculated field that returns data independent of the data granularity in a view?

A FIXED LOD calculation.

Give an example of continuous data

A person can weigh 90 pounds or 90.5 or 90.12 or 90.345 and so on. Continuous data is always numeric.

What is the definition of a data warehouse (DW) in simple terms?

A pool of data produced to support decision making; it is also a repository of current and historical data of potential interest to managers throughout the organization.

34) When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure? A) star schema B) snowflake schema C) relational schema D) dimensional schema

A) star schema

22) Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are A) subject-oriented and nonvolatile. B) product-oriented and nonvolatile. C) product-oriented and volatile. D) subject-oriented and volatile.

A) subject-oriented and nonvolatile.

30) In which stage of extraction, transformation, and load (ETL) into a data warehouse are data aggregated? A) transformation B) extraction C) load D) cleanse

A) transformation

A field that shows average home values for the United States in 2016 is most likely:

An aggregated measure.

Give an example of unstructured data

An open-ended response that someone types on a survey. Not easily searchable with an algorithm.

1) In the Isle of Capri case, the only capability added by the new software was increased processing speed of processing reports.

Answer: FALSE

12) Bill Inmon advocates the data mart bus architecture whereas Ralph Kimball promotes the hub-and-spoke architecture, a data mart bus architecture with conformed dimensions.

Answer: FALSE

13) The ETL process in data warehousing usually takes up a small portion of the time in a data-centric project.

Answer: FALSE

15) Large companies, especially those with revenue upwards of $500 million consistently reap substantial cost savings through the use of hosted data warehouses.

Answer: FALSE

16) OLTP systems are designed to handle ad hoc analysis and complex queries that deal with many data items.

Answer: FALSE

18) A well-designed data warehouse means that user requirements do not have to change as business needs change.

Answer: FALSE

19) Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure.

Answer: FALSE

20) Because the recession has raised interest in low-cost open source software, it is now set to replace traditional enterprise software.

Answer: FALSE

3) Subject oriented databases for data warehousing are organized by detailed subjects such as disk drives, computers, and networks.

Answer: FALSE

4) Data warehouses are subsets of data marts.

Answer: FALSE

6) Organizations seldom devote a lot of effort to creating metadata because it is not important for the effective use of data warehouses.

Answer: FALSE

8) Two-tier data warehouse/BI infrastructures offer organizations more flexibility but cost more than three-tier ones.

Answer: FALSE

9) Moving the data into a data warehouse is usually the easiest part of its creation.

Answer: FALSE

10) The hub-and-spoke data warehouse model uses a centralized warehouse feeding dependent data marts.

Answer: TRUE

11) Because of performance and data quality issues, most experts agree that the federated architecture should supplement data warehouses, not replace them.

Answer: TRUE

How does the use of cloud computing affect the scalability of a data warehouse? A) Cloud computing vendors bring as much hardware as needed to users' offices. B) Hardware resources are dynamically allocated as use increases. C) Cloud vendors are mostly based overseas where the cost of labor is low. D) Cloud computing has little effect on a data warehouse's scalability.

B

The very design that makes an OLTP system efficient for transaction processing makes it inefficient for A) the collection of reputable sources of intelligence. B) end-user ad hoc reports, queries, and analysis. C) transaction processing systems that constantly update operational databases. D) transactions such as ATM withdrawals, where we need to reduce a bank balance accordingly.

B

This plot is a graphical illustration of several descriptive statistics about a given data set. A) kurtosis B) box-and-whiskers plot C) bar graph D) pie chart

B

What type of analytics seeks to determine what is likely to happen in the future? A) descriptive B) predictive C) prescriptive D) domain

B

What type of analytics seeks to recognize what is going on as well as the likely forecast and make decisions to achieve the best performance possible? A) predictive B) prescriptive C) descriptive D) domain

B

Which approach to data warehouse integration focuses more on sharing process functionality than data across systems? A) extraction, transformation, and load B) enterprise application integration C) enterprise information integration D) enterprise function integration

B

Which of the following BEST enables a data warehouse to handle complex queries and scale up to handle many more requests? A) use of the web by users as a front-end B) parallel processing C) Microsoft Windows D) a larger IT staff

B

Which of the following online analytical processing (OLAP) technologies does NOT require the precomputation and storage of information? A) MOLAP B) ROLAP C) HOLAP D) SQL

B

Online ________ is arguably the most commonly used data analysis technique in data warehouses.

analytical processing

39) How does the use of cloud computing affect the scalability of a data warehouse? A) Cloud computing vendors bring as much hardware as needed to users' offices. B) Hardware resources are dynamically allocated as use increases. C) Cloud vendors are mostly based overseas where the cost of labor is low. D) Cloud computing has little effect on a data warehouse's scalability.

B) Hardware resources are dynamically allocated as use increases.

36) Which of the following online analytical processing (OLAP) technologies does NOT require the precomputation and storage of information? A) MOLAP B) ROLAP C) HOLAP D) SQL

B) ROLAP

29) Which approach to data warehouse integration focuses more on sharing process functionality than data across systems? A) extraction, transformation, and load B) enterprise application integration C) enterprise information integration D) enterprise function integration

B) enterprise application integration

26) Which of the following BEST enables a data warehouse to handle complex queries and scale up to handle many more requests? A) use of the web by users as a front-end B) parallel processing C) Microsoft Windows D) a larger IT staff

B) parallel processing

Which of the following is not a Trend Line model Linear Trend Line Exponential Trend Line Binomial Trend Line Logarithmic Trend Line

Binomial Trend Line

A Web client that connects to a Web server, which is in turn connected to a BI application server, is reflective of a A) one tier architecture. B) two tier architecture. C) three tier architecture. D) four tier architecture.

C

Active data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is A) country of (data) origin. B) nature of the data. C) speed of data transfer. D) source of the data.

C

All of the following are benefits of hosted data warehouses EXCEPT A) smaller upfront investment. B) better quality hardware. C) greater control of data. D) frees up in-house systems.

C

Online transaction processing (OLTP) systems handle a company's routine ongoing business. In contrast, a data warehouse is typically A) a repository of actionable intelligence obtained from a data mart. B) an integral subsystem of an online analytical processing (OLAP) system. C) a distinct system that provides storage for data that will be made use of in analysis. D) the end result of BI processes and operations.

C

What has caused the growth of the demand for instant, on-demand access to dispersed information? A) the need to create a database infrastructure that is always online and contains all the information from the OLTP systems B) the fact that BI cannot simply be a technical exercise for the information systems department C) the more pressing need to close the gap between the operational data and strategic objectives D) the increasing divide between users who focus on the strategic level and those who are more oriented to the tactical level

C

When you tell a story in a presentation, all of the following are true EXCEPT A) a story should make sense and order out of a lot of background noise. B) the outcome and reasons for it should be clear at the end of your story. C) a well-told story should have no need for subsequent discussion. D) stories and their lessons should be easy to remember.

C

Which data warehouse architecture uses a normalized relational warehouse that feeds multiple data marts? A) independent data marts architecture B) centralized data warehouse architecture C) hub-and-spoke data warehouse architecture D) federated architecture

C

Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates? A) sectional data mart B) public data mart C) independent data mart D) volatile data mart

C

Which of the following statements is more descriptive of active data warehouses in contrast with traditional data warehouses? A) strategic decisions whose impacts are hard to measure B) detailed data available for strategic use only C) large numbers of users, including operational staffs D) restrictive reporting with daily and weekly data currency

C

33) All of the following are benefits of hosted data warehouses EXCEPT A) smaller upfront investment. B) better quality hardware. C) greater control of data. D) frees up in-house systems.

C) greater control of data.

23) Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates? A) sectional data mart B) public data mart C) independent data mart D) volatile data mart

C) independent data mart

38) Which of the following statements is more descriptive of active data warehouses in contrast with traditional data warehouses? A) strategic decisions whose impacts are hard to measure B) detailed data available for strategic use only C) large numbers of users, including operational staffs D) restrictive reporting with daily and weekly data currency

C) large numbers of users, including operational staffs

37) Active data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is A) country of (data) origin. B) nature of the data. C) speed of data transfer. D) source of the data.

C) speed of data transfer.

25) A Web client that connects to a Web server, which is in turn connected to a BI application server, is reflective of a A) one tier architecture. B) two tier architecture. C) three tier architecture. D) four tier architecture.

C) three tier architecture.

For creating variable size bins we use _____________. Sets Groups Calculated fields Table Calculations

Calculated fields with a logical statement "<sum> or <avg> to create variable size bins

A good reason to use a bullet graph. Analyzing the trend for a time period Comparing the actual against the target sales Adding data to bins and calculating count measure Displaying the sales growth for a particular year

Comparing the actual against the target sales

All of the following are true about in-database processing technology EXCEPT A) it pushes the algorithms to where the data is. B) it makes the response to queries much faster than conventional databases. C) it is often used for apps like credit card fraud detection and investment risk management. D) it is the same as in-memory storage technology.

D

All of the following statements about metadata are true EXCEPT A) metadata gives context to reported data. B) there may be ethical issues involved in the creation of metadata. C) metadata helps to describe the meaning and structure of data. D) for most organizations, data warehouse metadata are an unnecessary expense.

D

Big Data often involves a form of distributed storage and processing using Hadoop and MapReduce. One reason for this is A) centralized storage creates too many vulnerabilities. B) the "Big" in Big Data necessitates over 10,000 processing nodes. C) Big Data systems have to match the geographical spread of social media. D) the processing power needed for the centralized model would overload a single computer.

D

Contextual metadata for a dashboard includes all the following EXCEPT A) whether the dashboard is presenting "fresh" or "stale" information. B) when the data warehouse was last refreshed. C) whether any high-value transactions that would skew the overall trends were rejected as a part of the loading process. D) which operating system is running the dashboard server software.

D

Dashboards can be presented at all the following levels EXCEPT A) the visual dashboard level. B) the static report level. C) the self-service cube level. D) the visual cube level.

D

Data warehouses provide direct and indirect benefits to using organizations. Which of the following is an indirect benefit of data warehouses? A) better and more timely information B) extensive new analyses performed by users C) simplified access to data D) improved customer service

D

In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected? A) transformation B) extraction C) load D) cleanse

D

The "single version of the truth" embodied in a data warehouse such as Capri Casinos' means all of the following EXCEPT A) decision makers get to see the same results to queries. B) decision makers have the same data available to support their decisions. C) decision makers get to use more dependable data for their decisions. D) decision makers have unfettered access to all data in the warehouse.

D

What is the fundamental challenge of dashboard design? A) ensuring that the organization has the appropriate hardware onsite to support it B) ensuring that the organization has access to the latest Web browsers C) ensuring that users across the organization have access to it D) ensuring that the required information is shown clearly on a single screen

D

When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is A) dice. B) slice. C) roll-up. D) drill down.

D

Which characteristic of data requires that the variables and data values be defined at the lowest (or as low as required) level of detail for the intended use of the data? A) data accessibility B) data source reliability C) data richness D) data granularity

D

Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses? A) independent data marts architecture B) centralized data warehouse architecture C) hub-and-spoke data warehouse architecture D) federated architecture

D

Which of the following statements about Big Data is true? A) MapReduce is a storage filing system. B) Data chunks are stored in different locations on one computer. C) Hadoop is a type of processor used to process Big Data applications. D) Pure Big Data systems do not involve fault tolerance.

D

31) In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected? A) transformation B) extraction C) load D) cleanse

D) cleanse

21) The "single version of the truth" embodied in a data warehouse such as Capri Casinos' means all of the following EXCEPT A) decision makers get to see the same results to queries. B) decision makers have the same data available to support their decisions. C) decision makers get to use more dependable data for their decisions. D) decision makers have unfettered access to all data in the warehouse.

D) decision makers have unfettered access to all data in the warehouse.

35) When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is A) dice. B) slice. C) roll-up. D) drill down.

D) drill down.

27) Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses? A) independent data marts architecture B) centralized data warehouse architecture C) hub-and-spoke data warehouse architecture D) federated architecture

D) federated architecture

28) Which data warehouse architecture uses a normalized relational warehouse that feeds multiple data marts? A) independent data marts architecture B) centralized data warehouse architecture C) hub-and-spoke data warehouse architecture D) federated architecture

D) federated architecture

24) All of the following statements about metadata are true EXCEPT A) metadata gives context to reported data. B) there may be ethical issues involved in the creation of metadata. C) metadata helps to describe the meaning and structure of data. D) for most organizations, data warehouse metadata are an unnecessary expense.

D) for most organizations, data warehouse metadata are an unnecessary expense.

32) Data warehouses provide direct and indirect benefits to using organizations. Which of the following is an indirect benefit of data warehouses? A) better and more timely information B) extensive new analyses performed by users C) simplified access to data D) improved customer service

D) improved customer service

40) All of the following are true about in-database processing technology EXCEPT A) it pushes the algorithms to where the data is. B) it makes the response to queries much faster than conventional databases. C) it is often used for apps like credit card fraud detection and investment risk management. D) it is the same as in-memory storage technology.

D) it is the same as in-memory storage technology.

How do you differentiate dimension and measure on Tableau?

Dimension is categorical and Measure is numerical

________ modeling is a retrieval-based system that supports high-volume query access.

Dimensional

Sets can only be created on: Measures Dimensions

Dimensions, sets can only be created on Dimensions

________ is a mechanism that integrates application functionality and shares functionality (rather than data) across systems, thereby enabling flexibility and reuse.

Enterprise application integration (EAI)

________ is a mechanism for pulling data from source systems to satisfy a request for information. It is an evolving tool space that promises real-time data integration from a variety of sources, such as relational databases, Web services, and multidimensional databases.

Enterprise information integration (EII)

When do you use an extract connection and when do you use a live connection?

Extracts are faster, especially in more complex visualizations with large data sets, filters, calculations. These extracts are snapshots of data optimized for aggregation and loaded into system memory to be quickly recalled for visualization. This is for if hospitals need weekly/monthly trends. Live connection are real time updates. This relies on the database for all queries and not always optimized for fast performance. Your data queries are only as fast as the database itself. This is for if hospitals need real-time updates.

Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure. (T/F)

F

In the Isle of Capri case, the only capability added by the new software was increased processing speed of processing reports. (T/F)

F

A Reference Band cannot be based on two fixed points. False True

False

BI represents a bold new paradigm in which the company's business strategy must be aligned to its business intelligence analysis initiatives.

False

Business intelligence (BI) is a specific term that describes architectures and tools only.

False

Computerized support is only used for organizational decisions that are responses to external pressures, not for taking advantage of opportunities.

False

Dashboards provide visual displays of important information that is consolidated and arranged across several screens to maintain data order.

False

Data is the contextualization of information, that is, information set in context.

False

Data source reliability means that data are correct and are a good match for the analytics problem.

False

Demands for instant, on-demand access to dispersed information decrease as firms successfully integrate BI into their operations.

False

In the Dallas Cowboys case study, the focus was on using data analytics to decide which players would play every week.

False

Information systems that support such transactions as ATM withdrawals, bank deposits, and cash register scans at the grocery store represent transaction processing, a critical branch of BI.

False

Nominal data represent the labels of multiple classes used to divide a variable into specific groups.

False

Successful BI is a tool for the information systems department, but is not exposed to the larger organization.

False

T/F: OLTP databases are optimized for output (querying/asking questions of the data) and data warehouses are optimized for input (getting new or updated data into the database).

False

The data storage component of a business reporting system builds the various reports and hosts them for, or disseminates them to users. It also provides notification, annotation, collaboration, and other services.

False

The growth in hardware, software, and network capacities has had little impact on modern BI innovations.

False

Visual analytics is aimed at answering, "What is it happening?" and is usually associated with business analytics.

False

When telling a story during a presentation, it is best to avoid describing hurdles that your character must overcome, to avoid souring the mood.

False

T/F: Organizations seldom devote a lot of effort to creating metadata because it is not important for the effective use of data warehouses.

False Page Ref: 45-46 Metadata describe the structure of and some meaning about data, thereby contributing to their effective or ineffective use.

T/F: OLTP systems are designed to handle ad hoc analysis and complex queries that deal with many data items.

False Page Ref: 70 OLTP systems focus routine, periodic, narrow reports.

T/F: Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure.

False Page Ref: 82 DWA should be familiar with high-performance software, hardware, and networking technologies.

T/F: Data warehouses are subsets of data marts.

False Page Ref: 43 Data mart is a subset of a data warehouse, typically consisting of a single subject area.

The highlight action in a dashboard is similar to filtering action in a worksheet. TRUE FALSE

False, The highlighting action in dashboard will highlight the selection and related data in other view in a dashboard. It won't filter the selection.

A sheet cannot be used within a story directly. Either sheets should be used within a dashboard, or a dashboard should be used within a story. True False

False. A sheet can be used within a story directly.

T/F: Subject oriented databases for data warehousing are organized by detailed subjects such as disk drives, computers, and networks

False: Page Ref: 42 Subject oriented databases for data warehousing are organized by detailed subjects such as sales, products, or customers, containing only information relevant for decision support.

Discrete Data Example (DRAW)

Graph showing 0 and 1, malignant and benign

How does the use of cloud computing affect the scalability of a data warehouse? A) Cloud computing vendors bring as much hardware as needed to users' offices. B) Cloud vendors are mostly based overseas where the cost of labor is low. C) Cloud computing has little effect on a data warehouse's scalability. D) Hardware resources are dynamically allocated as use increases.

Hardware resources are dynamically allocated as use increases.

You created a group by selecting field labels in a view. How can you remove members from the group?

In the Data pane, right-click the group and select Edit Group.

________ (also called in-database analytics) refers to the integration of the algorithmic extent of data analytics into data warehouse.

In-database processing

____________ refers to architectural-hardware and software-enhancements.

Infrastructure

The ________ Model, also known as the EDW approach (top-down) to data warehouse development

Inmon

How do you identify a continuous field in Tableau? It is identified by a blue pill in the visualization. It is identified by a green pill in a visualization. It is preceded by a # symbol in the data window. When added to the visualization, it produces distinct values.

It is identified by a green pill in a visualization.

The ________ Model, also known as the data mart approach, is a "plan big, build small" approach. A data mart is a subject-oriented or department-oriented data warehouse. It is a scaled-down version of a data warehouse that focuses on the requests of a specific department, such as marketing or sales.

Kimball

The default join type in case of Blended data sources is? Cross Join Inner Join Left outer Join Full outer Join

Left outer Join is the default join type in case of Blended data sources. The primary dataset is considered to be the left table.

By definition, Tableau displays measures over time as a ____________. Bar Line Histogram Scatter Plots

Line, By definition, Tableau displays measures over time as a Lines.

Bins can only be created on: Measures Dimensions

Measures, bins can only be created on Measures

________ describe the structure and meaning of the data, contributing to their effective use.

Metadata

Give an example of Nominal data

Nominal- Someone's hair color. There is no quantitative value.

The icon associated with the field that has been grouped is a ______________. Paper Clip Set Hash Equal To

Paper Clip

The best trend model for your view would be the one with? R-Squared value closest to 1 P-Value more than 1 R-Squared value greater than 1 R-Squared value equal to P-Value

R-Squared value closest to 1 is the best trend model for a view.

Which of the following online analytical processing (OLAP) technologies does NOT require the precomputation and storage of information? A) SQL B) HOLAP C) MOLAP D) ROLAP

ROLAP

Give an example of Ordinal data

Ranking 5 drinks from most flavorful to least flavorful. There is no objective distance between any two points on the subjective scale.

________, or "The Extended ASP Model," is a creative way of deploying information system applications where the provider licenses its applications to customers for use as a service on demand (usually over the Internet)

SaaS (software as a service)

Which of the following is NOT an example of transaction processing? Sales report ATM withdrawal Bank deposit Cash register scans

Sales report

_________ refers to mechanisms for acquisition of data from diverse and dispersed sources.

Sourcing

The aggregation function attr() returns a * when __________________. There is a single value for all rows in the group. It is a null value. There are more than one value in all rows in the group. The data is not present at the desired level..

There are more than one value in all rows in the group.

How are descriptive analytics methods different from the other two types?

They answer the "what-is?" queries, not "what will be?" queries

Computer applications have moved from transaction processing and monitoring activities to problem analysis and solution applications.

True

Data accessibility means that the data are easily and readily obtainable.

True

Data is the main ingredient for any BI, data science, and business analytics initiative.

True

Descriptive statistics is all about describing the sample data on hand.

True

During the early days of analytics, data was often obtained from the domain experts using manual processes to build mathematical or knowledge-based models.

True

Interval data are variables that can be measured on interval scales.

True

Predictive algorithms generally require a flat file with a target variable, so making data analytics ready for prediction means that data sets must be transformed into a flat-file format and made ready for ingestion into those predictive algorithms.

True

Structured data is what data mining algorithms use and can be classified as categorical or numeric.

True

There are basic chart types and specialized chart types. A Gantt chart is a specialized chart type.

True

Traditional BI systems use a large volume of static data that has been extracted, cleansed, and loaded into a data warehouse to produce reports and analyses.

True

Visualization differs from traditional charts and graphs in complexity of data sets and use of multiple dimensions and measures.

True

T/F: One way an operational data store differs from a data warehouse is the recency of their data.

True Page Ref: 43-44 An Operational Data Store (ODS) provides a fairly recent form of customer information file (CIF).

T/F: Because of performance and data quality issues, most experts agree that the federated architecture should supplement data warehouses, not replace them.

True Page Ref: 54 Because of performance and data quality issues, most experts agree that federated approaches work well to supplement data warehouses, not replace them.

Disaggregation returns all records in the underlying data source. True False

True, Disaggregation returns all records in the underlying data source.

Groups can be used in a calculated field. TRUE FALSE

True, Groups can also be used in a calculated field. From Tableau 10.x onwards

It is possible to change the geographic roles of a dimension. True False

True, It is possible to change the geographic role of a dimension.

Is it possible to deploy a URL action on a dashboard object to open a Web Page within a dashboard rather than opening the system's web browser? True, with the use of Tableau Server True, with the use of a Web Page object False, not possible True, requires a plug-in

True, with the use of a Web Page object

The Highlighting action can be disabled for the entire workbook. True False

True. From the toolbar the Highlighting action can be disabled for the entire workbook.

Trend Lines can only be used with numeric or date fields. True False

True. Trend lines can only be used with numeric or date fields.

Interactive elements that you can add to a dashboard for users include ______.

URL actions & filter actions.

What are scatter plots best used for?

Visualizing relationships between numerical variables.

Is it possible to use measures in the same view multiple times (e.g. SUM of the measure and AVG of the measure)? No Yes

Yes, measures can be used multiple times in the same view.

Which of the following is the best reason to use an extract instead of a live connection?

You need to apply an aggregation that takes too long when using a live connection.

Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are A) subject-oriented and nonvolatile. B) product-oriented and nonvolatile. C) product-oriented and volatile. D) subject-oriented and volatile.

a

When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure? A) star schema B) snowflake schema C) relational schema D) dimensional schema

a

What is the definition of a data warehouse (DW) in simple terms?

a pool of data produced to support decision making; it is also a repository of current and historical data of potential interest to managers throughout the organization.

What is the definition of a data mart?

a subset of a data warehouse, typically consisting of a single subject area (e.g., marketing, operations). Whereas a data warehouse combines databases across an entire enterprise, a data mart is usually smaller and focuses on a particular subject or department.

What is a cross-tab?

a text table or a table of numbers

Examples of infrastructure

a) columnar b) real-time DW c) data management technologies and practices d) data warehouse applicances (all in 1 solutions to DW) e) in-memory storage technology (moving the data in the memory for faster processing) f) new database management systems g) advanced analytics

Examples of sourcing

a) web, social media and Big Data b) open source software c) SaaS (software as a service) d) cloud computing

4. Interactive elements that you can add to a dashboard for users include ______. (Select all that apply.) a. URL actions b. edit tooltip options c. filter actions d. hide and unhide all sheet options

a. URL actions c. filter actions

Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are a.) subject-oriented and nonvolatile. b.) product-oriented and nonvolatile. c.) product-oriented and volatile. d.) subject-oriented and volatile.

a.) subject-oriented and nonvolatile. Page Ref: 40 The characteristics of data warehouses are subject oriented, integrated, time variant and nonvolatile.

Which approach to data warehouse integration focuses more on sharing process functionality than data across systems? A) extraction, transformation, and load B) enterprise application integration C) enterprise information integration D) enterprise function integration

b

You created a group by selecting field labels in a view. How can you remove members from the group? a. In the view, right-click the group members you want to remove and select Exclude. b. In the Data pane, right-click the group and select Edit Group. c. In the view, right-click the group members you want to remove and select Format. d. On a color legend, right-click a member you want to remove and select Format legends.

b. In the Data pane, right-click the group and select Edit Group.

A large storage location that can hold vast quantities of data (mostly unstructured) in its native/raw format for future/potential analytics consumption is referred to as a(n) A) extended ASP. B) data cloud. C) data lake. D) relational database.

c

All of the following are benefits of hosted data warehouses EXCEPT A) smaller upfront investment. B) better quality hardware. C) greater control of data. D) frees up in-house systems.

c

Real-time data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is A) country of (data) origin. B) nature of the data. C) speed of data transfer. D) source of the data.

c

Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates? A) sectional data mart B) public data mart C) independent data mart D) volatile data mart

c

Which of the following is the best reason to use an extract instead of a live connection? a. Your data source only supports a live connection via ODBC. b. You need the freshest possible data at all times. c. You need to apply an aggregation that takes too long when using a live connection. d. You need to join tables that are in the data source.

c. You need to apply an aggregation that takes too long when using a live connection.

Active data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is: a.) country of (data) origin. b.) nature of the data. c.) speed of data transfer. d.) source of the data.

c.) speed of data transfer. Page Ref: 77 Real-time data warehousing (RDW), also known as active data warehousing (ADW), is the process of loading and providing data via the data warehouse as they become available.

In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected? A) cleanse B) transformation C) load D) extraction

cleanse

The primary purpose of metadata should be to provide context to the reported data; that is, it provides enriching information that leads to ______________.

creation of knowledge

All of the following are true about in-database processing technology EXCEPT A) it pushes the algorithms to where the data is. B) it makes the response to queries much faster than conventional databases. C) it is often used for apps like credit card fraud detection and investment risk management. D) it is the same as in-memory storage technology.

d

Data warehouses provide direct and indirect benefits to organizations. Which of the following is an indirect benefit of data warehouses? A) better and more timely information B) extensive new analyses performed by users C) simplified access to data D) improved customer service

d

In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected? A) transformation B) extraction C) load D) cleanse

d

Oper marts are created when operational data needs to be analyzed A) linearly. B) in a dashboard. C) unidimensionally. D) multidimensionally.

d

Why is a performance management system superior to a performance measurement system? A) because performance measurement systems are only in their infancy B) because measurement automatically leads to problem solution C) because performance management systems cost more D) because measurement alone has little use without action

d

Which of the following can you use to create a calculated field that returns data independent of the data granularity in a view? a. An INCLUDE LOD calculation b. A table calculation c. A basic calculation d. A FIXED LOD calculation

d. A FIXED LOD calculation

A field that shows average home values for the United States in 2016 is most likely: a. A discrete date part dimension b. A continuous date value dimension c. A geographical dimension d. An aggregated measure

d. An aggregated measure

The three main types of data warehouses are data marts, operational ________, and enterprise data warehouses.

data stores

The role responsible for successful administration and management of a data warehouse is the ________, who should be familiar with high-performance software, hardware, and networking technologies, and also possesses solid business insight.

data warehouse administrator (DWA)

The "single version of the truth" embodied in a data warehouse such as Capri Casinos' means all of the following EXCEPT A) decision makers get to see the same results to queries. B) decision makers get to use more dependable data for their decisions. C) decision makers have unfettered access to all data in the warehouse. D) decision makers have the same data available to support their decisions.

decision makers have unfettered access to all data in the warehouse.

________ Analytics answers questions like "what happened" or "Why did it happen".

descriptive

________ modeling is a retrieval-based system that supports high-volume query access.

dimensional

When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is

drill down

When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is A) slice. B) dice. C) drill down. D) roll-up.

drill down.

In the Michigan State Agencies case, the approach used was a(n) ________ one, instead of developing separate BI/DW platforms for each business area or state agency.

enterprise

Which approach to data warehouse integration focuses more on sharing process functionality than data across systems? A) extraction, transformation, and load B) enterprise function integration C) enterprise information integration D) enterprise application integration

enterprise application integration

Performing extensive ________ to move data to the data warehouse may be a sign of poorly managed data and a fundamental lack of a coherent data management strategy.

extraction, transformation, and load (ETL)

Data warehouse administrators (DWAs) do not need strong business insight since they only handle the technical aspect of the infrastructure

false

Data warehouses are subsets of data marts.

false

Moving the data into a data warehouse is usually the easiest part of its creation.

false

OLTP systems are designed to handle ad hoc analysis and complex queries that deal with many data items.

false

Organizations seldom devote a lot of effort to creating metadata because it is not important for the effective use of data warehouses.

false

The ________ data warehouse architecture involves integrating disparate systems and analytical resources from multiple sources to meet changing needs or business conditions.

federated

Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses?

federated architecture

Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses? A) independent data marts architecture B) hub-and-spoke data warehouse architecture C) centralized data warehouse architecture D) federated architecture

federated architecture

All of the following statements about metadata are true EXCEPT

for most organizations, data warehouse metadata are an unnecessary expense

In answering the question "Which customers are likely to be using fake credit cards?" you are most likely to use which of the following analytic applications?

fraud detection

All of the following are benefits of hosted data warehouses EXCEPT A) greater control of data. B) smaller upfront investment. C) better quality hardware. D) frees up in-house systems.

greater control of data.

A(n) ________ architecture is used to build a scalable and maintainable infrastructure that includes a centralized data warehouse and several dependent data marts.

hub-and-spoke

Data warehouses provide direct and indirect benefits to using organizations. Which of the following is an indirect benefit of data warehouses?

improved customer service

Data warehouses provide direct and indirect benefits to using organizations. Which of the following is an indirect benefit of data warehouses? A) better and more timely information B) extensive new analyses performed by users C) simplified access to data D) improved customer service

improved customer service

Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates?

independent data mart

Which kind of data warehouse is created separately from the enterprise data warehouse by a department and not reliant on it for updates? A) independent data mart B) sectional data mart C) volatile data mart D) public data mart

independent data mart

Data ________ comprises data access, data federation, and change capture.

integration

All of the following are true about in-database processing technology EXCEPT A) it is the same as in-memory storage technology. B) it is often used for apps like credit card fraud detection and investment risk management. C) it makes the response to queries much faster than conventional databases. D) it pushes the algorithms to where the data is.

it is the same as in-memory storage technology.

If a company's strategy is properly aligned with DW and BI initiatives, and if the company's IS organization can be made capable of playing its role in such a project, and if the requisite user community is in place and has the proper motivation, then

it is wise to start BI and establish a BI Competency Center (BICC) within a company

Which of the following statements is more descriptive of active data warehouses in contrast with traditional data warehouses? A) restrictive reporting with daily and weekly data currency B) large numbers of users, including operational staffs C) detailed data available for strategic use only D) strategic decisions whose impacts are hard to measure

large numbers of users, including operational staffs

Real-time data warehousing, also known as active data warehousing (ADW), is the process of _______________ via the data warehouse as they become available.

loading and providing data

mien /mēn/

noun 1. a person's look or manner, especially one of a particular kind indicating their character or mood.

re·bus /ˈrēbəs/

noun 1. a puzzle in which words are represented by combinations of pictures and individual letters; for instance, apex might be represented by a picture of an ape followed by a letter X.

In the Magpie Sensing case study, the automated collection of temperature and humidity data on shipped goods helped with various types of analytics. Which of the following is an example of prescriptive analytics?

optimal temperature setting

In ________ oriented data warehousing, operational databases are tuned to handle transactions that update the database.

product

With ________ data flows, managers can view the current state of their businesses and quickly identify problems.

real-time

More data, coming in faster and requiring immediate conversion into decisions, means that organizations are confronting the need for _____________.

real-time data warehousing (RDW)

Most data warehouses are built using ________ database management systems to control and manage the data.

relational

Given that the size of data warehouses is expanding at an exponential rate, ________ is an important issue.

scalability

A data mart is a subset of a data warehouse, typically consisting of a ___________ whereas a data warehouse _____________ across an entire enterprise.

single subject area or department; combines databases

Give an example of structured data

someone selects yes or no from a drop down menu. Easily searchable with an algorithm.

Active data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is A) nature of the data. B) speed of data transfer. C) country of (data) origin. D) source of the data.

speed of data transfer.

When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure? A) star schema B) snowflake schema C) relational schema D) dimensional schema

star schema

Metadata are data about data. Metadata describe the ________ and some meaning about data, thereby contributing to their effective or ineffective use.

structure of

Big Data often involves a form of distributed storage and processing using Hadoop and MapReduce. One reason for this is

the processing power needed for the centralized model would overload a single computer

A Web client that connects to a Web server, which is in turn connected to a BI application server, is reflective of a A) one tier architecture. B) two tier architecture. C) three tier architecture. D) four tier architecture.

three tier architecture.

Online ________ is a term used for a transaction system that is primarily responsible for capturing and storing data related to day-to-day business functions such as ERP, CRM, SCM, and point of sale.

transaction processing

In which stage of extraction, transformation, and load (ETL) into a data warehouse are data aggregated? A) extraction B) cleanse C) transformation D) load

transformation

One way an operational data store differs from a data warehouse is the recency of their data.

true

With key performance indicators, driver KPIs have a significant effect on outcome KPIs, but the reverse is not necessarily true.

true

Without middleware, different BI programs cannot easily connect to the data warehouse

true

The most common information system architectures that can be used for data warehousing are _______ and _________ but sometimes there is simply ________.

two-tier; three-tier architectures; one tier

major components of the data warehousing process

∙ Data sources ∙ ETL process: data extraction, transformation & loading ∙ EDW and metadata ∙ data marts (if desired) ∙ Middleware tools: enable access to the DW ex. data mining, OLAP, reporting tools, and data visualization tools.

4 characteristics of data warehousing.

∙ Subject oriented ∙ Integrated ∙ Time variant (time series) ∙ Nonvolatile

In the MultiCare case, how was data warehousing able to reduce septicemia mortality rates in MultiCare hospitals?

∙ The Adaptive Data WarehouseTM organized and simplified data from multiple data sources across the continuum of care. It became the single source of truth required to see care improvement opportunities and to measure change, integrated teams consisting of clinicians, technologists, analysts, and quality personnel were essential for accelerating MultiCare's efforts to reduce septicemia mortality. ∙ Together the collaborative effort addressed three key bodies of work-standard of care definition, early identification, and efficient delivery of defined-care standard.


Related study sets

ATI Nursing Concepts Beginning Test

View Set

CSWA-S Multiple Choice Q- Extended Edition

View Set

General Psychology Module 10 Quiz

View Set

BIS-1120 Access 3 Review Training

View Set