Data Analytics ch1,2,3,7

Pataasin ang iyong marka sa homework at exams ngayon gamit ang Quizwiz!

Which of the following is an umbrella term that combines architectures, tools, databases, analytical tools, applications, and methodologies?

BI

Why is a performance management system superior to a performance measurement system?

because measurement alone has little use without action

This plot is a graphical illustration of several descriptive statistics about a given data set:

box-and-whiskers plot

Which kind of chart is described as an enhanced version of a scatter plot?

bubble chart

In which stage of extraction, transformation, and load (ETL) into a data warehouse are anomalies detected and corrected?

Cleanse

This technique makes no a priori assumption of whether one variable is dependent on the other(s) and is not concerned with the relationship between variables; instead it gives an estimate on the degree of association between the variables.

correlation

Which characteristic of data requires that the variables and data values be defined at the lowest (or as low as required) level of detail for the intended use of the data?

data granularity

Which characteristic of data means that all the required data elements are included in the data set?

data richness

In the Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse case study, what was the analytic goal?

determine differences in rates of disease in urban and rural populations

When querying a dimensional database, a user went from summarized data to its underlying details. The function that served this purpose is

drill down.

In a network analysis, what connects nodes?

edges

The very design that makes an OLTP system efficient for transaction processing makes it inefficient for

end-user ad hoc reports, queries, and analysis.

What is the fundamental challenge of dashboard design?

ensuring that the required information is shown clearly on a single screen

Which approach to data warehouse integration focuses more on sharing process functionality than data across systems?

enterprise application integration

Which data warehouse architecture uses metadata from existing data warehouses to create a hybrid logical data warehouse comprised of data from the other warehouses?

federated architecture

All of the following are benefits of hosted data warehouses EXCEPT

greater control of data.

Data warehouses provide direct and indirect benefits to organizations. Which of the following is an indirect benefit of data warehouses?

improved customer service

Key performance indicators (KPIs) are metrics typically used to measure:

internal results.

All of the following are true about in-database processing technology EXCEPT

it is the same as in-memory storage technology.

The Internet emerged as a new medium for visualization and brought all the following EXCEPT:

new forms of computation of business logic.

What is the management feature of a dashboard?

operational data that identify what actions to take to resolve a problem

Which of the following BEST enables a data warehouse to handle complex queries and scale up to handle many more requests?

parallel processing

Which type of visualization tool can be very helpful when the intention is to show relative proportions of dollars per department allocated by a university administration?

pie chart

In a Hadoop "stack," what node periodically replicates and stores data from the Name Node should it fail?

secondary node

Real-time data warehousing can be used to support the highest level of decision making sophistication and power. The major feature that enables this in relation to handling the data is

speed of data transfer

This measure of dispersion is calculated by simply taking the square root of the variations.

standard deviation

When representing data in a data warehouse, using several dimension tables that are each connected only to a fact table means you are using which warehouse structure?

star schema

Operational or transaction databases are product oriented, handling transactions that update the database. In contrast, data warehouses are

subject-oriented and nonvolatile.

What has caused the growth of the demand for instant, on-demand access to dispersed information?

the more pressing need to close the gap between the operational data and strategic objectives

Traditional data warehouses have not been able to keep up with:

the variety and complexity of data

A Web client that connects to a Web server, which is in turn connected to a BI application server, is reflective of a

three-tier architecture.

Under which of the following requirements would it be more appropriate to use Hadoop over a data warehouse?

unrestricted, ungoverned sandbox explorations

What is the Hadoop Distributed File System (HDFS) designed to handle?

unstructured and semistructured non-relational data

Data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage. What is this feature of Big Data called?

variability

A large storage location that can hold vast quantities of data (mostly unstructured) in its native/raw format for future/potential analytics consumption is referred to as a(n)

Data lake

Which of the following is LEAST related to data/information visualization?

Graphic artwork

How does the use of cloud computing affect the scalability of a data warehouse?

Hardware resources are dynamically allocated as use increases.

Allowing Big Data to be processed in memory and distributed across a dedicated set of nodes can solve complex problems in near-real time with highly accurate insights. What is this process called?

In memory analytics

All of the following statements about MapReduce are true EXCEPT:

MapReduce runs without fault tolerance

In the Alternative Data for Market Analysis or Forecasts case study, satellite data was NOT used for

Monitoring Individual customer patterns

What type of analytics seeks to recognize what is going on as well as the likely forecast and make decisions to achieve the best performance possible?

Prescriptive

Which of the following sources is likely to produce Big Data the fastest?

RFID tags

In the financial services industry, Big Data can be used to improve:

Regulatory oversight and decision making

Companies with the largest revenues from Big Data tend to be

The largest computer and IT services firms.

Which type of question does visual analytics seeks to answer?

Why is it happening?

This measure of central tendency is the sum of all the values/observations divided by the number of observations in the data set.

arithmetic mean

A newly popular unit of data in the Big Data era is the petabyte (PB), which is

10^15 bytes

Relational databases began to be used in the:

1980s

________ is an evolving tool space that promises real-time data integration from a variety of sources, such as relational databases, Web services, and multidimensional databases.

Enterprise information integration (EII)

What is Six Sigma?

a methodology aimed at reducing the number of defects in a business process

In a Hadoop "stack," what is a slave node?

a node where data is stored and processed


Kaugnay na mga set ng pag-aaral

EXAM 2 Igneous Rocks, EXAM 2 Sedimentary Rocks, EXAM 2 Metamorphic Rock

View Set

Markt final consumer (need ch 15)

View Set

Bible Unit 5: Esther--A Story of Divine Providence

View Set

DEV LECTURE 21: SOCIO-EMOTIONAL DEVELOPMENT 2 - EMPATHY

View Set