Data Management Final Exam

Ace your homework & exams now with Quizwiz!

Operational and informational systems are generally separated because of which of the following factors?

A data warehouse centralizes data that are scattered throughout disparate operational systems and makes them readily available for decision support applications.

Which of the following factors drives the need for data warehousing?

Businesses need an integrated view of company information

_________ duplicates data across databases.

Data propagation

When reporting and analysis organization of data is determined when the data is used is called a

Schema on read

One characteristic of independent data marts is complexity for end users when they need access to data in separate data marts. This complexity is caused by not only having to access data from separate databases, but also form:

The possibility of a new generation of inconsistent data systems, the data marts themselves.

__________ includes noSQL accommodation of various data types.

Variety

The three V's commonly associated with big data include?

Volume, Variety, and Velocity

When using HDFS, the _____ node creates new files by communicating with the _______ node.

client; name

Apache Cassandra is a leading producer of _____ NoSQL data management systems.

columnar

When we consider data in the data warehouse to be time variant, we mean:

data in the warehouse contain time dimension so that they may be used to study trends and changes

All of the following are ways to consolidate data EXCEPT:

data rollup and integration

A data mart is a

data warehouse that is limited in scope

The level of detail in a fact table determined by the intersection of all components of the primary key, including all foreign keys and other primary key elements, is called the

grain

A dependent data mart:

is filled exclusively from the enterprise data warehouse with reconciled data.

the process if combining data from various sources into a single table or view is called:

joining

Informational and operation data differ in all of the following ways except:

level of detail

Informational systems are designed for all of the following except:

running a business real time

The key discovery that triggered the development of data warehousing was:

the recognition of the differences between transactional and informational systems

________ is an important scripting language to help reduce the complexity of MapReduce.

Pig

Loading data into the warehouse does not involve?

Formatting the Hard Drive.

Conformed dimensions allows users to do the following?

Query across fact tables with consistency.

When using MapReduce, a ______ function takes a collection and data and sorts and filters it into a set key-value pairs.

Map

The hadoop framework consists of the ______ algorithm to solve large scale problems.

MapReduce

An organization that requires graph database that is highly scalable would select the _______ database management system.

Neo4j

An organization using HDFS realizes that the hardware failure is a?

Norm

The process of transforming data from a detailed to a summary level is called?

Normalization


Related study sets

Topic 12: Historical failed attempts at healthcare reform

View Set

Chapter 6 Lipids: Triglycerides, Phospholipids, and Sterols

View Set

HRM 3375 test 2- multiple choice

View Set

Micro-Economics 152-80B Final Exam

View Set

AP Computer Science Midterm Questions

View Set