5370 Quiz 10

¡Supera tus tareas y exámenes ahora con Quizwiz!

When a data repository (including internal and external data) does NOT follow a predefined schema, this is called a:

data lake.

An organization using HDFS realizes that hardware failure is a(n):

norm.

The three 'v's' commonly associated with big data include:

volume, variety, and velocity.

Apache Cassandra is a leading producer of ________ NoSQL database management systems.

wide-column

Economies of storage indicate data storage costs increase every year.

False

MongoDB is a proprietary NoSQL database management system created by Oracle.

False

The dive in anywhere characteristic of a data lake overrides constraints related to confidentiality.

False

The philosophical underpinnings of big data are based on schema on write.

False

The schema on write and schema on read are considered synonymous approaches.

False

Hive uses ________ to query data.

HiveQL

MongoDB databases are composed of:

collections

________ generally processes the largest quantities of data.

Transaction processing

Apache Cassandra is a wide-column NoSQL database management system.

True

Big data databases tend to sacrifice consistency for availability.

True

Collect everything is a characteristic of a data lake.

True

Hive creates MapReduce jobs and executes them on a Hadoop Cluster.

True

MapReduce is an algorithm for massive parallel processing utilized by Hadoop.

True

NoSQL databases DO NOT support ACID (atomicity, consistency, isolation, and durability).

True

NoSQL stands for 'Not only SQL.'

True

The 'schema on read' approach often incorporates JSON or XML.

True

The original three 'v's' attributed to big data include volume, variety, and velocity.

True

_______ includes concern about data quality issues.

Veracity

Hive is a(n) ________ data warehouse software.

Apache

Data in MongoDB is represented in:

BSON

________ is the most popular key-value store NoSQL database management system.

Redis

Big data:

does not require a strictly defined data model.

NoSQL focuses on:

flexibility

Big data includes:

large volumes of data with many different data types that are processed at very high speeds.

The Hadoop framework consists of the ________ algorithm to solve large scale problems.

MapReduce

An organization that decides to adopt the most popular NoSQL database management system would select:

MongoDB.


Conjuntos de estudio relacionados

762: Intro to Autonomic Nervous System ⭐️

View Set

Chapter 5. Georgia Laws, Rules, and Regulations Common to All Lines

View Set

AWS Cloud Practitioner Study Guide

View Set

Chapter 8 APUSH Multiple Choice Questions

View Set

Algebra 4.07: Transform Linear Functions

View Set

Fundamentos Históricos en Ciencias de la Salud Primer Parcial

View Set