CPSC 440 Q&A 7
Descriptive Analytics
Allowing users to dive deeper into the view of data with online analytical processing (OLAP) is an important part of:
Veracity and Value
Although volume, variety, and velocity are considered the initial three v dimensions, two additional Vs of big data were added and include:
Neo4j
An organization that requires a graph database that is highly scalable would select the ________ database management system.
Norm
An organization using HDFS realizes that hardware failure is a(n):
Wide-column
Apache Cassandra is a leading producer of ________ NoSQL database management systems.
Predictive Analytics
Application of statistical and computational methods to predict data events is:
True
Big data databases tend to sacrifice consistency for availability. (T/F)
True
Data in HDFS files cannot be updated. (T/F)
Descriptive, Predictive, Prescriptive
What are the three main types of analytics?
True
YARN, also called MapReduce 2.0, is like a traffic cop that controls the allocation of resources in a system. (T/F)
Pig
________ is an important scripting language to help reduce the complexity of MapReduce.
False
Hadoop is considered a relational database management system. (T/F)
True
Hive creates MapReduce jobs and executes them on a Hadoop Cluster. (T/F)
Apache
Hive is a(n) ________ data warehouse software.
Large number of slaves
It is true that in an HDFS cluster the DataNodes are the:
Single Master Server.
It is true that in an HDFS cluster the NameNode is the:
True
JSON is commonly used in conjunction with the 'document store' NoSQL database model. (T/F)
True
MapReduce is an algorithm for massive parallel processing utilized by Hadoop. (T/F)
Flexibility
NoSQL focuses on:
Sharding
NoSQL systems enable automated ________ to allow distribution of the data among multiple nodes to allow servers to operate independently on the data located on it.
Wide-column Store
The NoSQL model that incorporates 'column families' is called a: