DBMS Chapter 11
HDFS is an acronym for Hadoop Distributed File System.
True
HP HAVEn integrates HP technologies with open source big data technologies.
True
Hive creates MapReduce jobs and executes them on a Hadoop Cluster.
True
JSON is commonly used in conjunction with the 'document store' NoSQL database model.
True
NoSQL stands for 'Not only SQL.'
True
Server logs are considered a big data variety data type.
True
Smartphones can produce millions of observations per second, which places them in Business Intelligence and Analytics 3.0.
True
The original three 'v's attributed to big data include volume, variety, and velocity.
True
Big data allows for two different data types (text and numeric).
False
OLAP, ROLAP, and TLAP are tools commonly used to load data into intermediate hypercube structures.
False
Predictive analytics answers the question: "How can we make it happen?"
False
Structured Query Language (SQL) is a set of methodologies, processes, architectures, and technologies that transform raw data into meaningful information.
False
Value (related to the five 'v's of big data) addresses the pursuit of a meaningful goal.
True
YARN, also called MapReduce 2.0, is like a traffic cop that controls the allocation of resources in a system.
True
MongoDB is a proprietary NoSQL database management system created by Oracle.
False
Neo4j is a wide-column NoSQL database management system developed by Oracle.
False
NoSQL focuses on avoidance of replication and minimizing storage space.
False
The 'dive in anywhere' characteristic of a data lake overrides constraints related to confidentiality.
False
The philosophical underpinnings of big data are based on schema on write.
False
The schema on write and schema on read are considered synonymous approaches.
False
The target market for Hadoop is small to medium companies using local area networks.
False
Transaction processing and management reporting tend to fit big data databases better than relational databases.
False
Word processing documents are commonly stored in a 'document store' NoSQL database model.
False
Apache Cassandra is a wide-column NoSQL database management system.
True
Economies of storage indicate data storage costs increase every year.
False
HDFS allows indexing, which specifically gives applications real-time access to the data.
False
HDFS requires a single master server, but does not allow slave servers.
False
Decision Support Systems (DSS) were a precursor to analytics and business intelligence.
True
NameNode in an HDFS cluster represents the single master server.
True
NoSQL databases DO NOT support ACID (atomicity, consistency, isolation, and durability).
True
Using data to predict events is an example of predictive analytics.
True
Big data databases tend to sacrifice consistency for availability.
True
'Collect everything' is a characteristic of a data lake.
True
Hadoop is considered a relational database management system.
False
Hive is an Oracle data warehouse software.
False
Horse, an important scripting language, helps reduce the complexity of MapReduce.
False
The 'schema on read' approach often incorporates JSON or XML.
True
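Example (schema on read): a minimal Python sketch of the idea, assuming raw records are kept as JSON text and a structure is imposed only when they are read. The sample records and the read_with_schema helper are hypothetical and exist only for illustration.

```python
import json

# Raw records are stored as-is; each line may carry different fields.
raw_records = [
    '{"user": "ana", "clicks": 3}',
    '{"user": "ben", "clicks": "7", "referrer": "search"}',
    '{"user": "cy"}',
]

def read_with_schema(line):
    """Apply a schema at read time: pick fields, coerce types, default missing values."""
    record = json.loads(line)
    return {
        "user": str(record["user"]),
        "clicks": int(record.get("clicks", 0)),  # "7" becomes 7, missing becomes 0
    }

rows = [read_with_schema(line) for line in raw_records]
total_clicks = sum(row["clicks"] for row in rows)
print(total_clicks)
```

Nothing was rejected when the records were stored; the extra "referrer" field and the missing "clicks" field are handled only by the reader that needs them.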
Data in HDFS files cannot be updated.
True
A business owner that needs carefully normalized tables would likely need a relational database instead of a NoSQL database.
True
Customers leave clues about their preferences when navigating a company's Web site.
True
DataNodes in an HDFS cluster represent the large number of slaves.
True
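Example (NameNode and DataNodes): a toy Python model of the master/slave split, assuming one object that keeps only metadata and several that keep only block contents. This is not the HDFS API; the ToyCluster class, its tiny block size, and the round-robin placement are hypothetical simplifications.

```python
class ToyCluster:
    """Toy model: one master keeps metadata, many workers keep block contents."""

    def __init__(self, num_datanodes=4, block_size=8, replication=2):
        self.block_size = block_size
        self.replication = replication
        # NameNode role: file name -> list of (block_id, [datanode indexes])
        self.namenode = {}
        # DataNode role: one dict of block_id -> bytes per worker
        self.datanodes = [{} for _ in range(num_datanodes)]

    def write(self, name, data):
        blocks = []
        for offset in range(0, len(data), self.block_size):
            block_id = f"{name}#{offset // self.block_size}"
            chunk = data[offset:offset + self.block_size]
            # Round-robin placement (an assumption; real placement is more involved).
            start = len(blocks)
            targets = [(start + r) % len(self.datanodes)
                       for r in range(self.replication)]
            for node in targets:
                self.datanodes[node][block_id] = chunk
            blocks.append((block_id, targets))
        self.namenode[name] = blocks

    def read(self, name):
        # Ask the master where the blocks live, then fetch each from its first replica.
        return b"".join(self.datanodes[nodes[0]][block_id]
                        for block_id, nodes in self.namenode[name])

cluster = ToyCluster()
cluster.write("log.txt", b"server log line one\n")
print(cluster.namenode["log.txt"])
print(cluster.read("log.txt"))
```

The master never holds file contents, only the map from file to blocks to workers, which is why a single NameNode can coordinate many DataNodes.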
Descriptive analytics answers the question: "What happened yesterday?"
True
Descriptive analytics is the oldest form of analytics.
True
Graph-oriented databases are designed to maintain information regarding the relationships between data items.
True
HBase is a wide-column store database that runs on top of HDFS (modeled after Google's Bigtable).
True
Human intervention is an important part of big data analytics.
True
Many developing countries are using advanced applications of analytics to utilize data collected from mobile devices.
True
MapReduce is an algorithm for massive parallel processing utilized by Hadoop.
True
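Example (MapReduce): a minimal single-machine Python sketch of the map, shuffle, and reduce steps, using word counting as the task. It is not Hadoop code; the function names are hypothetical, and on a real cluster the map and reduce calls would run in parallel on many nodes.

```python
from collections import defaultdict

def map_phase(document):
    """Emit a (word, 1) pair for every word in one input document."""
    return [(word.lower(), 1) for word in document.split()]

def shuffle_phase(pairs):
    """Group all emitted values by key, the step between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(grouped):
    """Sum the values for each key to get a final count per word."""
    return {key: sum(values) for key, values in grouped.items()}

def word_count(documents):
    pairs = []
    for doc in documents:  # each document could be mapped on a separate node
        pairs.extend(map_phase(doc))
    return reduce_phase(shuffle_phase(pairs))

counts = word_count(["big data big cluster", "data lake"])
print(counts)
```

Because each map call looks at only one document and each reduce call at only one key, the work can be split across machines without the pieces needing to talk to each other.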
The three major types of analytics are: descriptive, predictive, and prescriptive.
True