CIS 3050 Bank 5
________ includes concern about data quality issues. A) Velocity B) Vigilant C) Veracity D) Variety
C
An organization that decides to adopt the most popular NoSQL database management system would select: A) Access. B) MongoDB. C) Neo4j. D) Redis.
B
Apache Cassandra is a leading producer of ________ NoSQL database management systems. A) key-value store B) wide-column C) relational D) graph
B
Big data: A) requires a normalized dataset to 3rd Normal Form. B) does not require a strictly defined data model. C) requires a strictly defined schema. D) requires a normalized dataset to BCNF.
B
Data in MongoDB is represented in: A) JSON. B) BSON. C) CSON. D) SON.
B
Hive uses ________ to query data. A) SQL B) HiveQL C) BeesNest D) Honeyquery
B
________ generally processes the largest quantities of data. A) Operational databases B) Transaction processing C) Big data D) Data marts
B
MongoDB databases are composed of: A) collections. B) tables. C) rowsets. D) columns.
A
The NoSQL model that includes a simple pair of a key and an associated collection of values is called a: A) key-value store. B) document store. C) wide-column store. D) graph database.
A
The primary use of Pig is to: A) transform raw data into a format that is useful for analysis. B) query large databases. C) create large databases. D) create data warehouses.
A
The Hadoop framework consists of the ________ algorithm to solve large scale problems. A) MapSystem B) MapReduce C) MapCluster D) MapComponent
B
The three 'v's' commonly associated with big data include: A) viewable, volume, and variety. B) volume, variety, and velocity. C) verified, variety, and velocity. D) vigilant, viewable, and verified.
B
It is true that in an HDFS cluster the NameNode is the: A) large number of slaves. B) single master server. C) language library. D) business intelligence.
B
________ includes the value of speed in a NoSQL database. A) Velocity B) Vigilant C) Verified D) Variety
A
________ is an important scripting language to help reduce the complexity of MapReduce. A) Pig B) Horse C) Dog D) Cat
A
NoSQL systems allow ________ by incorporating commodity servers that can be easily added to the architectural solution. A) scaling down B) scaling out C) scaling up D) scaling over
B
NoSQL systems enable automated ________ to allow distribution of the data among multiple nodes to allow servers to operate independently on the data located on it. A) sharing B) sharding C) SQL D) mongo
B
When a data repository (including internal and external data) does NOT follow a predefined schema, this is called a: A) data dump. B) data ocean. C) data lake. D) data stream.
C
________ includes NoSQL accommodation of various data types. A) Velocity B) Vigilant C) Verified D) Variety
D
An organization using HDFS realizes that hardware failure is a(n): A) norm. B) irregularity. C) anomaly. D) inconsistency.
A
Big data includes: A) large volumes of data with many different data types that are processed at very high speeds. B) large volumes of data entry with a single data type processed at very high speeds. C) large volumes of entity relationship diagrams (ERD) with many different data types that are processed at very high speeds. D) large volumes of entity relationship diagrams (ERD) with a single data type processed at very high speeds.
A
It is true that in an HDFS cluster the DataNodes are the: A) large number of slaves. B) single master servers. C) language libraries. D) business intelligences.
A
According to your text, NoSQL stands for: A) Numbered SQL. B) No SQL. C) Not Only SQL. D) Numeric Only SQL.
C
An organization that requires a graph database that is highly scalable would select the ________ database management system. A) Access B) Excel Spreadsheet C) Neo4j D) Redis
C
At a basic level, analytics refers to: A) collecting data. B) conducting a needs analysis. C) analysis and interpretation of data. D) normalizing data
C
Big data requires effectively processing: A) a single data type (numeric). B) two data types (text and numeric). C) many data types. D) a single data type (text).
C
NoSQL includes data storage and retrieval: A) based on the relational model. B) based on normalized tables. C) not based on the relational model. D) not based on data
C
Although volume, variety, and velocity are considered the initial three v dimensions, two additional Vs of big data were added and include: A) veracity and verified. B) volume and verified. C) verified and valuable. D) veracity and value
D
An organization that requires a sole focus on performance with the ability for keys to include strings, hashes, lists, and sorted sets would select ________ database management system. A) Access B) Excel Spreadsheet C) Neo4j D) Redis
D
Hive is a(n) ________ data warehouse software. A) Oracle B) Microsoft C) Macintosh D) Apache
D
The NoSQL model that incorporates 'column families' is called a: A) key-value store. B) document store. C) wide-column store. D) column-SQL database.
C
When reporting and analysis organization of the data is determined when the data is used is called a(n): A) entity relationship diagram. B) schema binding. C) schema on read. D) cognitive schema.
C
With HDFS it is less expensive to move the execution of computation to data than to move the: A) data to hardware. B) data to systems analysis. C) data to computation. D) data to processes.
C
NoSQL focuses on: A) avoidance of replication of data. B) minimizing storage space. C) normalized data. D) flexibility.
D
The Hadoop Distributed File System (HDFS) is the foundation of a ________ infrastructure of Hadoop. A) relational database management system B) DBBMS C) Java D) data management
D
The NoSQL model that is specifically designed to maintain information regarding the relationships (often real-world instances of entities) between data items is called a: A) key-value store. B) document store. C) wide-column store. D) graph-oriented database
D
________ is the most popular key-value store NoSQL database management system. A) Access B) Apache Cassandra C) Neo4j D) Redis
D